ID F7CR09_HORSE Unreviewed; 633 AA. AC F7CR09; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 21-SEP-2011, entry version 2. DE SubName: Full=Uncharacterized protein; GN Name=HNF1A; OS Equus caballus (Horse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus. OX NCBI_TaxID=9796; RN [1] RP IDENTIFICATION. RC STRAIN=Thoroughbred; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. RN [2] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thoroughbred; RX PubMed=19892987; DOI=10.1126/science.1178158; RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F., RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., RA Distl O., Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., RA Penedo M.C.T., Raison J.M., Sharpe T., Vogel J., Andersson L., RA Antczak D.F., Biagi T., Binns M.M., Chowdhary B.P., Coleman S.J., RA Della Valle G., Fryc S., Guerin G., Hasegawa T., Hill E.W., Jurka J., RA Kiialainen A., Lindgren G., Liu J., Magnani E., Mickelson J.R., RA Murray J., Nergadze S.G., Onofrio R., Pedroni S., Piras M.F., RA Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A., Searle S., Skow L., RA Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J., Vaudin M., RA White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.; RT "Genome sequence, comparative analysis, and population genetics of the RT domestic horse."; RL Science 326:865-867(2009). CC -!- SUBCELLULAR LOCATION: Nucleus (By similarity). CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR Ensembl; ENSECAT00000021278; ENSECAP00000017518; ENSECAG00000019954. DR InterPro; IPR006899; HNF-1_N. DR InterPro; IPR023219; HNF1_dimer_dom. DR InterPro; IPR006898; HNF1a_C. DR InterPro; IPR006897; HNF1b_C. DR InterPro; IPR001356; Homeobox. DR InterPro; IPR009057; Homeodomain-like. DR InterPro; IPR012287; Homeodomain-rel. DR InterPro; IPR010982; Lambda_DNA-bd. DR Gene3D; G3DSA:1.10.260.40; G3DSA:1.10.260.40; 1. DR Gene3D; G3DSA:1.10.10.60; Homeodomain-rel; 1. DR Pfam; PF04814; HNF-1_N; 1. DR Pfam; PF04813; HNF-1A_C; 1. DR Pfam; PF04812; HNF-1B_C; 1. DR Pfam; PF00046; Homeobox; 1. DR SMART; SM00389; HOX; 1. DR SUPFAM; SSF100957; HNF1_dimer_dom; 1. DR SUPFAM; SSF46689; Homeodomain_like; 1. DR SUPFAM; SSF47413; Lambda_like_DNA; 1. DR PROSITE; PS50071; HOMEOBOX_2; 1. PE 3: Inferred from homology; KW Complete proteome; DNA-binding; Homeobox; Nucleus. SQ SEQUENCE 633 AA; 67974 MW; 79A4F505853F0788 CRC64; MVSKLSQLQT ELLAALLESG LSKEALIQAL GRGSPGPFLP GGTGPLDRRP LWKLRHGEVK QVPKATWTVN YILDDDDDNI VPLTEPFLSQ LQNQINIALS LNTLVVSSLL LEDPWRVAKM VKSYLQQHNI PQREVVDTTG LNQSHLSQHL NKGTPMKTQK RAALYTWYVR KQREVAQQFT HAGQGGLIEE PTGDELPTKK GRRNRFKWGP ASQQILFQAY ERQKNPSKEE REALVEECNR AECIQRGVSP SQAQGLGSNL VTEVRVYNWF ANRRKEEAFR HKLAMDTYSG PPPGPGPGPA LPAHSSPGLP PPALSPSKVH GVRYGQPATS EGAEVPSSSG GPLVTVSAPL HQVSPTGLEP SHSLLSTEAK LVSATGGPLP PVSTLTALHS LEQTSPGLNQ QPQNLIMASL PGVMAIGPGE PASLGPTFTN TGASTLVIGL ASTQAQSVPV INSMGSSLTT LQPVQFSQPL HPSYQQPLMP PVQSHVAQSP FMATMAQLQS PHALYSHKPE VAQYTHTGLL PQTMLITDTT NLSALASLTP TKQVFTTDTE ASSESGLHPP VSQATTIHIP SQDSAGIQHL QPAHRLSTSP TVSSSSLVLY QSSDSSNGHS HLLPSNHSVI ETFISTQMAS SSQ //