ID F7CR09_HORSE Unreviewed; 633 AA. AC F7CR09; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 01-APR-2015, entry version 23. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSECAP00000017518}; GN Name=HNF1A {ECO:0000313|Ensembl:ENSECAP00000017518}; OS Equus caballus (Horse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus. OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000017518, ECO:0000313|Proteomes:UP000002281}; RN [1] {ECO:0000313|Ensembl:ENSECAP00000017518, ECO:0000313|Proteomes:UP000002281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000017518, RC ECO:0000313|Proteomes:UP000002281}; RX PubMed=19892987; DOI=10.1126/science.1178158; RG Broad Institute Genome Sequencing Platform; RG Broad Institute Whole Genome Assembly Team; RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F., RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., RA Distl O., Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., RA Penedo M.C.T., Raison J.M., Sharpe T., Vogel J., Andersson L., RA Antczak D.F., Biagi T., Binns M.M., Chowdhary B.P., Coleman S.J., RA Della Valle G., Fryc S., Guerin G., Hasegawa T., Hill E.W., Jurka J., RA Kiialainen A., Lindgren G., Liu J., Magnani E., Mickelson J.R., RA Murray J., Nergadze S.G., Onofrio R., Pedroni S., Piras M.F., RA Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A., Searle S., Skow L., RA Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J., Vaudin M., RA White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.; RT "Genome sequence, comparative analysis, and population genetics of the RT domestic horse."; RL Science 326:865-867(2009). RN [2] {ECO:0000313|Ensembl:ENSECAP00000017518} RP IDENTIFICATION. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000017518}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|RuleBase:RU000682}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSECAP00000017518}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR Ensembl; ENSECAT00000021278; ENSECAP00000017518; ENSECAG00000019954. DR GeneTree; ENSGT00730000110937; -. DR InParanoid; F7CR09; -. DR OMA; AQSPFMA; -. DR OrthoDB; EOG769ZJ9; -. DR TreeFam; TF320327; -. DR Proteomes; UP000002281; Chromosome 8. DR GO; GO:0005737; C:cytoplasm; IEA:Ensembl. DR GO; GO:0001750; C:photoreceptor outer segment; IEA:Ensembl. DR GO; GO:0045120; C:pronucleus; IEA:Ensembl. DR GO; GO:0005667; C:transcription factor complex; IEA:Ensembl. DR GO; GO:0000979; F:RNA polymerase II core promoter sequence-specific DNA binding; IEA:Ensembl. DR GO; GO:0003700; F:sequence-specific DNA binding transcription factor activity; IEA:Ensembl. DR GO; GO:0015721; P:bile acid and bile salt transport; IEA:Ensembl. DR GO; GO:0006699; P:bile acid biosynthetic process; IEA:Ensembl. DR GO; GO:0001824; P:blastocyst development; IEA:Ensembl. DR GO; GO:0045453; P:bone resorption; IEA:Ensembl. DR GO; GO:0008203; P:cholesterol metabolic process; IEA:Ensembl. DR GO; GO:0006338; P:chromatin remodeling; IEA:Ensembl. DR GO; GO:0030326; P:embryonic limb morphogenesis; IEA:Ensembl. DR GO; GO:0031018; P:endocrine pancreas development; IEA:Ensembl. DR GO; GO:0006633; P:fatty acid biosynthetic process; IEA:Ensembl. DR GO; GO:0015908; P:fatty acid transport; IEA:Ensembl. DR GO; GO:0042593; P:glucose homeostasis; IEA:Ensembl. DR GO; GO:0046323; P:glucose import; IEA:Ensembl. DR GO; GO:0006783; P:heme biosynthetic process; IEA:Ensembl. DR GO; GO:0030073; P:insulin secretion; IEA:Ensembl. DR GO; GO:0001889; P:liver development; IEA:Ensembl. DR GO; GO:0048341; P:paraxial mesoderm formation; IEA:Ensembl. DR GO; GO:0001890; P:placenta development; IEA:Ensembl. DR GO; GO:0045944; P:positive regulation of transcription from RNA polymerase II promoter; IEA:Ensembl. DR GO; GO:0050796; P:regulation of insulin secretion; IEA:Ensembl. DR GO; GO:0030111; P:regulation of Wnt signaling pathway; IEA:Ensembl. DR GO; GO:0035623; P:renal glucose absorption; IEA:Ensembl. DR GO; GO:0009749; P:response to glucose; IEA:Ensembl. DR GO; GO:0006979; P:response to oxidative stress; IEA:Ensembl. DR GO; GO:0043691; P:reverse cholesterol transport; IEA:Ensembl. DR GO; GO:0060395; P:SMAD protein signal transduction; IEA:Ensembl. DR Gene3D; 1.10.10.60; -; 1. DR Gene3D; 1.10.260.40; -; 1. DR InterPro; IPR006899; HNF-1_N. DR InterPro; IPR023219; HNF1_dimer_dom. DR InterPro; IPR006898; HNF1a_C. DR InterPro; IPR006897; HNF1b_C. DR InterPro; IPR001356; Homeobox_dom. DR InterPro; IPR009057; Homeodomain-like. DR InterPro; IPR010982; Lambda_DNA-bd_dom. DR Pfam; PF04814; HNF-1_N; 1. DR Pfam; PF04813; HNF-1A_C; 1. DR Pfam; PF04812; HNF-1B_C; 1. DR Pfam; PF00046; Homeobox; 1. DR SMART; SM00389; HOX; 1. DR SUPFAM; SSF100957; SSF100957; 1. DR SUPFAM; SSF46689; SSF46689; 1. DR SUPFAM; SSF47413; SSF47413; 1. DR PROSITE; PS50071; HOMEOBOX_2; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002281}; KW DNA-binding {ECO:0000256|RuleBase:RU000682}; KW Homeobox {ECO:0000256|RuleBase:RU000682}; KW Nucleus {ECO:0000256|RuleBase:RU000682}; KW Reference proteome {ECO:0000313|Proteomes:UP000002281}. SQ SEQUENCE 633 AA; 67974 MW; 79A4F505853F0788 CRC64; MVSKLSQLQT ELLAALLESG LSKEALIQAL GRGSPGPFLP GGTGPLDRRP LWKLRHGEVK QVPKATWTVN YILDDDDDNI VPLTEPFLSQ LQNQINIALS LNTLVVSSLL LEDPWRVAKM VKSYLQQHNI PQREVVDTTG LNQSHLSQHL NKGTPMKTQK RAALYTWYVR KQREVAQQFT HAGQGGLIEE PTGDELPTKK GRRNRFKWGP ASQQILFQAY ERQKNPSKEE REALVEECNR AECIQRGVSP SQAQGLGSNL VTEVRVYNWF ANRRKEEAFR HKLAMDTYSG PPPGPGPGPA LPAHSSPGLP PPALSPSKVH GVRYGQPATS EGAEVPSSSG GPLVTVSAPL HQVSPTGLEP SHSLLSTEAK LVSATGGPLP PVSTLTALHS LEQTSPGLNQ QPQNLIMASL PGVMAIGPGE PASLGPTFTN TGASTLVIGL ASTQAQSVPV INSMGSSLTT LQPVQFSQPL HPSYQQPLMP PVQSHVAQSP FMATMAQLQS PHALYSHKPE VAQYTHTGLL PQTMLITDTT NLSALASLTP TKQVFTTDTE ASSESGLHPP VSQATTIHIP SQDSAGIQHL QPAHRLSTSP TVSSSSLVLY QSSDSSNGHS HLLPSNHSVI ETFISTQMAS SSQ //