ID   F7CR09_HORSE            Unreviewed;       633 AA.
AC   F7CR09;
DT   27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT   27-JUL-2011, sequence version 1.
DT   22-JUL-2015, entry version 25.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSECAP00000017518};
GN   Name=HNF1A {ECO:0000313|Ensembl:ENSECAP00000017518};
OS   Equus caballus (Horse).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX   NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000017518, ECO:0000313|Proteomes:UP000002281};
RN   [1] {ECO:0000313|Ensembl:ENSECAP00000017518, ECO:0000313|Proteomes:UP000002281}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000017518,
RC   ECO:0000313|Proteomes:UP000002281};
RX   PubMed=19892987; DOI=10.1126/science.1178158;
RG   Broad Institute Genome Sequencing Platform;
RG   Broad Institute Whole Genome Assembly Team;
RA   Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA   Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H.,
RA   Distl O., Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N.,
RA   Penedo M.C.T., Raison J.M., Sharpe T., Vogel J., Andersson L.,
RA   Antczak D.F., Biagi T., Binns M.M., Chowdhary B.P., Coleman S.J.,
RA   Della Valle G., Fryc S., Guerin G., Hasegawa T., Hill E.W., Jurka J.,
RA   Kiialainen A., Lindgren G., Liu J., Magnani E., Mickelson J.R.,
RA   Murray J., Nergadze S.G., Onofrio R., Pedroni S., Piras M.F.,
RA   Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A., Searle S., Skow L.,
RA   Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J., Vaudin M.,
RA   White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT   "Genome sequence, comparative analysis, and population genetics of the
RT   domestic horse.";
RL   Science 326:865-867(2009).
RN   [2] {ECO:0000313|Ensembl:ENSECAP00000017518}
RP   IDENTIFICATION.
RC   STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000017518};
RG   Ensembl;
RL   Submitted (JUL-2011) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|RuleBase:RU000682}.
CC   -!- CAUTION: The sequence shown here is derived from an Ensembl
CC       automatic analysis pipeline and should be considered as
CC       preliminary data. {ECO:0000313|Ensembl:ENSECAP00000017518}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   STRING; 9796.ENSECAP00000017518; -.
DR   Ensembl; ENSECAT00000021278; ENSECAP00000017518; ENSECAG00000019954.
DR   GeneTree; ENSGT00730000110937; -.
DR   InParanoid; F7CR09; -.
DR   OMA; QSHVAQS; -.
DR   OrthoDB; EOG769ZJ9; -.
DR   TreeFam; TF320327; -.
DR   Proteomes; UP000002281; Chromosome 8.
DR   GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR   GO; GO:0001750; C:photoreceptor outer segment; IEA:Ensembl.
DR   GO; GO:0045120; C:pronucleus; IEA:Ensembl.
DR   GO; GO:0005667; C:transcription factor complex; IEA:Ensembl.
DR   GO; GO:0000979; F:RNA polymerase II core promoter sequence-specific DNA binding; IEA:Ensembl.
DR   GO; GO:0001228; F:RNA polymerase II transcription regulatory region sequence-specific DNA binding transcription factor activity involved in positive regulation of transcription; IEA:Ensembl.
DR   GO; GO:0003700; F:sequence-specific DNA binding transcription factor activity; IBA:GO_Central.
DR   GO; GO:0015721; P:bile acid and bile salt transport; IEA:Ensembl.
DR   GO; GO:0006699; P:bile acid biosynthetic process; IEA:Ensembl.
DR   GO; GO:0001824; P:blastocyst development; IEA:Ensembl.
DR   GO; GO:0045453; P:bone resorption; IEA:Ensembl.
DR   GO; GO:0008203; P:cholesterol metabolic process; IEA:Ensembl.
DR   GO; GO:0006338; P:chromatin remodeling; IEA:Ensembl.
DR   GO; GO:0030326; P:embryonic limb morphogenesis; IEA:Ensembl.
DR   GO; GO:0031018; P:endocrine pancreas development; IEA:Ensembl.
DR   GO; GO:0006633; P:fatty acid biosynthetic process; IEA:Ensembl.
DR   GO; GO:0015908; P:fatty acid transport; IEA:Ensembl.
DR   GO; GO:0006783; P:heme biosynthetic process; IEA:Ensembl.
DR   GO; GO:0001889; P:liver development; IEA:Ensembl.
DR   GO; GO:0048341; P:paraxial mesoderm formation; IEA:Ensembl.
DR   GO; GO:0001890; P:placenta development; IEA:Ensembl.
DR   GO; GO:0045893; P:positive regulation of transcription, DNA-templated; IBA:GO_Central.
DR   GO; GO:0008104; P:protein localization; IEA:Ensembl.
DR   GO; GO:0050796; P:regulation of insulin secretion; IEA:Ensembl.
DR   GO; GO:0030111; P:regulation of Wnt signaling pathway; IEA:Ensembl.
DR   GO; GO:0009749; P:response to glucose; IEA:Ensembl.
DR   GO; GO:0006979; P:response to oxidative stress; IEA:Ensembl.
DR   GO; GO:0043691; P:reverse cholesterol transport; IEA:Ensembl.
DR   GO; GO:0060395; P:SMAD protein signal transduction; IEA:Ensembl.
DR   Gene3D; 1.10.10.60; -; 1.
DR   Gene3D; 1.10.260.40; -; 1.
DR   InterPro; IPR006899; HNF-1_N.
DR   InterPro; IPR023219; HNF1_dimer_dom.
DR   InterPro; IPR006898; HNF1a_C.
DR   InterPro; IPR006897; HNF1b_C.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR009057; Homeodomain-like.
DR   InterPro; IPR010982; Lambda_DNA-bd_dom.
DR   Pfam; PF04814; HNF-1_N; 1.
DR   Pfam; PF04813; HNF-1A_C; 1.
DR   Pfam; PF04812; HNF-1B_C; 1.
DR   Pfam; PF00046; Homeobox; 1.
DR   SMART; SM00389; HOX; 1.
DR   SUPFAM; SSF100957; SSF100957; 1.
DR   SUPFAM; SSF46689; SSF46689; 1.
DR   SUPFAM; SSF47413; SSF47413; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
PE   4: Predicted;
KW   Complete proteome {ECO:0000313|Proteomes:UP000002281};
KW   DNA-binding {ECO:0000256|RuleBase:RU000682};
KW   Homeobox {ECO:0000256|RuleBase:RU000682};
KW   Nucleus {ECO:0000256|RuleBase:RU000682};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002281}.
SQ   SEQUENCE   633 AA;  67974 MW;  79A4F505853F0788 CRC64;
     MVSKLSQLQT ELLAALLESG LSKEALIQAL GRGSPGPFLP GGTGPLDRRP LWKLRHGEVK
     QVPKATWTVN YILDDDDDNI VPLTEPFLSQ LQNQINIALS LNTLVVSSLL LEDPWRVAKM
     VKSYLQQHNI PQREVVDTTG LNQSHLSQHL NKGTPMKTQK RAALYTWYVR KQREVAQQFT
     HAGQGGLIEE PTGDELPTKK GRRNRFKWGP ASQQILFQAY ERQKNPSKEE REALVEECNR
     AECIQRGVSP SQAQGLGSNL VTEVRVYNWF ANRRKEEAFR HKLAMDTYSG PPPGPGPGPA
     LPAHSSPGLP PPALSPSKVH GVRYGQPATS EGAEVPSSSG GPLVTVSAPL HQVSPTGLEP
     SHSLLSTEAK LVSATGGPLP PVSTLTALHS LEQTSPGLNQ QPQNLIMASL PGVMAIGPGE
     PASLGPTFTN TGASTLVIGL ASTQAQSVPV INSMGSSLTT LQPVQFSQPL HPSYQQPLMP
     PVQSHVAQSP FMATMAQLQS PHALYSHKPE VAQYTHTGLL PQTMLITDTT NLSALASLTP
     TKQVFTTDTE ASSESGLHPP VSQATTIHIP SQDSAGIQHL QPAHRLSTSP TVSSSSLVLY
     QSSDSSNGHS HLLPSNHSVI ETFISTQMAS SSQ
//