ID ZFHX2_HUMAN Reviewed; 1427 AA. AC Q9C0A1; DT 12-FEB-2003, integrated into UniProtKB/Swiss-Prot. DT 12-FEB-2003, sequence version 2. DT 12-DEC-2006, entry version 40. DE Zinc finger homeobox protein 2. GN Name=ZFHX2; Synonyms=KIAA1762; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RC TISSUE=Brain; RX MEDLINE=21082932; PubMed=11214970; DOI=10.1093/dnares/7.6.347; RA Nagase T., Kikuno R., Hattori A., Kondo Y., Okumura K., Ohara O.; RT "Prediction of the coding sequences of unidentified human genes. XIX. RT The complete sequences of 100 new cDNA clones from brain which code RT for large proteins in vitro."; RL DNA Res. 7:347-355(2000). CC -!- SUBCELLULAR LOCATION: Nucleus (Probable). CC -!- SIMILARITY: Contains 6 C2H2-type zinc fingers. CC -!- SIMILARITY: Contains 3 homeobox DNA-binding domains. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; AB051549; BAB21853.1; ALT_INIT; mRNA. DR HSSP; P50480; 1BW5. DR GermOnline; ENSG00000136367; Homo sapiens. DR Ensembl; ENSG00000136367; Homo sapiens. DR KEGG; hsa:85446; -. DR HGNC; HGNC:20152; ZFHX2. DR ArrayExpress; Q9C0A1; -. DR InterPro; IPR001356; Homeobox. DR InterPro; IPR012287; Homeodomain-rel. DR InterPro; IPR009057; Homeodomain_like. DR InterPro; IPR007087; Znf_C2H2. DR Gene3D; G3DSA:1.10.10.60; Homeodomain-rel; 3. DR Pfam; PF00046; Homeobox; 3. DR Pfam; PF00096; zf-C2H2; 3. DR PRINTS; PR00024; HOMEOBOX. DR ProDom; PD000010; Homeobox; 3. DR SMART; SM00389; HOX; 3. DR SMART; SM00355; ZnF_C2H2; 6. DR PROSITE; PS00027; HOMEOBOX_1; 1. DR PROSITE; PS50071; HOMEOBOX_2; 3. DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 6. DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 4. KW Activator; DNA-binding; Homeobox; Metal-binding; Nuclear protein; KW Repeat; Repressor; Transcription; Transcription regulation; Zinc; KW Zinc-finger. FT CHAIN 1 1427 Zinc finger homeobox protein 2. FT /FTId=PRO_0000047243. FT ZN_FING 46 70 C2H2-type 1. FT ZN_FING 103 127 C2H2-type 2. FT ZN_FING 335 358 C2H2-type 3. FT DNA_BIND 450 509 Homeobox 1. FT ZN_FING 624 646 C2H2-type 4. FT DNA_BIND 712 771 Homeobox 2. FT DNA_BIND 920 979 Homeobox 3. FT ZN_FING 1001 1025 C2H2-type 5. FT ZN_FING 1350 1374 C2H2-type 6. FT COMPBIAS 176 326 Pro-rich. FT COMPBIAS 554 614 Glu-rich. FT COMPBIAS 776 873 Pro-rich. FT COMPBIAS 1048 1279 Pro-rich. SQ SEQUENCE 1427 AA; 152101 MW; 178D3FEA1493AA23 CRC64; MAEEEEGTTG ELRSAEPAPA DSRHPLTYRK TTNFALDKFL DPARPYKCTV CKESFTQKNI LLVHYNSVSH LHKMKKAAID PSAPARGEAG APPTTTAATD KPFKCTVCRV SYNQSSTLEI HMRSVLHQTR SRGTKTDSKI EGPERSQEEP KEGETEGEVG TEKKGPDTSG FISGLPFLSP PPPPLDLHRF PAPLFTPPVL PPFPLVPESL LKLQQQQLLL PFYLHDLKVG PKLTLAGPAP VLSLPAATPP PPPQPPKAEL AEREWERPPM AKEGNEAGPS SPPDPLPNEA ARTAAKALLE NFGFELVIQY NEGKQAVPPP PTPPPPEALG GGDKLACGAC GKLFSNMLIL KTHEEHVHRR FLPFEALSRY AAQFRKSYDS LYPPLAEPPK PPDGSLDSPV PHLGPPFLVP EPEAGGTRAP EERSRAGGHW PIEEEESSRG NLPPLVPAGR RFSRTKFTEF QTQALQSFFE TSAYPKDGEV ERLASLLGLA SRVVVVWFQN ARQKARKNAC EGGSMPTGGG TGGASGCRRC HATFSCVFEL VRHLKKCYDD QTLEEEEEEA ERGEEEEEVE EEEVEEEQGL EPPAGPEGPL PEPPDGEELS QAEATKAGGK EPEEKATPSP SPAHTCDQCA ISFSSQDLLT SHRRLHFLPS LQPSAPPQLL DLPLLVFGER NPLVAATSPM PGPPLKRKHE DGSLSPTGSE AGGGGEGEPP RDKRLRTTIL PEQLEILYRW YMQDSNPTRK MLDCISEEVG LKKRVVQVWF QNTRARERKG QFRSTPGGVP SPAVKPPATA TPASLPKFNL LLGKVDDGTG REAPKREAPA FPYPTATLAS GPQPFLPPGK EATTPTPEPP LPLLPPPPPS EEEGPEEPPK ASPESEACSL SAGDLSDSSA SSLAEPESPG AGGTSGGPGG GTGVPDGMGQ RRYRTQMSSL QLKIMKACYE AYRTPTMQEC EVLGEEIGLP KRVIQVWFQN ARAKEKKAKL QGTAAGSTGG SSEGLLAAQR TDCPYCDVKY DFYVSCRGHL FSRQHLAKLK EAVRAQLKSE SKCYDLAPAP EAPPALKAPP ATTPASMPLG AAPTLPRLAP VLLSGPALAQ PPLGNLAPFN SGPAASSGLL GLATSVLPTT TVVQTAGPGR PLPQRPMPDQ TNTSTAGTTD PVPGPPTEPL GDKVSSERKP VAGPTSSSND ALKNLKALKT TVPALLGGQF LPFPLPPAGG TAPPAVFGPQ LQGAYFQQLY GMKKGLFPMN PMIPQTLIGL LPNALLQPPP QPPEPTATAP PKPPELPAPG EGEAGEVDEL LTGSTGISTV DVTHRYLCRQ CKMAFDGEAP ATAHQRSFCF FGRGSGGSMP PPLRVPICTY HCLACEVLLS GREALASHLR SSAHRRKAAP PQGGPPISIT NAATAASAAV AFAKEEARLP HTDSNPKTTT TSTLLAL //