ID ZFHX2_HUMAN Reviewed; 2572 AA. AC Q9C0A1; Q9UPU6; DT 12-FEB-2003, integrated into UniProtKB/Swiss-Prot. DT 13-JUL-2010, sequence version 3. DT 22-APR-2020, entry version 159. DE RecName: Full=Zinc finger homeobox protein 2; DE AltName: Full=Zinc finger homeodomain protein 2; DE Short=ZFH-2; GN Name=ZFHX2; Synonyms=KIAA1056, KIAA1762, ZNF409; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; OC Homo. OX NCBI_TaxID=9606; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2), AND NUCLEOTIDE SEQUENCE RP [LARGE SCALE MRNA] OF 1113-2572 (ISOFORM 1). RC TISSUE=Brain; RX PubMed=11214970; DOI=10.1093/dnares/7.6.347; RA Nagase T., Kikuno R., Hattori A., Kondo Y., Okumura K., Ohara O.; RT "Prediction of the coding sequences of unidentified human genes. XIX. The RT complete sequences of 100 new cDNA clones from brain which code for large RT proteins in vitro."; RL DNA Res. 7:347-355(2000). RN [2] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=12508121; DOI=10.1038/nature01348; RA Heilig R., Eckenberg R., Petit J.-L., Fonknechten N., Da Silva C., RA Cattolico L., Levy M., Barbe V., De Berardinis V., Ureta-Vidal A., RA Pelletier E., Vico V., Anthouard V., Rowen L., Madan A., Qin S., Sun H., RA Du H., Pepin K., Artiguenave F., Robert C., Cruaud C., Bruels T., RA Jaillon O., Friedlander L., Samson G., Brottier P., Cure S., Segurens B., RA Aniere F., Samain S., Crespeau H., Abbasi N., Aiach N., Boscus D., RA Dickhoff R., Dors M., Dubois I., Friedman C., Gouyvenoux M., James R., RA Madan A., Mairey-Estrada B., Mangenot S., Martins N., Menard M., Oztas S., RA Ratcliffe A., Shaffer T., Trask B., Vacherie B., Bellemere C., Belser C., RA Besnard-Gonnet M., Bartol-Mavel D., Boutard M., Briez-Silla S., RA Combette S., Dufosse-Laurent V., Ferron C., Lechaplais C., Louesse C., RA Muselet D., Magdelenat G., Pateau E., Petit E., Sirvain-Trukniewicz P., RA Trybou A., Vega-Czarny N., Bataille E., Bluet E., Bordelais I., Dubois M., RA Dumont C., Guerin T., Haffray S., Hammadi R., Muanga J., Pellouin V., RA Robert D., Wunderle E., Gauguet G., Roy A., Sainte-Marthe L., Verdier J., RA Verdier-Discala C., Hillier L.W., Fulton L., McPherson J., Matsuda F., RA Wilson R., Scarpelli C., Gyapay G., Wincker P., Saurin W., Quetier F., RA Waterston R., Hood L., Weissenbach J.; RT "The DNA sequence and analysis of human chromosome 14."; RL Nature 421:601-607(2003). RN [3] RP FUNCTION, SUBCELLULAR LOCATION, INVOLVEMENT IN MARSIS, AND VARIANT MARSIS RP LYS-1913. RX PubMed=29253101; DOI=10.1093/brain/awx326; RA Habib A.M., Matsuyama A., Okorokov A.L., Santana-Varela S., Bras J.T., RA Aloisi A.M., Emery E.C., Bogdanov Y.D., Follenfant M., Gossage S.J., RA Gras M., Humphrey J., Kolesnikov A., Le Cann K., Li S., Minett M.S., RA Pereira V., Ponsolles C., Sikandar S., Torres J.M., Yamaoka K., Zhao J., RA Komine Y., Yamamori T., Maniatis N., Panov K.I., Houlden H., Ramirez J.D., RA Bennett D.L.H., Marsili L., Bachiocco V., Wood J.N., Cox J.J.; RT "A novel human pain insensitivity disorder caused by a point mutation in RT ZFHX2."; RL Brain 141:365-376(2018). CC -!- FUNCTION: Transcriptional regulator that is critical for the regulation CC of pain perception and processing of noxious stimuli. CC {ECO:0000269|PubMed:29253101}. CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000269|PubMed:29253101}. CC -!- ALTERNATIVE PRODUCTS: CC Event=Alternative splicing; Named isoforms=2; CC Name=1; CC IsoId=Q9C0A1-1; Sequence=Displayed; CC Name=2; CC IsoId=Q9C0A1-2; Sequence=VSP_039496, VSP_039497; CC -!- DISEASE: Marsili syndrome (MARSIS) [MIM:147430]: An autosomal dominant CC disorder characterized by congenital pain insensitivity. Painless CC cutaneous thermal burns and bone fractures are present in affected CC individuals. Corneal reflex is absent, sweating is decreased or absent. CC Patients have normal cognitive abilities, and display no evidence of CC distal weakness. {ECO:0000269|PubMed:29253101}. Note=The disease may be CC caused by mutations affecting the gene represented in this entry. CC -!- SEQUENCE CAUTION: CC Sequence=BAA83008.2; Type=Erroneous initiation; Note=Extended N-terminus.; Evidence={ECO:0000305}; CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; AB028979; BAA83008.2; ALT_INIT; mRNA. DR EMBL; AB051549; BAB21853.1; -; mRNA. DR EMBL; AL132855; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AL135999; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR CCDS; CCDS55907.1; -. [Q9C0A1-1] DR RefSeq; NP_207646.2; NM_033400.2. [Q9C0A1-1] DR RefSeq; XP_011535547.1; XM_011537245.2. [Q9C0A1-1] DR RefSeq; XP_011535548.1; XM_011537246.2. [Q9C0A1-1] DR RefSeq; XP_011535549.1; XM_011537247.2. [Q9C0A1-1] DR RefSeq; XP_016877203.1; XM_017021714.1. [Q9C0A1-1] DR SMR; Q9C0A1; -. DR BioGrid; 124533; 2. DR STRING; 9606.ENSP00000413418; -. DR iPTMnet; Q9C0A1; -. DR PhosphoSitePlus; Q9C0A1; -. DR BioMuta; ZFHX2; -. DR DMDM; 300669698; -. DR EPD; Q9C0A1; -. DR jPOST; Q9C0A1; -. DR MassIVE; Q9C0A1; -. DR PaxDb; Q9C0A1; -. DR PeptideAtlas; Q9C0A1; -. DR PRIDE; Q9C0A1; -. DR ProteomicsDB; 79976; -. [Q9C0A1-1] DR ProteomicsDB; 79977; -. [Q9C0A1-2] DR Antibodypedia; 51729; 41 antibodies. DR Ensembl; ENST00000419474; ENSP00000413418; ENSG00000136367. [Q9C0A1-1] DR GeneID; 85446; -. DR KEGG; hsa:85446; -. DR UCSC; uc010tno.2; human. [Q9C0A1-1] DR CTD; 85446; -. DR DisGeNET; 85446; -. DR GeneCards; ZFHX2; -. DR HGNC; HGNC:20152; ZFHX2. DR HPA; ENSG00000136367; Tissue enhanced (brain, testis). DR MalaCards; ZFHX2; -. DR MIM; 147430; phenotype. DR MIM; 617828; gene. DR neXtProt; NX_Q9C0A1; -. DR OpenTargets; ENSG00000136367; -. DR eggNOG; KOG1146; Eukaryota. DR eggNOG; ENOG410XVMY; LUCA. DR GeneTree; ENSGT00940000160537; -. DR HOGENOM; CLU_001401_1_0_1; -. DR InParanoid; Q9C0A1; -. DR KO; K09379; -. DR OMA; TVFCCPY; -. DR OrthoDB; 15351at2759; -. DR PhylomeDB; Q9C0A1; -. DR TreeFam; TF323288; -. DR ChiTaRS; ZFHX2; human. DR GenomeRNAi; 85446; -. DR Pharos; Q9C0A1; Tdark. DR PRO; PR:Q9C0A1; -. DR Proteomes; UP000005640; Chromosome 14. DR RNAct; Q9C0A1; protein. DR Bgee; ENSG00000136367; Expressed in buccal mucosa cell and 112 other tissues. DR ExpressionAtlas; Q9C0A1; baseline and differential. DR Genevisible; Q9C0A1; HS. DR GO; GO:0000790; C:nuclear chromatin; ISA:NTNU_SB. DR GO; GO:0005634; C:nucleus; IDA:UniProtKB. DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; ISA:NTNU_SB. DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central. DR GO; GO:0044212; F:transcription regulatory region DNA binding; IBA:GO_Central. DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro. DR GO; GO:0030534; P:adult behavior; IEA:Ensembl. DR GO; GO:0000122; P:negative regulation of transcription by RNA polymerase II; IBA:GO_Central. DR GO; GO:0045893; P:positive regulation of transcription, DNA-templated; IBA:GO_Central. DR GO; GO:0051930; P:regulation of sensory perception of pain; IMP:UniProtKB. DR CDD; cd00086; homeodomain; 3. DR InterPro; IPR009057; Homeobox-like_sf. DR InterPro; IPR017970; Homeobox_CS. DR InterPro; IPR001356; Homeobox_dom. DR InterPro; IPR003604; Matrin/U1-like-C_Znf_C2H2. DR InterPro; IPR027028; ZFHX2. DR InterPro; IPR036236; Znf_C2H2_sf. DR InterPro; IPR013087; Znf_C2H2_type. DR PANTHER; PTHR45891:SF1; PTHR45891:SF1; 1. DR Pfam; PF00046; Homeodomain; 3. DR SMART; SM00389; HOX; 3. DR SMART; SM00355; ZnF_C2H2; 15. DR SMART; SM00451; ZnF_U1; 7. DR SUPFAM; SSF46689; SSF46689; 3. DR SUPFAM; SSF57667; SSF57667; 5. DR PROSITE; PS00027; HOMEOBOX_1; 1. DR PROSITE; PS50071; HOMEOBOX_2; 3. DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 9. DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 5. PE 1: Evidence at protein level; KW Alternative splicing; Disease mutation; DNA-binding; Homeobox; KW Metal-binding; Nucleus; Reference proteome; Repeat; Transcription; KW Transcription regulation; Zinc; Zinc-finger. FT CHAIN 1..2572 FT /note="Zinc finger homeobox protein 2" FT /id="PRO_0000047243" FT ZN_FING 453..476 FT /note="C2H2-type 1" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT ZN_FING 508..532 FT /note="C2H2-type 2" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT ZN_FING 821..845 FT /note="C2H2-type 3" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT ZN_FING 870..894 FT /note="C2H2-type 4" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT ZN_FING 1009..1032 FT /note="C2H2-type 5" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT ZN_FING 1191..1217 FT /note="C2H2-type 6" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT ZN_FING 1248..1272 FT /note="C2H2-type 7" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT ZN_FING 1480..1503 FT /note="C2H2-type 8" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT DNA_BIND 1595..1654 FT /note="Homeobox 1" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108" FT ZN_FING 1670..1696 FT /note="C2H2-type 9; degenerate" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT ZN_FING 1769..1791 FT /note="C2H2-type 10" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT DNA_BIND 1857..1916 FT /note="Homeobox 2" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108" FT DNA_BIND 2065..2124 FT /note="Homeobox 3" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108" FT ZN_FING 2451..2471 FT /note="C2H2-type 11; degenerate" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT ZN_FING 2495..2519 FT /note="C2H2-type 12" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT COMPBIAS 605..710 FT /note="Pro-rich" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00015" FT COMPBIAS 1061..1144 FT /note="Pro-rich" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00015" FT COMPBIAS 1321..1471 FT /note="Pro-rich" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00015" FT COMPBIAS 1699..1759 FT /note="Glu-rich" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00007" FT COMPBIAS 1921..2018 FT /note="Pro-rich" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00015" FT COMPBIAS 2193..2424 FT /note="Pro-rich" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00015" FT VAR_SEQ 854..862 FT /note="FLLDMEGAE -> RTETGLLIK (in isoform 2)" FT /evidence="ECO:0000303|PubMed:11214970" FT /id="VSP_039496" FT VAR_SEQ 863..2572 FT /note="Missing (in isoform 2)" FT /evidence="ECO:0000303|PubMed:11214970" FT /id="VSP_039497" FT VARIANT 1913 FT /note="R -> K (in MARSIS; dbSNP:rs1555344723)" FT /evidence="ECO:0000269|PubMed:29253101" FT /id="VAR_081116" FT CONFLICT 473 FT /note="R -> Q (in Ref. 1; BAA83008)" FT /evidence="ECO:0000305" FT CONFLICT 550 FT /note="P -> T (in Ref. 1; BAA83008)" FT /evidence="ECO:0000305" SQ SEQUENCE 2572 AA; 274176 MW; 239C8050F65B25C2 CRC64; MATLNSASTT GTTPSPGHNA PSLPSDTFSS STPSDPVTKD PPAASSTSEN MRSSEPGGQL LESGCGLVPP KEIGEPQEGP DCGHFPPNDP GVEKDKEQEE EEEGLPPMDL SNHLFFTAGG EAYLVAKLSL PGGSELLLPK GFPWGEAGIK EEPSLPFLAY PPPSHLTALH IQHGFDPIQG FSSSDQILSH DTSAPSPAAC EERHGAFWSY QLAPNPPGDP KDGPMGNSGG NHVAVFWLCL LCRLGFSKPQ AFMDHTQSHG VKLTPAQYQG LSGSPAVLQE GDEGCKALIS FLEPKLPARP SSDIPLDNSS TVNMEANVAQ TEDGPPEAEV QALILLDEEV MALSPPSPPT ATWDPSPTQA KESPVAAGEA GPDWFPEGQE EDGGLCPPLN QSSPTSKEGG TLPAPVGSPE DPSDPPQPYR LADDYTPAPA AFQGLSLSSH MSLLHSRNSC KTLKCPKCNW HYKYQQTLDV HMREKHPESN SHCSYCSAGG AHPRLARGES YNCGYKPYRC DVCNYSTTTK GNLSIHMQSD KHLANLQGFQ AGPGGQGSPP EASLPPSAGD KEPKTKSSWQ CKVCSYETNI SRNLRIHMTS EKHMQNVLML HQGLPLGLPP GLMGPGPPPP PGATPTSPPE LFQYFGPQAL GQPQTPLAGP GLRPDKPLEA QLLLNGFHHV GAPARKFPTS APGSLSPDAH LPPSQLLGSS SDSLPTSPPP DDSLSLKVFR CLVCQAFSTD SLELLLYHCS IGRSLPEAEW KEVAGDTHRC KLCCYGTQLK ANFQLHLKTD KHAQKYQLAA HLREGGGAMG TPSPASLGDG APYGSVSPLH LRCNICDFES NSKEKMQLHA RGAAHEENSQ IYKFLLDMEG AEAGAELGLY HCLLCAWETP SRLAVLQHLR TPAHRDAQAQ RRLQLLQNGP TTEEGLAALQ SILSFSHGQL RTPGKAPVTP LAEPPTPEKD AQNKTEQLAS EETENKTGPS RDSANQTTVY CCPYCSFLSP ESSQVRAHTL SQHAVQPKYR CPLCQEQLVG RPALHFHLSH LHNVVPECVE KLLLVATTVE MTFTTKVLSA PTLSPLDNGQ EPPTHGPEPT PSRDQAAEGP NLTPEASPDP LPEPPLASVE VPDKPSGSPG QPPSPAPSPV PEPDAQAEDV APPPTMAEEE EGTTGELRSA EPAPADSRHP LTYRKTTNFA LDKFLDPARP YKCTVCKESF TQKNILLVHY NSVSHLHKMK KAAIDPSAPA RGEAGAPPTT TAATDKPFKC TVCRVSYNQS STLEIHMRSV LHQTRSRGTK TDSKIEGPER SQEEPKEGET EGEVGTEKKG PDTSGFISGL PFLSPPPPPL DLHRFPAPLF TPPVLPPFPL VPESLLKLQQ QQLLLPFYLH DLKVGPKLTL AGPAPVLSLP AATPPPPPQP PKAELAEREW ERPPMAKEGN EAGPSSPPDP LPNEAARTAA KALLENFGFE LVIQYNEGKQ AVPPPPTPPP PEALGGGDKL ACGACGKLFS NMLILKTHEE HVHRRFLPFE ALSRYAAQFR KSYDSLYPPL AEPPKPPDGS LDSPVPHLGP PFLVPEPEAG GTRAPEERSR AGGHWPIEEE ESSRGNLPPL VPAGRRFSRT KFTEFQTQAL QSFFETSAYP KDGEVERLAS LLGLASRVVV VWFQNARQKA RKNACEGGSM PTGGGTGGAS GCRRCHATFS CVFELVRHLK KCYDDQTLEE EEEEAERGEE EEEVEEEEVE EEQGLEPPAG PEGPLPEPPD GEELSQAEAT KAGGKEPEEK ATPSPSPAHT CDQCAISFSS QDLLTSHRRL HFLPSLQPSA PPQLLDLPLL VFGERNPLVA ATSPMPGPPL KRKHEDGSLS PTGSEAGGGG EGEPPRDKRL RTTILPEQLE ILYRWYMQDS NPTRKMLDCI SEEVGLKKRV VQVWFQNTRA RERKGQFRST PGGVPSPAVK PPATATPASL PKFNLLLGKV DDGTGREAPK REAPAFPYPT ATLASGPQPF LPPGKEATTP TPEPPLPLLP PPPPSEEEGP EEPPKASPES EACSLSAGDL SDSSASSLAE PESPGAGGTS GGPGGGTGVP DGMGQRRYRT QMSSLQLKIM KACYEAYRTP TMQECEVLGE EIGLPKRVIQ VWFQNARAKE KKAKLQGTAA GSTGGSSEGL LAAQRTDCPY CDVKYDFYVS CRGHLFSRQH LAKLKEAVRA QLKSESKCYD LAPAPEAPPA LKAPPATTPA SMPLGAAPTL PRLAPVLLSG PALAQPPLGN LAPFNSGPAA SSGLLGLATS VLPTTTVVQT AGPGRPLPQR PMPDQTNTST AGTTDPVPGP PTEPLGDKVS SERKPVAGPT SSSNDALKNL KALKTTVPAL LGGQFLPFPL PPAGGTAPPA VFGPQLQGAY FQQLYGMKKG LFPMNPMIPQ TLIGLLPNAL LQPPPQPPEP TATAPPKPPE LPAPGEGEAG EVDELLTGST GISTVDVTHR YLCRQCKMAF DGEAPATAHQ RSFCFFGRGS GGSMPPPLRV PICTYHCLAC EVLLSGREAL ASHLRSSAHR RKAAPPQGGP PISITNAATA ASAAVAFAKE EARLPHTDSN PKTTTTSTLL AL //