ID HME2_HUMAN Reviewed; 333 AA. AC P19622; A4D252; Q549U3; Q9UD58; DT 01-FEB-1991, integrated into UniProtKB/Swiss-Prot. DT 11-JUL-2002, sequence version 3. DT 28-JUN-2023, entry version 190. DE RecName: Full=Homeobox protein engrailed-2; DE Short=Homeobox protein en-2; DE Short=Hu-En-2; GN Name=EN2; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; OC Homo. OX NCBI_TaxID=9606; RN [1] RP NUCLEOTIDE SEQUENCE [GENOMIC DNA]. RX PubMed=1363401; DOI=10.1002/dvg.1020130505; RA Logan C., Hanks M.C., Noble-Topham S., Nallainathan D., Provart N.J., RA Joyner A.L.; RT "Cloning and sequence comparison of the mouse, human, and chicken engrailed RT genes reveal potential functional domains and regulatory regions."; RL Dev. Genet. 13:345-358(1992). RN [2] RP SEQUENCE REVISION TO 229. RA Logan C., Hanks M.C., Noble-Topham S., Nallainathan D., Provart N.J., RA Joyner A.L.; RL Submitted (APR-2000) to the EMBL/GenBank/DDBJ databases. RN [3] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=12853948; DOI=10.1038/nature01782; RA Hillier L.W., Fulton R.S., Fulton L.A., Graves T.A., Pepin K.H., RA Wagner-McPherson C., Layman D., Maas J., Jaeger S., Walker R., Wylie K., RA Sekhon M., Becker M.C., O'Laughlin M.D., Schaller M.E., Fewell G.A., RA Delehaunty K.D., Miner T.L., Nash W.E., Cordes M., Du H., Sun H., RA Edwards J., Bradshaw-Cordum H., Ali J., Andrews S., Isak A., Vanbrunt A., RA Nguyen C., Du F., Lamar B., Courtney L., Kalicki J., Ozersky P., RA Bielicki L., Scott K., Holmes A., Harkins R., Harris A., Strong C.M., RA Hou S., Tomlinson C., Dauphin-Kohlberg S., Kozlowicz-Reilly A., Leonard S., RA Rohlfing T., Rock S.M., Tin-Wollam A.-M., Abbott A., Minx P., Maupin R., RA Strowmatt C., Latreille P., Miller N., Johnson D., Murray J., RA Woessner J.P., Wendl M.C., Yang S.-P., Schultz B.R., Wallis J.W., RA Spieth J., Bieri T.A., Nelson J.O., Berkowicz N., Wohldmann P.E., RA Cook L.L., Hickenbotham M.T., Eldred J., Williams D., Bedell J.A., RA Mardis E.R., Clifton S.W., Chissoe S.L., Marra M.A., Raymond C., Haugen E., RA Gillett W., Zhou Y., James R., Phelps K., Iadanoto S., Bubb K., Simms E., RA Levy R., Clendenning J., Kaul R., Kent W.J., Furey T.S., Baertsch R.A., RA Brent M.R., Keibler E., Flicek P., Bork P., Suyama M., Bailey J.A., RA Portnoy M.E., Torrents D., Chinwalla A.T., Gish W.R., Eddy S.R., RA McPherson J.D., Olson M.V., Eichler E.E., Green E.D., Waterston R.H., RA Wilson R.K.; RT "The DNA sequence of human chromosome 7."; RL Nature 424:157-164(2003). RN [4] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=12690205; DOI=10.1126/science.1083423; RA Scherer S.W., Cheung J., MacDonald J.R., Osborne L.R., Nakabayashi K., RA Herbrick J.-A., Carson A.R., Parker-Katiraee L., Skaug J., Khaja R., RA Zhang J., Hudek A.K., Li M., Haddad M., Duggan G.E., Fernandez B.A., RA Kanematsu E., Gentles S., Christopoulos C.C., Choufani S., Kwasnicka D., RA Zheng X.H., Lai Z., Nusskern D.R., Zhang Q., Gu Z., Lu F., Zeesman S., RA Nowaczyk M.J., Teshima I., Chitayat D., Shuman C., Weksberg R., RA Zackai E.H., Grebe T.A., Cox S.R., Kirkpatrick S.J., Rahman N., RA Friedman J.M., Heng H.H.Q., Pelicci P.G., Lo-Coco F., Belloni E., RA Shaffer L.G., Pober B., Morton C.C., Gusella J.F., Bruns G.A.P., Korf B.R., RA Quade B.J., Ligon A.H., Ferguson H., Higgins A.W., Leach N.T., RA Herrick S.R., Lemyre E., Farra C.G., Kim H.-G., Summers A.M., Gripp K.W., RA Roberts W., Szatmari P., Winsor E.J.T., Grzeschik K.-H., Teebi A., RA Minassian B.A., Kere J., Armengol L., Pujana M.A., Estivill X., RA Wilson M.D., Koop B.F., Tosi S., Moore G.E., Boright A.P., Zlotorynski E., RA Kerem B., Kroisel P.M., Petek E., Oscier D.G., Mould S.J., Doehner H., RA Doehner K., Rommens J.M., Vincent J.B., Venter J.C., Li P.W., Mural R.J., RA Adams M.D., Tsui L.-C.; RT "Human chromosome 7: DNA sequence and biology."; RL Science 300:767-772(2003). RN [5] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RC TISSUE=Brain; RX PubMed=15489334; DOI=10.1101/gr.2596504; RG The MGC Project Team; RT "The status, quality, and expansion of the NIH full-length cDNA project: RT the Mammalian Gene Collection (MGC)."; RL Genome Res. 14:2121-2127(2004). RN [6] RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 230-333. RX PubMed=2565873; DOI=10.1016/0888-7543(89)90324-8; RA Poole S.J., Law M.L., Kao F.T., Lau Y.-F.C.; RT "Isolation and chromosomal localization of the human En-2 gene."; RL Genomics 4:225-231(1989). RN [7] RP NUCLEOTIDE SEQUENCE [MRNA] OF 252-290. RC TISSUE=Bone marrow; RX PubMed=7518789; DOI=10.1016/0378-1119(94)90380-8; RA Moretti P., Simmons P., Thomas P., Haylock D., Rathjen P., Vadas M., RA D'Andrea R.; RT "Identification of homeobox genes expressed in human haemopoietic RT progenitor cells."; RL Gene 144:213-219(1994). RN [8] RP POSSIBLE INVOLVEMENT IN SUSCEPTIBILITY TO AUTISM. RX PubMed=16252243; DOI=10.1086/497705; RA Benayed R., Gharani N., Rossman I., Mancuso V., Lazar G., Kamdar S., RA Bruse S.E., Tischfield S., Smith B.J., Zimmerman R.A., DiCicco-Bloom E., RA Brzustowicz L.M., Millonig J.H.; RT "Support for the homeobox transcription factor gene ENGRAILED 2 as an RT autism spectrum disorder susceptibility locus."; RL Am. J. Hum. Genet. 77:851-868(2005). CC -!- SUBCELLULAR LOCATION: Nucleus. CC -!- DISEASE: Note=Genetic variations in EN2 may be associated with CC susceptibility to autism. CC -!- SIMILARITY: Belongs to the engrailed homeobox family. {ECO:0000305}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; L12701; AAA53504.2; -; Genomic_DNA. DR EMBL; L12700; AAA53504.2; JOINED; Genomic_DNA. DR EMBL; AC008060; AAQ96875.1; -; Genomic_DNA. DR EMBL; CH236954; EAL23909.1; -; Genomic_DNA. DR EMBL; BC104970; AAI04971.1; -; mRNA. DR EMBL; BC104972; AAI04973.1; -; mRNA. DR EMBL; J03066; AAF68670.1; -; Genomic_DNA. DR CCDS; CCDS5940.1; -. DR PIR; E48423; E48423. DR RefSeq; NP_001418.2; NM_001427.3. DR AlphaFoldDB; P19622; -. DR BMRB; P19622; -. DR SMR; P19622; -. DR BioGRID; 108335; 6. DR STRING; 9606.ENSP00000297375; -. DR DrugBank; DB03309; N-cyclohexyltaurine. DR GlyGen; P19622; 1 site, 1 O-linked glycan (1 site). DR iPTMnet; P19622; -. DR PhosphoSitePlus; P19622; -. DR BioMuta; EN2; -. DR DMDM; 21903415; -. DR MassIVE; P19622; -. DR PaxDb; P19622; -. DR PeptideAtlas; P19622; -. DR ProteomicsDB; 53681; -. DR Antibodypedia; 18860; 307 antibodies from 30 providers. DR DNASU; 2020; -. DR Ensembl; ENST00000297375.4; ENSP00000297375.4; ENSG00000164778.4. DR GeneID; 2020; -. DR KEGG; hsa:2020; -. DR MANE-Select; ENST00000297375.4; ENSP00000297375.4; NM_001427.4; NP_001418.2. DR UCSC; uc003wmb.3; human. DR AGR; HGNC:3343; -. DR CTD; 2020; -. DR DisGeNET; 2020; -. DR GeneCards; EN2; -. DR HGNC; HGNC:3343; EN2. DR HPA; ENSG00000164778; Tissue enriched (brain). DR MIM; 131310; gene. DR neXtProt; NX_P19622; -. DR OpenTargets; ENSG00000164778; -. DR Orphanet; 106; NON RARE IN EUROPE: Autism. DR PharmGKB; PA27780; -. DR VEuPathDB; HostDB:ENSG00000164778; -. DR eggNOG; KOG0493; Eukaryota. DR GeneTree; ENSGT00940000159935; -. DR HOGENOM; CLU_051739_2_0_1; -. DR InParanoid; P19622; -. DR OMA; GESGDQC; -. DR OrthoDB; 2897502at2759; -. DR PhylomeDB; P19622; -. DR TreeFam; TF106461; -. DR PathwayCommons; P19622; -. DR SignaLink; P19622; -. DR SIGNOR; P19622; -. DR BioGRID-ORCS; 2020; 19 hits in 1176 CRISPR screens. DR ChiTaRS; EN2; human. DR GeneWiki; EN2_(gene); -. DR GenomeRNAi; 2020; -. DR Pharos; P19622; Tbio. DR PRO; PR:P19622; -. DR Proteomes; UP000005640; Chromosome 7. DR RNAct; P19622; protein. DR Bgee; ENSG00000164778; Expressed in cerebellar cortex and 47 other tissues. DR Genevisible; P19622; HS. DR GO; GO:0000785; C:chromatin; ISA:NTNU_SB. DR GO; GO:0001650; C:fibrillar center; IDA:HPA. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0005730; C:nucleolus; IDA:HPA. DR GO; GO:0005654; C:nucleoplasm; IDA:HPA. DR GO; GO:0005634; C:nucleus; IBA:GO_Central. DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; ISA:NTNU_SB. DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central. DR GO; GO:1990837; F:sequence-specific double-stranded DNA binding; IDA:ARUK-UCL. DR GO; GO:0071542; P:dopaminergic neuron differentiation; IEA:Ensembl. DR GO; GO:1990403; P:embryonic brain development; IEA:Ensembl. DR GO; GO:0030902; P:hindbrain development; IEA:Ensembl. DR GO; GO:0030901; P:midbrain development; IEA:Ensembl. DR GO; GO:0043524; P:negative regulation of neuron apoptotic process; IEA:Ensembl. DR GO; GO:0048666; P:neuron development; IEA:Ensembl. DR GO; GO:0030182; P:neuron differentiation; IBA:GO_Central. DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IEA:Ensembl. DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central. DR CDD; cd00086; homeodomain; 1. DR Gene3D; 1.10.10.60; Homeodomain-like; 1. DR InterPro; IPR019549; Homeobox-engrailed_C-terminal. DR InterPro; IPR009057; Homeobox-like_sf. DR InterPro; IPR017970; Homeobox_CS. DR InterPro; IPR001356; Homeobox_dom. DR InterPro; IPR000747; Homeobox_engrailed. DR InterPro; IPR020479; Homeobox_metazoa. DR InterPro; IPR019737; Homoebox-engrailed_CS. DR PANTHER; PTHR24341; HOMEOBOX PROTEIN ENGRAILED; 1. DR PANTHER; PTHR24341:SF5; HOMEOBOX PROTEIN ENGRAILED-2; 1. DR Pfam; PF10525; Engrail_1_C_sig; 1. DR Pfam; PF00046; Homeodomain; 1. DR PRINTS; PR00026; ENGRAILED. DR PRINTS; PR00024; HOMEOBOX. DR SMART; SM00389; HOX; 1. DR SUPFAM; SSF46689; Homeodomain-like; 1. DR PROSITE; PS00033; ENGRAILED; 1. DR PROSITE; PS00027; HOMEOBOX_1; 1. DR PROSITE; PS50071; HOMEOBOX_2; 1. PE 2: Evidence at transcript level; KW Autism; Autism spectrum disorder; Developmental protein; DNA-binding; KW Homeobox; Nucleus; Reference proteome. FT CHAIN 1..333 FT /note="Homeobox protein engrailed-2" FT /id="PRO_0000196067" FT DNA_BIND 244..303 FT /note="Homeobox" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108" FT REGION 1..49 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 95..206 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 223..250 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 192..206 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT VARIANT 121 FT /note="L -> F (in dbSNP:rs3735653)" FT /id="VAR_021985" SQ SEQUENCE 333 AA; 34211 MW; ACF5399E383D6257 CRC64; MEENDPKPGE AAAAVEGQRQ PESSPGGGSG GGGGSSPGEA DTGRRRALML PAVLQAPGNH QHPHRITNFF IDNILRPEFG RRKDAGTCCA GAGGGRGGGA GGEGGASGAE GGGGAGGSEQ LLGSGSREPR QNPPCAPGAG GPLPAAGSDS PGDGEGGSKT LSLHGGAKKG GDPGGPLDGS LKARGLGGGD LSVSSDSDSS QAGANLGAQP MLWPAWVYCT RYSDRPSSGP RSRKPKKKNP NKEDKRPRTA FTAEQLQRLK AEFQTNRYLT EQRRQSLAQE LSLNESQIKI WFQNKRAKIK KATGNKNTLA VHLMAQGLYN HSTTAKEGKS DSE //