ID HME2_HUMAN Reviewed; 333 AA. AC P19622; A4D252; Q549U3; Q9UD58; DT 01-FEB-1991, integrated into UniProtKB/Swiss-Prot. DT 11-JUL-2002, sequence version 3. DT 02-DEC-2020, entry version 178. DE RecName: Full=Homeobox protein engrailed-2; DE Short=Homeobox protein en-2; DE Short=Hu-En-2; GN Name=EN2; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; OC Homo. OX NCBI_TaxID=9606; RN [1] RP NUCLEOTIDE SEQUENCE [GENOMIC DNA]. RX PubMed=1363401; DOI=10.1002/dvg.1020130505; RA Logan C., Hanks M.C., Noble-Topham S., Nallainathan D., Provart N.J., RA Joyner A.L.; RT "Cloning and sequence comparison of the mouse, human, and chicken engrailed RT genes reveal potential functional domains and regulatory regions."; RL Dev. Genet. 13:345-358(1992). RN [2] RP SEQUENCE REVISION TO 229. RA Logan C., Hanks M.C., Noble-Topham S., Nallainathan D., Provart N.J., RA Joyner A.L.; RL Submitted (APR-2000) to the EMBL/GenBank/DDBJ databases. RN [3] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=12853948; DOI=10.1038/nature01782; RA Hillier L.W., Fulton R.S., Fulton L.A., Graves T.A., Pepin K.H., RA Wagner-McPherson C., Layman D., Maas J., Jaeger S., Walker R., Wylie K., RA Sekhon M., Becker M.C., O'Laughlin M.D., Schaller M.E., Fewell G.A., RA Delehaunty K.D., Miner T.L., Nash W.E., Cordes M., Du H., Sun H., RA Edwards J., Bradshaw-Cordum H., Ali J., Andrews S., Isak A., Vanbrunt A., RA Nguyen C., Du F., Lamar B., Courtney L., Kalicki J., Ozersky P., RA Bielicki L., Scott K., Holmes A., Harkins R., Harris A., Strong C.M., RA Hou S., Tomlinson C., Dauphin-Kohlberg S., Kozlowicz-Reilly A., Leonard S., RA Rohlfing T., Rock S.M., Tin-Wollam A.-M., Abbott A., Minx P., Maupin R., RA Strowmatt C., Latreille P., Miller N., Johnson D., Murray J., RA Woessner J.P., Wendl M.C., Yang S.-P., Schultz B.R., Wallis J.W., RA Spieth J., Bieri T.A., Nelson J.O., Berkowicz N., Wohldmann P.E., RA Cook L.L., Hickenbotham M.T., Eldred J., Williams D., Bedell J.A., RA Mardis E.R., Clifton S.W., Chissoe S.L., Marra M.A., Raymond C., Haugen E., RA Gillett W., Zhou Y., James R., Phelps K., Iadanoto S., Bubb K., Simms E., RA Levy R., Clendenning J., Kaul R., Kent W.J., Furey T.S., Baertsch R.A., RA Brent M.R., Keibler E., Flicek P., Bork P., Suyama M., Bailey J.A., RA Portnoy M.E., Torrents D., Chinwalla A.T., Gish W.R., Eddy S.R., RA McPherson J.D., Olson M.V., Eichler E.E., Green E.D., Waterston R.H., RA Wilson R.K.; RT "The DNA sequence of human chromosome 7."; RL Nature 424:157-164(2003). RN [4] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=12690205; DOI=10.1126/science.1083423; RA Scherer S.W., Cheung J., MacDonald J.R., Osborne L.R., Nakabayashi K., RA Herbrick J.-A., Carson A.R., Parker-Katiraee L., Skaug J., Khaja R., RA Zhang J., Hudek A.K., Li M., Haddad M., Duggan G.E., Fernandez B.A., RA Kanematsu E., Gentles S., Christopoulos C.C., Choufani S., Kwasnicka D., RA Zheng X.H., Lai Z., Nusskern D.R., Zhang Q., Gu Z., Lu F., Zeesman S., RA Nowaczyk M.J., Teshima I., Chitayat D., Shuman C., Weksberg R., RA Zackai E.H., Grebe T.A., Cox S.R., Kirkpatrick S.J., Rahman N., RA Friedman J.M., Heng H.H.Q., Pelicci P.G., Lo-Coco F., Belloni E., RA Shaffer L.G., Pober B., Morton C.C., Gusella J.F., Bruns G.A.P., Korf B.R., RA Quade B.J., Ligon A.H., Ferguson H., Higgins A.W., Leach N.T., RA Herrick S.R., Lemyre E., Farra C.G., Kim H.-G., Summers A.M., Gripp K.W., RA Roberts W., Szatmari P., Winsor E.J.T., Grzeschik K.-H., Teebi A., RA Minassian B.A., Kere J., Armengol L., Pujana M.A., Estivill X., RA Wilson M.D., Koop B.F., Tosi S., Moore G.E., Boright A.P., Zlotorynski E., RA Kerem B., Kroisel P.M., Petek E., Oscier D.G., Mould S.J., Doehner H., RA Doehner K., Rommens J.M., Vincent J.B., Venter J.C., Li P.W., Mural R.J., RA Adams M.D., Tsui L.-C.; RT "Human chromosome 7: DNA sequence and biology."; RL Science 300:767-772(2003). RN [5] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RC TISSUE=Brain; RX PubMed=15489334; DOI=10.1101/gr.2596504; RG The MGC Project Team; RT "The status, quality, and expansion of the NIH full-length cDNA project: RT the Mammalian Gene Collection (MGC)."; RL Genome Res. 14:2121-2127(2004). RN [6] RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 230-333. RX PubMed=2565873; DOI=10.1016/0888-7543(89)90324-8; RA Poole S.J., Law M.L., Kao F.T., Lau Y.-F.C.; RT "Isolation and chromosomal localization of the human En-2 gene."; RL Genomics 4:225-231(1989). RN [7] RP NUCLEOTIDE SEQUENCE [MRNA] OF 252-290. RC TISSUE=Bone marrow; RX PubMed=7518789; DOI=10.1016/0378-1119(94)90380-8; RA Moretti P., Simmons P., Thomas P., Haylock D., Rathjen P., Vadas M., RA D'Andrea R.; RT "Identification of homeobox genes expressed in human haemopoietic RT progenitor cells."; RL Gene 144:213-219(1994). RN [8] RP POSSIBLE INVOLVEMENT IN SUSCEPTIBILITY TO AUTISM. RX PubMed=16252243; DOI=10.1086/497705; RA Benayed R., Gharani N., Rossman I., Mancuso V., Lazar G., Kamdar S., RA Bruse S.E., Tischfield S., Smith B.J., Zimmerman R.A., DiCicco-Bloom E., RA Brzustowicz L.M., Millonig J.H.; RT "Support for the homeobox transcription factor gene ENGRAILED 2 as an RT autism spectrum disorder susceptibility locus."; RL Am. J. Hum. Genet. 77:851-868(2005). CC -!- SUBCELLULAR LOCATION: Nucleus. CC -!- DISEASE: Note=Genetic variations in EN2 may be associated with CC susceptibility to autism. CC -!- SIMILARITY: Belongs to the engrailed homeobox family. {ECO:0000305}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; L12701; AAA53504.2; -; Genomic_DNA. DR EMBL; L12700; AAA53504.2; JOINED; Genomic_DNA. DR EMBL; AC008060; AAQ96875.1; -; Genomic_DNA. DR EMBL; CH236954; EAL23909.1; -; Genomic_DNA. DR EMBL; BC104970; AAI04971.1; -; mRNA. DR EMBL; BC104972; AAI04973.1; -; mRNA. DR EMBL; J03066; AAF68670.1; -; Genomic_DNA. DR CCDS; CCDS5940.1; -. DR PIR; E48423; E48423. DR RefSeq; NP_001418.2; NM_001427.3. DR BMRB; P19622; -. DR SMR; P19622; -. DR BioGRID; 108335; 5. DR STRING; 9606.ENSP00000297375; -. DR DrugBank; DB03309; N-cyclohexyltaurine. DR iPTMnet; P19622; -. DR PhosphoSitePlus; P19622; -. DR BioMuta; EN2; -. DR DMDM; 21903415; -. DR MassIVE; P19622; -. DR PaxDb; P19622; -. DR PeptideAtlas; P19622; -. DR PRIDE; P19622; -. DR ProteomicsDB; 53681; -. DR Antibodypedia; 18860; 268 antibodies. DR Ensembl; ENST00000297375; ENSP00000297375; ENSG00000164778. DR GeneID; 2020; -. DR KEGG; hsa:2020; -. DR UCSC; uc003wmb.3; human. DR CTD; 2020; -. DR DisGeNET; 2020; -. DR EuPathDB; HostDB:ENSG00000164778.4; -. DR GeneCards; EN2; -. DR HGNC; HGNC:3343; EN2. DR HPA; ENSG00000164778; Tissue enriched (brain). DR MIM; 131310; gene. DR neXtProt; NX_P19622; -. DR OpenTargets; ENSG00000164778; -. DR Orphanet; 106; NON RARE IN EUROPE: Autism. DR PharmGKB; PA27780; -. DR eggNOG; KOG0493; Eukaryota. DR GeneTree; ENSGT00940000159935; -. DR HOGENOM; CLU_051739_2_0_1; -. DR InParanoid; P19622; -. DR OMA; YNHSTAA; -. DR OrthoDB; 1575614at2759; -. DR PhylomeDB; P19622; -. DR TreeFam; TF106461; -. DR PathwayCommons; P19622; -. DR BioGRID-ORCS; 2020; 5 hits in 867 CRISPR screens. DR ChiTaRS; EN2; human. DR GeneWiki; EN2_(gene); -. DR GenomeRNAi; 2020; -. DR Pharos; P19622; Tbio. DR PRO; PR:P19622; -. DR Proteomes; UP000005640; Chromosome 7. DR RNAct; P19622; protein. DR Bgee; ENSG00000164778; Expressed in cerebellar hemisphere and 68 other tissues. DR Genevisible; P19622; HS. DR GO; GO:0001650; C:fibrillar center; IDA:HPA. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0000790; C:nuclear chromatin; ISA:NTNU_SB. DR GO; GO:0005730; C:nucleolus; IDA:HPA. DR GO; GO:0005654; C:nucleoplasm; IDA:HPA. DR GO; GO:0005634; C:nucleus; IBA:GO_Central. DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; ISA:NTNU_SB. DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central. DR GO; GO:1990837; F:sequence-specific double-stranded DNA binding; IDA:ARUK-UCL. DR GO; GO:1990403; P:embryonic brain development; IEA:Ensembl. DR GO; GO:0030902; P:hindbrain development; IEA:Ensembl. DR GO; GO:0030901; P:midbrain development; IEA:Ensembl. DR GO; GO:0007275; P:multicellular organism development; TAS:ProtInc. DR GO; GO:0043524; P:negative regulation of neuron apoptotic process; IEA:Ensembl. DR GO; GO:0048666; P:neuron development; IEA:Ensembl. DR GO; GO:0030182; P:neuron differentiation; IBA:GO_Central. DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IEA:Ensembl. DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central. DR CDD; cd00086; homeodomain; 1. DR InterPro; IPR019549; Homeobox-engrailed_C-terminal. DR InterPro; IPR009057; Homeobox-like_sf. DR InterPro; IPR017970; Homeobox_CS. DR InterPro; IPR001356; Homeobox_dom. DR InterPro; IPR000747; Homeobox_engrailed. DR InterPro; IPR020479; Homeobox_metazoa. DR InterPro; IPR019737; Homoebox-engrailed_CS. DR Pfam; PF10525; Engrail_1_C_sig; 1. DR Pfam; PF00046; Homeodomain; 1. DR PRINTS; PR00026; ENGRAILED. DR PRINTS; PR00024; HOMEOBOX. DR SMART; SM00389; HOX; 1. DR SUPFAM; SSF46689; SSF46689; 1. DR PROSITE; PS00033; ENGRAILED; 1. DR PROSITE; PS00027; HOMEOBOX_1; 1. DR PROSITE; PS50071; HOMEOBOX_2; 1. PE 2: Evidence at transcript level; KW Autism; Autism spectrum disorder; Developmental protein; DNA-binding; KW Homeobox; Nucleus; Polymorphism; Reference proteome. FT CHAIN 1..333 FT /note="Homeobox protein engrailed-2" FT /id="PRO_0000196067" FT DNA_BIND 244..303 FT /note="Homeobox" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108" FT VARIANT 121 FT /note="L -> F (in dbSNP:rs3735653)" FT /id="VAR_021985" SQ SEQUENCE 333 AA; 34211 MW; ACF5399E383D6257 CRC64; MEENDPKPGE AAAAVEGQRQ PESSPGGGSG GGGGSSPGEA DTGRRRALML PAVLQAPGNH QHPHRITNFF IDNILRPEFG RRKDAGTCCA GAGGGRGGGA GGEGGASGAE GGGGAGGSEQ LLGSGSREPR QNPPCAPGAG GPLPAAGSDS PGDGEGGSKT LSLHGGAKKG GDPGGPLDGS LKARGLGGGD LSVSSDSDSS QAGANLGAQP MLWPAWVYCT RYSDRPSSGP RSRKPKKKNP NKEDKRPRTA FTAEQLQRLK AEFQTNRYLT EQRRQSLAQE LSLNESQIKI WFQNKRAKIK KATGNKNTLA VHLMAQGLYN HSTTAKEGKS DSE //