ID H9XVP0_DROME Unreviewed; 3003 AA. AC H9XVP0; DT 13-JUN-2012, integrated into UniProtKB/TrEMBL. DT 13-JUN-2012, sequence version 1. DT 27-SEP-2017, entry version 49. DE SubName: Full=Zn finger homeodomain 2, isoform B {ECO:0000313|EMBL:AFH06785.1}; GN Name=zfh2 {ECO:0000313|EMBL:AFH06785.1, GN ECO:0000313|FlyBase:FBgn0004607}; GN Synonyms=Dmel\CG1449 {ECO:0000313|EMBL:AFH06785.1}, ZFH-2 GN {ECO:0000313|EMBL:AFH06785.1}, Zfh-2 {ECO:0000313|EMBL:AFH06785.1}, GN zfh-2 {ECO:0000313|EMBL:AFH06785.1}, ZFH2 GN {ECO:0000313|EMBL:AFH06785.1}, Zfh2 {ECO:0000313|EMBL:AFH06785.1}; GN ORFNames=CG1449 {ECO:0000313|EMBL:AFH06785.1, GN ECO:0000313|FlyBase:FBgn0004607}, Dmel_CG1449 GN {ECO:0000313|EMBL:AFH06785.1}; OS Drosophila melanogaster (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7227 {ECO:0000313|EMBL:AFH06785.1, ECO:0000313|Proteomes:UP000000803}; RN [1] {ECO:0000313|EMBL:AFH06785.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=10731132; DOI=10.1126/science.287.5461.2185; RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D., RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F., RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N., RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., RA Brandon R.C., Rogers Y.H., Blazej R.G., Champe M., Pfeiffer B.D., RA Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Gabor G.L., RA Abril J.F., Agbayani A., An H.J., Andrews-Pfannkoch C., Baldwin D., RA Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P., RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., RA Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P., RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., RA Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P., RA Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W., RA Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K., RA Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M., RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., RA Hostin D., Houston K.A., Howland T.J., Wei M.H., Ibegwam C., RA Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., RA Kimmel B.E., Kodira C.D., Kraft C., Kravitz S., Kulp D., Lai Z., RA Lasko P., Lei Y., Levitsky A.A., Li J., Li Z., Liang Y., Lin X., RA Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D., RA Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A., RA Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L., RA Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M., RA Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G., RA Reinert K., Remington K., Saunders R.D., Scheeler F., Shen H., RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T., RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., RA Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X., RA Wang Z.Y., Wassarman D.A., Weinstock G.M., Weissenbach J., RA Williams S.M., WoodageT, Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., RA Yeh R.F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L., RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., Zhu X., Smith H.O., RA Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.; RT "The genome sequence of Drosophila melanogaster."; RL Science 287:2185-2195(2000). RN [2] {ECO:0000313|EMBL:AFH06785.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537568; RA Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A., RA Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A., RA George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R., RA Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J., RA Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C., RA Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.; RT "Finishing a whole-genome shotgun: release 3 of the Drosophila RT melanogaster euchromatic genome sequence."; RL Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002). RN [3] {ECO:0000313|EMBL:AFH06785.1, ECO:0000313|Proteomes:UP000000803} RP GENOME REANNOTATION. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083; RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S., RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E., RA Smith C.D., Tupy J.L., Whitfied E.J., Bayraktaroglu L., Berman B.P., RA Bettencourt B.R., Celniker S.E., de Grey A.D., Drysdale R.A., RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., RA Stapleton M., Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., RA Lewis S.E.; RT "Annotation of the Drosophila melanogaster euchromatic genome: a RT systematic review."; RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002). RN [4] {ECO:0000313|EMBL:AFH06785.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537573; RA Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R., RA Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., RA Ashburner M., Celniker S.E.; RT "The transposable elements of the Drosophila melanogaster euchromatin: RT a genomics perspective."; RL Genome Biol. 3:RESEARCH0084-RESEARCH0084(2002). RN [5] {ECO:0000313|EMBL:AFH06785.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537574; RA Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A., RA Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G., RA Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M., RA Karpen G.H.; RT "Heterochromatic sequences in a Drosophila whole-genome shotgun RT assembly."; RL Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002). RN [6] {ECO:0000313|EMBL:AFH06785.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=16110336; DOI=10.1371/journal.pcbi.0010022; RA Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D., RA Ashburner M., Anxolabehere D.; RT "Combined evidence annotation of transposable elements in genome RT sequences."; RL PLoS Comput. Biol. 1:166-175(2005). RN [7] {ECO:0000313|EMBL:AFH06785.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=17569856; DOI=10.1126/science.1139815; RA Smith C.D., Shu S., Mungall C.J., Karpen G.H.; RT "The Release 5.1 annotation of Drosophila melanogaster RT heterochromatin."; RL Science 316:1586-1591(2007). RN [8] {ECO:0000313|EMBL:AFH06785.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=17569867; DOI=10.1126/science.1139816; RA Hoskins R.A., Carlson J.W., Kennedy C., Acevedo D., Evans-Holm M., RA Frise E., Wan K.H., Park S., Mendez-Lago M., Rossi F., Villasante A., RA Dimitri P., Karpen G.H., Celniker S.E.; RT "Sequence finishing and mapping of Drosophila melanogaster RT heterochromatin."; RL Science 316:1625-1628(2007). CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE- CC ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; AE014135; AFH06785.1; -; Genomic_DNA. DR RefSeq; NP_001245425.1; NM_001258496.3. DR UniGene; Dm.3906; -. DR ProteinModelPortal; H9XVP0; -. DR SMR; H9XVP0; -. DR PaxDb; H9XVP0; -. DR EnsemblMetazoa; FBtr0307167; FBpp0297996; FBgn0004607. DR GeneID; 43795; -. DR CTD; 43795; -. DR FlyBase; FBgn0004607; zfh2. DR eggNOG; KOG1146; Eukaryota. DR eggNOG; ENOG410XYHC; LUCA. DR GeneTree; ENSGT00530000063717; -. DR OrthoDB; EOG091G00JG; -. DR ChiTaRS; zfh2; fly. DR GenomeRNAi; 43795; -. DR Proteomes; UP000000803; Chromosome 4. DR Bgee; FBgn0004607; -. DR ExpressionAtlas; H9XVP0; differential. DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell. DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro. DR GO; GO:0006355; P:regulation of transcription, DNA-templated; IEA:InterPro. DR CDD; cd00086; homeodomain; 3. DR Gene3D; 3.30.40.10; -; 1. DR InterPro; IPR009057; Homeobox-like. DR InterPro; IPR017970; Homeobox_CS. DR InterPro; IPR001356; Homeobox_dom. DR InterPro; IPR013087; Znf_C2H2_type. DR InterPro; IPR013083; Znf_RING/FYVE/PHD. DR Pfam; PF00046; Homeobox; 3. DR SMART; SM00389; HOX; 3. DR SMART; SM00355; ZnF_C2H2; 15. DR SUPFAM; SSF46689; SSF46689; 3. DR SUPFAM; SSF57667; SSF57667; 5. DR PROSITE; PS00027; HOMEOBOX_1; 2. DR PROSITE; PS50071; HOMEOBOX_2; 3. DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 7. DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 7. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000803}; KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108, KW ECO:0000256|RuleBase:RU000682, ECO:0000313|EMBL:AFH06785.1}; KW Homeobox {ECO:0000256|PROSITE-ProRule:PRU00108, KW ECO:0000256|RuleBase:RU000682, ECO:0000313|EMBL:AFH06785.1}; KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042}; KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108, KW ECO:0000256|RuleBase:RU000682}; KW Reference proteome {ECO:0000313|Proteomes:UP000000803}; KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042}; KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}. FT DOMAIN 559 587 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 614 643 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 1438 1467 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 1513 1540 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 1541 1569 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 1793 1853 Homeobox. {ECO:0000259|PROSITE:PS50071}. FT DOMAIN 2150 2210 Homeobox. {ECO:0000259|PROSITE:PS50071}. FT DOMAIN 2232 2259 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 2369 2391 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 2756 2816 Homeobox. {ECO:0000259|PROSITE:PS50071}. FT DNA_BIND 1795 1854 Homeobox. {ECO:0000256|PROSITE-ProRule: FT PRU00108}. FT DNA_BIND 2152 2211 Homeobox. {ECO:0000256|PROSITE-ProRule: FT PRU00108}. FT DNA_BIND 2758 2817 Homeobox. {ECO:0000256|PROSITE-ProRule: FT PRU00108}. SQ SEQUENCE 3003 AA; 331762 MW; B9ECCAAE87E4FC72 CRC64; MSSFDVETFN GKIVYNLDGS AHIIATDNTN GGGSGSGQNC YGSTTNSLKN LSKDKGRGQE EKDIEHPSQY HREQSDNKRQ EEAVDNRPGV ESLGSACYKS SPKIHSFRVV SAQDANSTCQ DQIRAFKIQK PILMCFICKL SFGNVKSFSL HANTEHRLNL EELDQQLLNR EYSSAIIQRN MDEKPQISFL QPLANNDASA DTNDTEKLQT ATEGSDATLP SSPQPVFRNV SELEPENKQE TEQNRLLNQD REQEPESDQH TSSSKMAAPS AYIPLSSPKV AGKLTVKFGS LNSATAKTNN LSKVSSTSSP PSTYASGEVL SPSTDNISNH KSTHCNQETE PPSSSSSEVE MKIGSMSTSP QTNDSDVPCS GFLQMQHMTT GGAYTPQVSS FHASLAALAA NESNDNRVKL ITEFLQQQLQ QHQSSLFPSP CPDHPDLNGV DCKTCELLDI QQRSKSPSSS HHQFSQSLPQ LQIQSQPQQT PHRSPCSNSV ALPVSPSASS VASVGNASTA TSSFTIGACS EHINGRPQGV DCARCEMLLN SARLNSGVQM STRNSCKTLK CPQCNWHYKY QETLEIHMRE KHPDGESACG YCLAGQQHPR LARGESYSCG YKPYRCEICN YSTTTKGNLS IHMQSDKHLN NMQELNSSQN MVAAAAAAAV TGKLLLSSSS PQVTAACPSN SGSGAGSGSS NIVGGTASLS GNATPSVTGA NSSNANAGSN TNNAGTKPKP SFRCDICSYD TSVARNLRIH MTSEKHTHNM AVLQNNIKHI QAFNFLQQQQ QSGTGNIASH SSGSFMPEVA LADLAYNQAL MIQLLHQQQQ HQQSANTKLS PSSSPVSTPD QFSFSPKPIK LNHGTGAAMG IGMAMGMGMS HSNEVSCELS GDPHPLTKTD KWPMAFYSCL VCDCYSTNNL DDLNQHLLLD RSRQSSSASS EIMVIHNNNY ICRLCNYKTN LKANFQLHSK TDKHLQKLNF INHIREGGPQ NEYKMQYQQQ QLAANVVQLK CNCCDFHTNS IQKLSLHTQQ MRHDTMRMIF QHLLYIVQQS EMHNKSSGSA EDDPQCACPD EDQQLQLQSS KKLLLCQLCN FTAQNIHEMV QHVKGIRHLQ VEQFICLQRR SENQEIPALN EVFKVTEWVM ENEDVSLAPG LNLARTTTND ATTDASYAAA SSAAVPAIPD VSMFSPTSPS SCATSCDKNL SQIVLPNVNN LGSGVPTTVF KCNLCEYFVQ SKSEIAAHIE TEHSCAESDE FITIPTNTAA LQAFQTAVAA AALAAVHQRC AVINPPTQDT VDEDKDLDTN VSDGPVGIKQ ERLEQEVDRT TSMDVTKDLA SQATDFGAPE SPKVAETEVG VQCPLCLENH FREKQYLEDH LTSVHSVTRD GLSRLLLLVD QKALKKESTD IACPTDKAPY ANTNALERAP TPIENTCNVS LIKSTSANPS QSVSLQGLSC QQCEASFKHE EQLLKHAQQN QHFSLQNGEY LCLAASHISR PCFMTFRTIP TMISHFQDLH MSLIISERHV YKYRCKQCSL AFKTQEKLTT HMLYHSMRDA TKCSFCQRNF RSTQALQKHM EQAHAEDGTP STRTNSPQTP MLSTEETHKH LLAESHAVER VSGSDVSPIE LETHLNKETR HLSPTPMSLD SQSHQKHLAT FAALLKQQQC NSDAGGLHPE ALSMSTGEMP PQLQGLQNLQ HIQQHFGAVA AAAGLPINPV DMLNIMQFHH LMSLNFMNLA PPLVFGANAA GNAVSGPSAL NNSITTSTAT SASGLGDTHL TSGVSSIPVD SGKATAVPPQ TQLNANANSQ LASNQKRART RITDDQLKIL RAHFDINNSP SEESIMEMSQ KANLPMKVVK HWFRNTLFKE RQRNKDSPYN FNNPPSTTLN LEEYERTGQA KVTPLNDTCS VAVTGPMTSS TISLPPSGNI NLSSKENATS KVLAAGKANA SGPVTFSATV PVSTPLSRPE STNSSGNISD YIGNNIFFGQ LGSKEQILPY SLDGQIKSEP QDDMIGATDF AYQTKQHSSF SFLKQQQDLV DPPEQCLTNQ NADTAQDQSL LAGSSLASNC QSQQQINIFE TKSESGSSDV LSRPPSPNSG AAGNVYGSMN DLLNQQLENM GSNMGPPKKM QIVGKTFEKN VAPMVTSGSV STQFESNSSN SSSSSSSTSG GKRANRTRFT DYQIKVLQEF FENNSYPKDS DLEYLSKLLL LSPRVIVVWF QNARQKQRKI YENQPNNTLF ENEETKKQNI NYACKKCNLV FQRYYELIRH QKNHCFKEEN NKKSAKAQIA AAQIAQNLSS EDSNSSMDIH HVGICPPGSA VASHTLSTPG SAAPLPGQYT QHSFGALPSP QHLFAKSSSL TDFSPSTTPT PPQRERSNSL DQIQRPPKFD CDKCELNFNQ LEKLREHQLL HLMNPGNICS DVGQNSNPEA NFGPFGSILQ SLQQAAAQQQ QQHHQQPPTK KRKYSDCSSN ADEMQSLSEL EASQKKHEYL YKYFMQNETS QEVKQQFLMQ QQQKKLEQGN ECDFELDFLT NFYQQNELKK VSNYDFLLQY YRTHEEAKSS QQHTFSSSKK PTIEFLLQYY QLNESKKFFQ LVASPQIIPD VPGYKPSLRI PKSTSDEAPY IGETSLEQAT ELQREKQDEQ LRIDRPSEEN DLSMNKNKVE NINNNNINVD QSNLTETNGG VPSVETKEEC TQESSLIAMD DENKYLCTRS KQKDDKEKSH YLHNLEDFLD ATMIENNSQT LTFNDDEKAC QKDELTQNSN AIEKRSSVSP VNVSSKQNKR LRTTILPEQL NFLYECYQSE SNPSRKMLEE ISKKVNLKKR VVQVWFQNSR AKDKKSRNQR HYAHISDDNS YDGSSGKEVY SDLRSNGITV DTDLETNLQD CQLCQVTQVN IRKHAFSVEH ISKMKKLLEQ TTELYAQSNG SGSEDNDSDR EKRFYNLSKA FLLQHVVTNA TSHAIHTARQ DSDVIAEGNC ILNYDTNGGD SKSHVQHNLP NEVVSEDARK IAGNQELMQQ LFNRNHITVI GGK //