ID   H9XVP0_DROME            Unreviewed;      3003 AA.
AC   H9XVP0;
DT   13-JUN-2012, integrated into UniProtKB/TrEMBL.
DT   13-JUN-2012, sequence version 1.
DT   13-JUN-2012, entry version 1.
DE   SubName: Full=Zn finger homeodomain 2, isoform B;
GN   Name=zfh2; ORFNames=Dmel_CG1449;
OS   Drosophila melanogaster (Fruit fly).
OC   Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha;
OC   Ephydroidea; Drosophilidae; Drosophila; Sophophora.
OX   NCBI_TaxID=7227;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley;
RX   MEDLINE=20196006; PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA   Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA   Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA   George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA   Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X.,
RA   Brandon R.C., Rogers Y.H., Blazej R.G., Champe M., Pfeiffer B.D.,
RA   Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Gabor G.L.,
RA   Abril J.F., Agbayani A., An H.J., Andrews-Pfannkoch C., Baldwin D.,
RA   Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M.,
RA   Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S.,
RA   Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P.,
RA   Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I.,
RA   Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P.,
RA   de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M.,
RA   Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P.,
RA   Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W.,
RA   Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K.,
RA   Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA   Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J.,
RA   Hostin D., Houston K.A., Howland T.J., Wei M.H., Ibegwam C.,
RA   Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A.,
RA   Kimmel B.E., Kodira C.D., Kraft C., Kravitz S., Kulp D., Lai Z.,
RA   Lasko P., Lei Y., Levitsky A.A., Li J., Li Z., Liang Y., Lin X.,
RA   Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D.,
RA   Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A.,
RA   Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L.,
RA   Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M.,
RA   Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G.,
RA   Reinert K., Remington K., Saunders R.D., Scheeler F., Shen H.,
RA   Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.,
RA   Spier E., Spradling A.C., Stapleton M., Strong R., Sun E.,
RA   Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X.,
RA   Wang Z.Y., Wassarman D.A., Weinstock G.M., Weissenbach J.,
RA   Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A., Ye J.,
RA   Yeh R.F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L.,
RA   Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., Zhu X., Smith H.O.,
RA   Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;
RT   "The genome sequence of Drosophila melanogaster.";
RL   Science 287:2185-2195(2000).
RN   [2]
RP   GENOME REANNOTATION.
RC   STRAIN=Berkeley;
RX   MEDLINE=22426069; PubMed=12537572;
RA   Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA   Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA   Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA   Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA   Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q.,
RA   Stapleton M., Yamada C., Ashburner M., Gelbart W.M., Rubin G.M.,
RA   Lewis S.E.;
RT   "Annotation of the Drosophila melanogaster euchromatic genome: a
RT   systematic review.";
RL   Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AE014135; AFH06785.1; -; Genomic_DNA.
PE   4: Predicted;
KW   Complete proteome; DNA-binding; Homeobox; Reference proteome.
SQ   SEQUENCE   3003 AA;  331762 MW;  B9ECCAAE87E4FC72 CRC64;
     MSSFDVETFN GKIVYNLDGS AHIIATDNTN GGGSGSGQNC YGSTTNSLKN LSKDKGRGQE
     EKDIEHPSQY HREQSDNKRQ EEAVDNRPGV ESLGSACYKS SPKIHSFRVV SAQDANSTCQ
     DQIRAFKIQK PILMCFICKL SFGNVKSFSL HANTEHRLNL EELDQQLLNR EYSSAIIQRN
     MDEKPQISFL QPLANNDASA DTNDTEKLQT ATEGSDATLP SSPQPVFRNV SELEPENKQE
     TEQNRLLNQD REQEPESDQH TSSSKMAAPS AYIPLSSPKV AGKLTVKFGS LNSATAKTNN
     LSKVSSTSSP PSTYASGEVL SPSTDNISNH KSTHCNQETE PPSSSSSEVE MKIGSMSTSP
     QTNDSDVPCS GFLQMQHMTT GGAYTPQVSS FHASLAALAA NESNDNRVKL ITEFLQQQLQ
     QHQSSLFPSP CPDHPDLNGV DCKTCELLDI QQRSKSPSSS HHQFSQSLPQ LQIQSQPQQT
     PHRSPCSNSV ALPVSPSASS VASVGNASTA TSSFTIGACS EHINGRPQGV DCARCEMLLN
     SARLNSGVQM STRNSCKTLK CPQCNWHYKY QETLEIHMRE KHPDGESACG YCLAGQQHPR
     LARGESYSCG YKPYRCEICN YSTTTKGNLS IHMQSDKHLN NMQELNSSQN MVAAAAAAAV
     TGKLLLSSSS PQVTAACPSN SGSGAGSGSS NIVGGTASLS GNATPSVTGA NSSNANAGSN
     TNNAGTKPKP SFRCDICSYD TSVARNLRIH MTSEKHTHNM AVLQNNIKHI QAFNFLQQQQ
     QSGTGNIASH SSGSFMPEVA LADLAYNQAL MIQLLHQQQQ HQQSANTKLS PSSSPVSTPD
     QFSFSPKPIK LNHGTGAAMG IGMAMGMGMS HSNEVSCELS GDPHPLTKTD KWPMAFYSCL
     VCDCYSTNNL DDLNQHLLLD RSRQSSSASS EIMVIHNNNY ICRLCNYKTN LKANFQLHSK
     TDKHLQKLNF INHIREGGPQ NEYKMQYQQQ QLAANVVQLK CNCCDFHTNS IQKLSLHTQQ
     MRHDTMRMIF QHLLYIVQQS EMHNKSSGSA EDDPQCACPD EDQQLQLQSS KKLLLCQLCN
     FTAQNIHEMV QHVKGIRHLQ VEQFICLQRR SENQEIPALN EVFKVTEWVM ENEDVSLAPG
     LNLARTTTND ATTDASYAAA SSAAVPAIPD VSMFSPTSPS SCATSCDKNL SQIVLPNVNN
     LGSGVPTTVF KCNLCEYFVQ SKSEIAAHIE TEHSCAESDE FITIPTNTAA LQAFQTAVAA
     AALAAVHQRC AVINPPTQDT VDEDKDLDTN VSDGPVGIKQ ERLEQEVDRT TSMDVTKDLA
     SQATDFGAPE SPKVAETEVG VQCPLCLENH FREKQYLEDH LTSVHSVTRD GLSRLLLLVD
     QKALKKESTD IACPTDKAPY ANTNALERAP TPIENTCNVS LIKSTSANPS QSVSLQGLSC
     QQCEASFKHE EQLLKHAQQN QHFSLQNGEY LCLAASHISR PCFMTFRTIP TMISHFQDLH
     MSLIISERHV YKYRCKQCSL AFKTQEKLTT HMLYHSMRDA TKCSFCQRNF RSTQALQKHM
     EQAHAEDGTP STRTNSPQTP MLSTEETHKH LLAESHAVER VSGSDVSPIE LETHLNKETR
     HLSPTPMSLD SQSHQKHLAT FAALLKQQQC NSDAGGLHPE ALSMSTGEMP PQLQGLQNLQ
     HIQQHFGAVA AAAGLPINPV DMLNIMQFHH LMSLNFMNLA PPLVFGANAA GNAVSGPSAL
     NNSITTSTAT SASGLGDTHL TSGVSSIPVD SGKATAVPPQ TQLNANANSQ LASNQKRART
     RITDDQLKIL RAHFDINNSP SEESIMEMSQ KANLPMKVVK HWFRNTLFKE RQRNKDSPYN
     FNNPPSTTLN LEEYERTGQA KVTPLNDTCS VAVTGPMTSS TISLPPSGNI NLSSKENATS
     KVLAAGKANA SGPVTFSATV PVSTPLSRPE STNSSGNISD YIGNNIFFGQ LGSKEQILPY
     SLDGQIKSEP QDDMIGATDF AYQTKQHSSF SFLKQQQDLV DPPEQCLTNQ NADTAQDQSL
     LAGSSLASNC QSQQQINIFE TKSESGSSDV LSRPPSPNSG AAGNVYGSMN DLLNQQLENM
     GSNMGPPKKM QIVGKTFEKN VAPMVTSGSV STQFESNSSN SSSSSSSTSG GKRANRTRFT
     DYQIKVLQEF FENNSYPKDS DLEYLSKLLL LSPRVIVVWF QNARQKQRKI YENQPNNTLF
     ENEETKKQNI NYACKKCNLV FQRYYELIRH QKNHCFKEEN NKKSAKAQIA AAQIAQNLSS
     EDSNSSMDIH HVGICPPGSA VASHTLSTPG SAAPLPGQYT QHSFGALPSP QHLFAKSSSL
     TDFSPSTTPT PPQRERSNSL DQIQRPPKFD CDKCELNFNQ LEKLREHQLL HLMNPGNICS
     DVGQNSNPEA NFGPFGSILQ SLQQAAAQQQ QQHHQQPPTK KRKYSDCSSN ADEMQSLSEL
     EASQKKHEYL YKYFMQNETS QEVKQQFLMQ QQQKKLEQGN ECDFELDFLT NFYQQNELKK
     VSNYDFLLQY YRTHEEAKSS QQHTFSSSKK PTIEFLLQYY QLNESKKFFQ LVASPQIIPD
     VPGYKPSLRI PKSTSDEAPY IGETSLEQAT ELQREKQDEQ LRIDRPSEEN DLSMNKNKVE
     NINNNNINVD QSNLTETNGG VPSVETKEEC TQESSLIAMD DENKYLCTRS KQKDDKEKSH
     YLHNLEDFLD ATMIENNSQT LTFNDDEKAC QKDELTQNSN AIEKRSSVSP VNVSSKQNKR
     LRTTILPEQL NFLYECYQSE SNPSRKMLEE ISKKVNLKKR VVQVWFQNSR AKDKKSRNQR
     HYAHISDDNS YDGSSGKEVY SDLRSNGITV DTDLETNLQD CQLCQVTQVN IRKHAFSVEH
     ISKMKKLLEQ TTELYAQSNG SGSEDNDSDR EKRFYNLSKA FLLQHVVTNA TSHAIHTARQ
     DSDVIAEGNC ILNYDTNGGD SKSHVQHNLP NEVVSEDARK IAGNQELMQQ LFNRNHITVI
     GGK
//