ID   HMN1_DROME     STANDARD;      PRT;   659 AA.
AC   P22807;
DT   01-AUG-1991 (REL. 19, CREATED)
DT   01-AUG-1991 (REL. 19, LAST SEQUENCE UPDATE)
DT   01-MAY-1992 (REL. 22, LAST ANNOTATION UPDATE)
DE   HOMEOBOX PROTEIN NK-1 (S59/2).
GN   NK1 OR S59.
OS   DROSOPHILA MELANOGASTER (FRUIT FLY).
OC   EUKARYOTA; METAZOA; ARTHROPODA; INSECTA; DIPTERA.
RN   [1]
RP   SEQUENCE FROM N.A.
RM   91099659
RA   DOHRMANN C., AZPIAZU N., FRASCH M.;
RL   GENES DEV. 4:2098-2111(1990).
RN   [2]
RP   SEQUENCE OF 497-625 FROM N.A.
RM   90046666
RA   KIM Y., NIRENBERG M.;
RL   PROC. NATL. ACAD. SCI. U.S.A. 86:7716-7720(1989).
CC   -!- FUNCTION: MAY PLAY A ROLE IN SPECIFYING THE IDENTITY OF
CC       PARTICULAR SOMATIC MUSCLES AND NEURONS OF THE CNS.
CC   -!- DEVELOPMENTAL STAGE: POSTGASTRULATION-STAGE.
CC   -!- TISSUE SPECIFICITY: MESODERMAL PRECURSOR CELLS OF DISTINCT MUSCLES
CC       DURING EMBRYOGENESIS, A SUBSET OF NEURONAL CELLS OF THE CNS AND
CC       THEIR PRECURSORS AND ALSO IN CELLS OF A SMALL REGION OF THE
CC       MIDGUT.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; X55393; DMS59.
DR   EMBL; M27289; DMHOXNK1.
DR   PIR; A36664; A36664.
DR   PIR; A33976; A33976.
DR   FLYBASE; 02941; RELEASE 9203.
DR   PROSITE; PS00027; HOMEOBOX.
KW   HOMEOBOX; DNA-BINDING; DEVELOPMENTAL PROTEIN; NUCLEAR PROTEIN;
KW   TANDEM REPEAT.
FT   DOMAIN      201    239       HIS-RICH.
FT   REPEAT      221    234       HIS-PRO.
FT   DOMAIN      364    372       POLY-ALA.
FT   DOMAIN      477    522       ASP/GLU-RICH (ACIDIC).
FT   DOMAIN      536    542       POLY-GLY.
FT   DNA_BIND    546    605       HOMEOBOX.
SQ   SEQUENCE   659 AA;  69955 MW;  2184751 CN;
     MVMLQSPAQK ASDSASAQNT AVGGLMSPNS NPDSPKSNTS PDVASADSVV SGTGGGSTPP
     AAKIPKFIIS ANGAAVAGKQ EQELRYSLER LKQMSSESGS LLSRLSPLQE DSQDKEKPNH
     NNNNSLTNHN ANSNTRRSQS PPASVGSVSF SSPAQQRKLL ELNAVRHLAR PEPLQHPHAA
     LLQQHPHLLQ NPQFLAAAQQ HMHHHQHQHH QHPAHPHSHQ HPHPHPHPHP HPHPSAVFHL
     RAPSSSSTAP PSPATSPLSP PTSPAMHSDQ QMSPPIAPPQ NPPHSSQPPQ QQQVAAPSDM
     DLERIKLVAA VAARTTQASS TSALASASNS VSNASISISN SSSGSPSGRD LSDYGFRIQL
     GGLAAAAAAA AATSRQIAAA TYARSDTSEE LNVDGNDEDS NDGSHSTPSV CPVDLTRSVN
     SSAAANPSSA STSASSDRDA ATKRLAFSVE NILDPNKFTG NKLPSGPFGH PRQWSYERDE
     EMQERLDDDQ SEDMSAQDLN DMDQDDMCDD GSDIDDPSSE TDSKKGGSRN GDGKSGGGGG
     GGSKPRRART AFTYEQLVSL ENKFKTTRYL SVCERLNLAL SLSLTETQVK IWFQNRRTKW
     KKQNPGMDVN SPTIPPPGGG SFGPGAYASG LLYSHAVPYP PYGPYFHPLG AHHLSHSHS
//