ID HMN1_DROME STANDARD; PRT; 659 AA. AC P22807; DT 01-AUG-1991 (REL. 19, CREATED) DT 01-AUG-1991 (REL. 19, LAST SEQUENCE UPDATE) DT 01-JUN-1994 (REL. 29, LAST ANNOTATION UPDATE) DE HOMEOBOX PROTEIN NK-1 (S59/2). GN NK1 OR S59. OS DROSOPHILA MELANOGASTER (FRUIT FLY). OC EUKARYOTA; METAZOA; ARTHROPODA; INSECTA; DIPTERA. RN [1] RP SEQUENCE FROM N.A. RM 91099659 RA DOHRMANN C., AZPIAZU N., FRASCH M.; RL GENES DEV. 4:2098-2111(1990). RN [2] RP SEQUENCE OF 497-625 FROM N.A. RM 90046666 RA KIM Y., NIRENBERG M.; RL PROC. NATL. ACAD. SCI. U.S.A. 86:7716-7720(1989). CC -!- FUNCTION: MAY PLAY A ROLE IN SPECIFIYING THE IDENTITY OF CC PARTICULAR SOMATIC MUSCLES AND NEURONS OF THE CNS. CC -!- DEVELOPMENTAL STAGE: POSTGASTRULATION-STAGE. CC -!- TISSUE SPECIFICITY: MESODERMAL PRECURSOR CELLS OF DISTINCT MUSCLES CC DURING EMBRYOGENESIS, A SUBSET OF NEURONAL CELLS OF THE CNS AND CC THEIR PRECURSORS AND ALSO IN CELLS OF A SMALL REGION OF THE CC MIDGUT. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; X55393; DMS59. DR EMBL; M27289; DMHOXNK1. DR PIR; A36664; A36664. DR PIR; A33976; A33976. DR HSSP; P02833; 1SAN. DR FLYBASE; FBGN0002941; NK1. DR PROSITE; PS00027; HOMEOBOX. KW HOMEOBOX; DNA-BINDING; DEVELOPMENTAL PROTEIN; NUCLEAR PROTEIN; KW REPEAT. FT DOMAIN 201 239 HIS-RICH. FT DOMAIN 221 234 7 X 2 AA TANDEM REPEATS OF H-P. FT DOMAIN 364 372 POLY-ALA. FT DOMAIN 477 522 ASP/GLU-RICH (ACIDIC). FT DOMAIN 536 542 POLY-GLY. FT DNA_BIND 545 604 HOMEOBOX. SQ SEQUENCE 659 AA; 69955 MW; 2184751 CN; MVMLQSPAQK ASDSASAQNT AVGGLMSPNS NPDSPKSNTS PDVASADSVV SGTGGGSTPP AAKIPKFIIS ANGAAVAGKQ EQELRYSLER LKQMSSESGS LLSRLSPLQE DSQDKEKPNH NNNNSLTNHN ANSNTRRSQS PPASVGSVSF SSPAQQRKLL ELNAVRHLAR PEPLQHPHAA LLQQHPHLLQ NPQFLAAAQQ HMHHHQHQHH QHPAHPHSHQ HPHPHPHPHP HPHPSAVFHL RAPSSSSTAP PSPATSPLSP PTSPAMHSDQ QMSPPIAPPQ NPPHSSQPPQ QQQVAAPSDM DLERIKLVAA VAARTTQASS TSALASASNS VSNASISISN SSSGSPSGRD LSDYGFRIQL GGLAAAAAAA AATSRQIAAA TYARSDTSEE LNVDGNDEDS NDGSHSTPSV CPVDLTRSVN SSAAANPSSA STSASSDRDA ATKRLAFSVE NILDPNKFTG NKLPSGPFGH PRQWSYERDE EMQERLDDDQ SEDMSAQDLN DMDQDDMCDD GSDIDDPSSE TDSKKGGSRN GDGKSGGGGG GGSKPRRART AFTYEQLVSL ENKFKTTRYL SVCERLNLAL SLSLTETQVK IWFQNRRTKW KKQNPGMDVN SPTIPPPGGG SFGPGAYASG LLYSHAVPYP PYGPYFHPLG AHHLSHSHS //