ID   SLOU_DROME              Reviewed;         659 AA.
AC   P22807; Q9VD96;
DT   01-AUG-1991, integrated into UniProtKB/Swiss-Prot.
DT   01-AUG-1991, sequence version 1.
DT   02-NOV-2016, entry version 150.
DE   RecName: Full=Homeobox protein slou;
DE   AltName: Full=Homeobox protein NK-1;
DE   AltName: Full=Protein slouch;
DE   AltName: Full=S59/2;
GN   Name=slou; Synonyms=NK1, S59; ORFNames=CG6534;
OS   Drosophila melanogaster (Fruit fly).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
OC   Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha;
OC   Ephydroidea; Drosophilidae; Drosophila; Sophophora.
OX   NCBI_TaxID=7227;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [MRNA].
RX   PubMed=1980118; DOI=10.1101/gad.4.12a.2098;
RA   Dohrmann C., Azpiazu N., Frasch M.;
RT   "A new Drosophila homeo box gene is expressed in mesodermal precursor
RT   cells of distinct muscles during embryogenesis.";
RL   Genes Dev. 4:2098-2111(1990).
RN   [2]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley;
RX   PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA   Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA   Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA   George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA   Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X.,
RA   Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D.,
RA   Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G.,
RA   Abril J.F., Agbayani A., An H.-J., Andrews-Pfannkoch C., Baldwin D.,
RA   Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M.,
RA   Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S.,
RA   Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P.,
RA   Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I.,
RA   Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P.,
RA   de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M.,
RA   Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P.,
RA   Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W.,
RA   Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K.,
RA   Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA   Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J.,
RA   Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C.,
RA   Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A.,
RA   Kimmel B.E., Kodira C.D., Kraft C.L., Kravitz S., Kulp D., Lai Z.,
RA   Lasko P., Lei Y., Levitsky A.A., Li J.H., Li Z., Liang Y., Lin X.,
RA   Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D.,
RA   Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A.,
RA   Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L.,
RA   Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M.,
RA   Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G.,
RA   Reinert K., Remington K., Saunders R.D.C., Scheeler F., Shen H.,
RA   Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA   Spier E., Spradling A.C., Stapleton M., Strong R., Sun E.,
RA   Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X.,
RA   Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J.,
RA   Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A.,
RA   Ye J., Yeh R.-F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L.,
RA   Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.C., Zhu X.,
RA   Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;
RT   "The genome sequence of Drosophila melanogaster.";
RL   Science 287:2185-2195(2000).
RN   [3]
RP   GENOME REANNOTATION.
RC   STRAIN=Berkeley;
RX   PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA   Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA   Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA   Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA   Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA   Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q.,
RA   Stapleton M., Yamada C., Ashburner M., Gelbart W.M., Rubin G.M.,
RA   Lewis S.E.;
RT   "Annotation of the Drosophila melanogaster euchromatic genome: a
RT   systematic review.";
RL   Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN   [4]
RP   NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 497-625.
RX   PubMed=2573058; DOI=10.1073/pnas.86.20.7716;
RA   Kim Y., Nirenberg M.;
RT   "Drosophila NK-homeobox genes.";
RL   Proc. Natl. Acad. Sci. U.S.A. 86:7716-7720(1989).
CC   -!- FUNCTION: May play a role in specifying the identity of particular
CC       somatic muscles and neurons of the CNS.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}.
CC   -!- TISSUE SPECIFICITY: Mesodermal precursor cells of distinct muscles
CC       during embryogenesis, a subset of neuronal cells of the CNS and
CC       their precursors and also in cells of a small region of the
CC       midgut.
CC   -!- DEVELOPMENTAL STAGE: Postgastrulation-stage.
CC   -!- SIMILARITY: Belongs to the NK-1 homeobox family. {ECO:0000305}.
CC   -!- SIMILARITY: Contains 1 homeobox DNA-binding domain.
CC       {ECO:0000255|PROSITE-ProRule:PRU00108}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; X55393; CAA39067.1; -; mRNA.
DR   EMBL; AE014297; AAF55901.3; -; Genomic_DNA.
DR   EMBL; M27289; AAA28616.1; -; Genomic_DNA.
DR   PIR; A36664; A36664.
DR   RefSeq; NP_476657.1; NM_057309.3.
DR   UniGene; Dm.3456; -.
DR   ProteinModelPortal; P22807; -.
DR   SMR; P22807; -.
DR   BioGrid; 67513; 11.
DR   DIP; DIP-17156N; -.
DR   MINT; MINT-321547; -.
DR   STRING; 7227.FBpp0303415; -.
DR   PaxDb; P22807; -.
DR   PRIDE; P22807; -.
DR   EnsemblMetazoa; FBtr0084123; FBpp0083521; FBgn0002941.
DR   GeneID; 42547; -.
DR   KEGG; dme:Dmel_CG6534; -.
DR   CTD; 42547; -.
DR   FlyBase; FBgn0002941; slou.
DR   eggNOG; KOG0488; Eukaryota.
DR   eggNOG; ENOG411188D; LUCA.
DR   GeneTree; ENSGT00850000132248; -.
DR   InParanoid; P22807; -.
DR   KO; K09309; -.
DR   OrthoDB; EOG091G0SCQ; -.
DR   PhylomeDB; P22807; -.
DR   GenomeRNAi; 42547; -.
DR   PRO; PR:P22807; -.
DR   Proteomes; UP000000803; Chromosome 3R.
DR   Bgee; FBgn0002941; -.
DR   ExpressionAtlas; P22807; differential.
DR   Genevisible; P22807; DM.
DR   GO; GO:0005634; C:nucleus; IDA:FlyBase.
DR   GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR   GO; GO:0007501; P:mesodermal cell fate specification; NAS:FlyBase.
DR   GO; GO:0007521; P:muscle cell fate determination; IEP:UniProtKB.
DR   GO; GO:0007517; P:muscle organ development; TAS:FlyBase.
DR   GO; GO:0000122; P:negative regulation of transcription from RNA polymerase II promoter; IMP:FlyBase.
DR   Gene3D; 1.10.10.60; -; 1.
DR   InterPro; IPR017970; Homeobox_CS.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR020479; Homeobox_metazoa.
DR   InterPro; IPR009057; Homeodomain-like.
DR   Pfam; PF00046; Homeobox; 1.
DR   PRINTS; PR00024; HOMEOBOX.
DR   SMART; SM00389; HOX; 1.
DR   SUPFAM; SSF46689; SSF46689; 1.
DR   PROSITE; PS00027; HOMEOBOX_1; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
PE   2: Evidence at transcript level;
KW   Complete proteome; Developmental protein; DNA-binding; Homeobox;
KW   Nucleus; Reference proteome; Repeat.
FT   CHAIN         1    659       Homeobox protein slou.
FT                                /FTId=PRO_0000049070.
FT   REPEAT      221    222       1.
FT   REPEAT      223    224       2.
FT   REPEAT      225    226       3.
FT   REPEAT      227    228       4.
FT   REPEAT      229    230       5.
FT   REPEAT      231    232       6.
FT   REPEAT      233    234       7.
FT   DNA_BIND    545    604       Homeobox. {ECO:0000255|PROSITE-
FT                                ProRule:PRU00108}.
FT   REGION      221    234       7 X 2 AA tandem repeats of H-P.
FT   COMPBIAS    201    239       His-rich.
FT   COMPBIAS    364    372       Poly-Ala.
FT   COMPBIAS    477    522       Asp/Glu-rich (acidic).
FT   COMPBIAS    536    542       Poly-Gly.
SQ   SEQUENCE   659 AA;  69955 MW;  5D401F55C4670280 CRC64;
     MVMLQSPAQK ASDSASAQNT AVGGLMSPNS NPDSPKSNTS PDVASADSVV SGTGGGSTPP
     AAKIPKFIIS ANGAAVAGKQ EQELRYSLER LKQMSSESGS LLSRLSPLQE DSQDKEKPNH
     NNNNSLTNHN ANSNTRRSQS PPASVGSVSF SSPAQQRKLL ELNAVRHLAR PEPLQHPHAA
     LLQQHPHLLQ NPQFLAAAQQ HMHHHQHQHH QHPAHPHSHQ HPHPHPHPHP HPHPSAVFHL
     RAPSSSSTAP PSPATSPLSP PTSPAMHSDQ QMSPPIAPPQ NPPHSSQPPQ QQQVAAPSDM
     DLERIKLVAA VAARTTQASS TSALASASNS VSNASISISN SSSGSPSGRD LSDYGFRIQL
     GGLAAAAAAA AATSRQIAAA TYARSDTSEE LNVDGNDEDS NDGSHSTPSV CPVDLTRSVN
     SSAAANPSSA STSASSDRDA ATKRLAFSVE NILDPNKFTG NKLPSGPFGH PRQWSYERDE
     EMQERLDDDQ SEDMSAQDLN DMDQDDMCDD GSDIDDPSSE TDSKKGGSRN GDGKSGGGGG
     GGSKPRRART AFTYEQLVSL ENKFKTTRYL SVCERLNLAL SLSLTETQVK IWFQNRRTKW
     KKQNPGMDVN SPTIPPPGGG SFGPGAYASG LLYSHAVPYP PYGPYFHPLG AHHLSHSHS
//