ID   SLOU_DROME              Reviewed;         659 AA.
AC   P22807; Q9VD96;
DT   01-AUG-1991, integrated into UniProtKB/Swiss-Prot.
DT   01-AUG-1991, sequence version 1.
DT   07-APR-2021, entry version 184.
DE   RecName: Full=Homeobox protein slou;
DE   AltName: Full=Homeobox protein NK-1;
DE   AltName: Full=Protein slouch;
DE   AltName: Full=S59/2;
GN   Name=slou; Synonyms=NK1, S59; ORFNames=CG6534;
OS   Drosophila melanogaster (Fruit fly).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Holometabola; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC   Drosophilidae; Drosophila; Sophophora.
OX   NCBI_TaxID=7227;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [MRNA].
RX   PubMed=1980118; DOI=10.1101/gad.4.12a.2098;
RA   Dohrmann C., Azpiazu N., Frasch M.;
RT   "A new Drosophila homeo box gene is expressed in mesodermal precursor cells
RT   of distinct muscles during embryogenesis.";
RL   Genes Dev. 4:2098-2111(1990).
RN   [2]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Berkeley;
RX   PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA   Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA   Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA   George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA   Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA   Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA   Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A.,
RA   An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA   Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA   Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA   Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA   Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA   Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA   Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA   Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA   Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA   Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA   Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA   Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F.,
RA   Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA   Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA   Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA   McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA   Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA   Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA   Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA   Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F.,
RA   Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J.,
RA   Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R.,
RA   Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y.,
RA   Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T.,
RA   Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S.,
RA   Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W.,
RA   Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M.,
RA   Venter J.C.;
RT   "The genome sequence of Drosophila melanogaster.";
RL   Science 287:2185-2195(2000).
RN   [3]
RP   GENOME REANNOTATION.
RC   STRAIN=Berkeley;
RX   PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA   Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA   Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA   Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA   Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA   Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA   Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT   "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT   review.";
RL   Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN   [4]
RP   NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 497-625.
RX   PubMed=2573058; DOI=10.1073/pnas.86.20.7716;
RA   Kim Y., Nirenberg M.;
RT   "Drosophila NK-homeobox genes.";
RL   Proc. Natl. Acad. Sci. U.S.A. 86:7716-7720(1989).
CC   -!- FUNCTION: May play a role in specifying the identity of particular
CC       somatic muscles and neurons of the CNS.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}.
CC   -!- TISSUE SPECIFICITY: Mesodermal precursor cells of distinct muscles
CC       during embryogenesis, a subset of neuronal cells of the CNS and their
CC       precursors and also in cells of a small region of the midgut.
CC   -!- DEVELOPMENTAL STAGE: Postgastrulation-stage.
CC   -!- SIMILARITY: Belongs to the NK-1 homeobox family. {ECO:0000305}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; X55393; CAA39067.1; -; mRNA.
DR   EMBL; AE014297; AAF55901.3; -; Genomic_DNA.
DR   EMBL; M27289; AAA28616.1; -; Genomic_DNA.
DR   PIR; A36664; A36664.
DR   RefSeq; NP_476657.1; NM_057309.3.
DR   SMR; P22807; -.
DR   BioGRID; 67513; 19.
DR   DIP; DIP-17156N; -.
DR   IntAct; P22807; 2.
DR   STRING; 7227.FBpp0083521; -.
DR   PaxDb; P22807; -.
DR   EnsemblMetazoa; FBtr0084123; FBpp0083521; FBgn0002941.
DR   GeneID; 42547; -.
DR   KEGG; dme:Dmel_CG6534; -.
DR   CTD; 42547; -.
DR   FlyBase; FBgn0002941; slou.
DR   eggNOG; KOG0488; Eukaryota.
DR   GeneTree; ENSGT00940000172282; -.
DR   HOGENOM; CLU_401307_0_0_1; -.
DR   InParanoid; P22807; -.
DR   OMA; YDRDEEM; -.
DR   OrthoDB; 858478at2759; -.
DR   PhylomeDB; P22807; -.
DR   BioGRID-ORCS; 42547; 0 hits in 3 CRISPR screens.
DR   GenomeRNAi; 42547; -.
DR   PRO; PR:P22807; -.
DR   Proteomes; UP000000803; Chromosome 3R.
DR   Bgee; FBgn0002941; Expressed in head and 16 other tissues.
DR   ExpressionAtlas; P22807; baseline and differential.
DR   Genevisible; P22807; DM.
DR   GO; GO:0005634; C:nucleus; IDA:FlyBase.
DR   GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR   GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR   GO; GO:0030154; P:cell differentiation; IBA:GO_Central.
DR   GO; GO:0007521; P:muscle cell fate determination; IMP:FlyBase.
DR   GO; GO:0007517; P:muscle organ development; TAS:FlyBase.
DR   GO; GO:0000122; P:negative regulation of transcription by RNA polymerase II; IMP:FlyBase.
DR   GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR   CDD; cd00086; homeodomain; 1.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR017970; Homeobox_CS.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR020479; Homeobox_metazoa.
DR   Pfam; PF00046; Homeodomain; 1.
DR   PRINTS; PR00024; HOMEOBOX.
DR   SMART; SM00389; HOX; 1.
DR   SUPFAM; SSF46689; SSF46689; 1.
DR   PROSITE; PS00027; HOMEOBOX_1; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
PE   2: Evidence at transcript level;
KW   Developmental protein; DNA-binding; Homeobox; Nucleus; Reference proteome;
KW   Repeat.
FT   CHAIN           1..659
FT                   /note="Homeobox protein slou"
FT                   /id="PRO_0000049070"
FT   REPEAT          221..222
FT                   /note="1"
FT   REPEAT          223..224
FT                   /note="2"
FT   REPEAT          225..226
FT                   /note="3"
FT   REPEAT          227..228
FT                   /note="4"
FT   REPEAT          229..230
FT                   /note="5"
FT   REPEAT          231..232
FT                   /note="6"
FT   REPEAT          233..234
FT                   /note="7"
FT   DNA_BIND        545..604
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000255|PROSITE-ProRule:PRU00108"
FT   REGION          221..234
FT                   /note="7 X 2 AA tandem repeats of H-P"
FT   COMPBIAS        201..239
FT                   /note="His-rich"
FT   COMPBIAS        364..372
FT                   /note="Poly-Ala"
FT   COMPBIAS        477..522
FT                   /note="Asp/Glu-rich (acidic)"
FT   COMPBIAS        536..542
FT                   /note="Poly-Gly"
SQ   SEQUENCE   659 AA;  69955 MW;  5D401F55C4670280 CRC64;
     MVMLQSPAQK ASDSASAQNT AVGGLMSPNS NPDSPKSNTS PDVASADSVV SGTGGGSTPP
     AAKIPKFIIS ANGAAVAGKQ EQELRYSLER LKQMSSESGS LLSRLSPLQE DSQDKEKPNH
     NNNNSLTNHN ANSNTRRSQS PPASVGSVSF SSPAQQRKLL ELNAVRHLAR PEPLQHPHAA
     LLQQHPHLLQ NPQFLAAAQQ HMHHHQHQHH QHPAHPHSHQ HPHPHPHPHP HPHPSAVFHL
     RAPSSSSTAP PSPATSPLSP PTSPAMHSDQ QMSPPIAPPQ NPPHSSQPPQ QQQVAAPSDM
     DLERIKLVAA VAARTTQASS TSALASASNS VSNASISISN SSSGSPSGRD LSDYGFRIQL
     GGLAAAAAAA AATSRQIAAA TYARSDTSEE LNVDGNDEDS NDGSHSTPSV CPVDLTRSVN
     SSAAANPSSA STSASSDRDA ATKRLAFSVE NILDPNKFTG NKLPSGPFGH PRQWSYERDE
     EMQERLDDDQ SEDMSAQDLN DMDQDDMCDD GSDIDDPSSE TDSKKGGSRN GDGKSGGGGG
     GGSKPRRART AFTYEQLVSL ENKFKTTRYL SVCERLNLAL SLSLTETQVK IWFQNRRTKW
     KKQNPGMDVN SPTIPPPGGG SFGPGAYASG LLYSHAVPYP PYGPYFHPLG AHHLSHSHS
//