ID   VGNM_SQMVM     STANDARD;      PRT;   790 AA.
AC   P36341;
DT   01-JUN-1994 (Rel. 29, Created)
DT   01-JUN-1994 (Rel. 29, Last sequence update)
DT   25-JAN-2005 (Rel. 46, Last annotation update)
DE   Genome polyprotein M [Contains: 42 kDa coat protein; 22 kDa coat
DE   protein].
OS   Squash mosaic virus (strain melon) (SqMV).
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Comoviridae;
OC   Comovirus.
OX   NCBI_TaxID=36401;
RN   [1]
RP   NUCLEOTIDE SEQUENCE, AND PROTEIN SEQUENCE OF 357-381 AND 606-616.
RX   MEDLINE=93277375; PubMed=8503782;
RA   Hu J.S., Pang S.Z., Nagpala P.G., Siemieniak D.R., Slightom J.L.,
RA   Gonsalves D.;
RT   "The coat protein genes of squash mosaic virus: cloning, sequence
RT   analysis, and expression in tobacco protoplasts.";
RL   Arch. Virol. 130:17-31(1993).
CC   -!- PTM: The N-terminus of the 42 kDa coat protein is blocked.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; M96148; -; NOT_ANNOTATED_CDS.
DR   PIR; A48356; A48356.
DR   HSSP; P23009; 1BMV.
DR   InterPro; IPR003181; Como_LCP.
DR   InterPro; IPR003182; Como_SCP.
DR   InterPro; IPR008975; Viral_cap_coat.
DR   Pfam; PF02247; Como_LCP; 1.
DR   Pfam; PF02248; Como_SCP; 1.
KW   Capsid protein; Direct protein sequencing; Glycoprotein; Polyprotein.
FT   CHAIN       233    606       42 kDa coat protein.
FT   CHAIN       607    790       22 kDa coat protein.
FT   CARBOHYD    183    183       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    187    187       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    205    205       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    655    655       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    704    704       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    742    742       N-linked (GlcNAc...) (Potential).
SQ   SEQUENCE   790 AA;  86939 MW;  BD914EEA2D2BB9C9 CRC64;
     MDCFTSPDSN ICGGMLLVDT AHLNPDNAIR SVFVAPFIGG RPIRVLLFPD TLVEIAPNMN
     SRFKLLCTTS NGDVAPDFNL AMVKVNVAGC AVSLTKTYTP TAYLEQELIK EKGAIVQYLN
     RHTFSMHRNN QMTKEEMQKQ RLSFRLESAL TLQEKHPLHA TFCKSTNFVY KIGGDAKEGS
     NGNLTVNESQ LSSHSPSAHV LHKHNNSGDN EVEFSEIGVV VPGAGRTKAY GQNELDLAQL
     SLDDTSSLRG TALQTKLATS RIILSKTMVG NTVLREDLLA TFLQDSNERA AIDLIRTHVI
     RGKIRCVASI NVPENTGCAL AICFNSGITG AADTDIYTTS SQNAIVWNPA CEKAVELTFN
     PNPCGDAWNF VFLQQTKAHF AVQCVTGWTT TPLTDLALVL TWHIDRSLCV PKTLTISSAH
     ASFPINRWMG KLSFPQGPAR VLKRMPLAIG GGAGTKDAIL MNMPNAVISL HRYFRGDFVF
     EITKMSSPYI KATIAFFIAF GDITEEMTNL ESFPHKLVQF REIQGRTTIT FTQSEFLTAW
     STQVLSTVNP QKDGCPHLYA LLHDSATSTI EGNFVIGVKL LDIRNYRAYG HNPGFEGARL
     LGISGQSTMV QQLGTYNPIW MVRTPLESTA QQNFASFTAD LMESTISGDS TGNWNITVYP
     SPIANLLKVA AWKKGTIRFQ LICRGAAVKQ SDWAASARID LINNLSNKAL PARSWYITKP
     RGGDIEFDLE IAGPNNGFEM ANSSWAFQTT WYLEIAIDNP KQFTLFELNA CLMEDFEVAG
     NTLNPPILLS
//