ID VGNM_SQMVM STANDARD; PRT; 790 AA. AC P36341; DT 01-JUN-1994 (Rel. 29, Created) DT 01-JUN-1994 (Rel. 29, Last sequence update) DT 01-JUN-1994 (Rel. 29, Last annotation update) DE GENOME POLYPROTEIN M [CONTAINS: 22 KDA AND 42 KDA COAT PROTEINS]. OS Squash mosaic virus (strain melon) (SqMV). OC Viruses; ssRNA positive-strand viruses, no DNA stage; Comoviridae; OC Comovirus. RN [1] RP SEQUENCE FROM N.A., AND SEQUENCE OF 357-381 AND 606-616. RX MEDLINE; 93277375. RA Hu J.S., Pang S.Z., Nagpala P.G., Siemieniak D.R., Slightom J.L., RA Gonsalves D.; RT "The coat protein genes of squash mosaic virus: cloning, sequence RT analysis, and expression in tobacco protoplasts."; RL Arch. Virol. 130:17-31(1993). CC -!- PTM: THE N-TERMINUS OF THE LARGE COAT PROTEIN IS BLOCKED. CC -------------------------------------------------------------------------- CC This SWISS-PROT entry is copyright. It is produced through a collaboration CC between the Swiss Institute of Bioinformatics and the EMBL outstation - CC the European Bioinformatics Institute. There are no restrictions on its CC use by non-profit institutions as long as its content is in no way CC modified and this statement is not removed. Usage by and for commercial CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ CC or send an email to license@isb-sib.ch). CC -------------------------------------------------------------------------- CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; M96148; -; NOT_ANNOTATED_CDS. DR PIR; A48356; A48356. DR HSSP; P23009; 1BMV. KW Coat protein; Polyprotein; Glycoprotein. FT CHAIN 233 606 42 kDa COAT PROTEIN. FT CHAIN 607 790 22 kDa COAT PROTEIN. FT CARBOHYD 183 183 N-LINKED (GLCNAC...) (POTENTIAL). FT CARBOHYD 187 187 N-LINKED (GLCNAC...) (POTENTIAL). FT CARBOHYD 205 205 N-LINKED (GLCNAC...) (POTENTIAL). FT CARBOHYD 655 655 N-LINKED (GLCNAC...) (POTENTIAL). FT CARBOHYD 704 704 N-LINKED (GLCNAC...) (POTENTIAL). FT CARBOHYD 742 742 N-LINKED (GLCNAC...) (POTENTIAL). SQ SEQUENCE 790 AA; 86939 MW; BD914EEA2D2BB9C9 CRC64; MDCFTSPDSN ICGGMLLVDT AHLNPDNAIR SVFVAPFIGG RPIRVLLFPD TLVEIAPNMN SRFKLLCTTS NGDVAPDFNL AMVKVNVAGC AVSLTKTYTP TAYLEQELIK EKGAIVQYLN RHTFSMHRNN QMTKEEMQKQ RLSFRLESAL TLQEKHPLHA TFCKSTNFVY KIGGDAKEGS NGNLTVNESQ LSSHSPSAHV LHKHNNSGDN EVEFSEIGVV VPGAGRTKAY GQNELDLAQL SLDDTSSLRG TALQTKLATS RIILSKTMVG NTVLREDLLA TFLQDSNERA AIDLIRTHVI RGKIRCVASI NVPENTGCAL AICFNSGITG AADTDIYTTS SQNAIVWNPA CEKAVELTFN PNPCGDAWNF VFLQQTKAHF AVQCVTGWTT TPLTDLALVL TWHIDRSLCV PKTLTISSAH ASFPINRWMG KLSFPQGPAR VLKRMPLAIG GGAGTKDAIL MNMPNAVISL HRYFRGDFVF EITKMSSPYI KATIAFFIAF GDITEEMTNL ESFPHKLVQF REIQGRTTIT FTQSEFLTAW STQVLSTVNP QKDGCPHLYA LLHDSATSTI EGNFVIGVKL LDIRNYRAYG HNPGFEGARL LGISGQSTMV QQLGTYNPIW MVRTPLESTA QQNFASFTAD LMESTISGDS TGNWNITVYP SPIANLLKVA AWKKGTIRFQ LICRGAAVKQ SDWAASARID LINNLSNKAL PARSWYITKP RGGDIEFDLE IAGPNNGFEM ANSSWAFQTT WYLEIAIDNP KQFTLFELNA CLMEDFEVAG NTLNPPILLS //