ID V9IS50_9FLAV Unreviewed; 3390 AA. AC V9IS50; DT 19-MAR-2014, integrated into UniProtKB/TrEMBL. DT 19-MAR-2014, sequence version 1. DT 11-JUN-2014, entry version 4. DE SubName: Full=Polyprotein; DE Flags: Precursor; OS Dengue virus 3. OC Viruses; ssRNA positive-strand viruses, no DNA stage; Flaviviridae; OC Flavivirus; Dengue virus group. OX NCBI_TaxID=11069; RN [1] RP NUCLEOTIDE SEQUENCE. RC STRAIN=D3PY/AS10/03; RA Alfonso H.L., Amarilla A.A., Goncaves P.F., Barros M.T., RA Silva E.V.D.A., Nunes M., Vasconcelos P.F.C., Vieira D.S., RA Batista W.C., Bobadilla M.L., Vazquez C., Moran M., Figueiredo L.T.M., RA Aquino V.H.; RT "Molecular characterization of dengue virus type 3 isolated in Brazil RT and Paraguay."; RL Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases. CC -!- FUNCTION: Envelope protein E binding to host cell surface receptor CC is followed by virus internalization through clathrin-mediated CC endocytosis. Envelope protein E is subsequently involved in CC membrane fusion between virion and host late endosomes. CC Synthesized as a homodimer with prM which acts as a chaperone for CC envelope protein E. After cleavage of prM, envelope protein E CC dissociate from small envelope protein M and homodimerizes (By CC similarity). CC -!- CATALYTIC ACTIVITY: Nucleoside triphosphate + RNA(n) = diphosphate CC + RNA(n+1). CC -!- SUBCELLULAR LOCATION: Virion membrane; Multi-pass membrane CC protein. Host endoplasmic reticulum membrane; Multi-pass membrane CC protein (By similarity). CC -!- SIMILARITY: Contains RdRp catalytic domain. CC -!- SIMILARITY: Contains helicase ATP-binding domain. CC -!- SIMILARITY: Contains helicase C-terminal domain. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; JF808129; AFK83764.1; -; Genomic_RNA. DR GO; GO:0044167; C:host cell endoplasmic reticulum membrane; IEA:UniProtKB-SubCell. DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW. DR GO; GO:0019028; C:viral capsid; IEA:UniProtKB-KW. DR GO; GO:0019031; C:viral envelope; IEA:UniProtKB-KW. DR GO; GO:0055036; C:virion membrane; IEA:UniProtKB-SubCell. DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW. DR GO; GO:0008026; F:ATP-dependent helicase activity; IEA:InterPro. DR GO; GO:0003725; F:double-stranded RNA binding; IEA:InterPro. DR GO; GO:0004482; F:mRNA (guanine-N7-)-methyltransferase activity; IEA:InterPro. DR GO; GO:0004483; F:mRNA (nucleoside-2'-O-)-methyltransferase activity; IEA:InterPro. DR GO; GO:0003724; F:RNA helicase activity; IEA:InterPro. DR GO; GO:0003968; F:RNA-directed RNA polymerase activity; IEA:UniProtKB-KW. DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro. DR GO; GO:0070008; F:serine-type exopeptidase activity; IEA:InterPro. DR GO; GO:0005198; F:structural molecule activity; IEA:InterPro. DR GO; GO:0075512; P:clathrin-mediated endocytosis of virus by host cell; IEA:UniProtKB-KW. DR GO; GO:0039654; P:fusion of virus membrane with host endosome membrane; IEA:UniProtKB-KW. DR GO; GO:0039694; P:viral RNA genome replication; IEA:InterPro. DR GO; GO:0019062; P:virion attachment to host cell; IEA:UniProtKB-KW. DR Gene3D; 2.60.40.350; -; 1. DR Gene3D; 2.60.98.10; -; 2. DR Gene3D; 3.30.387.10; -; 1. DR Gene3D; 3.40.50.150; -; 1. DR Gene3D; 3.40.50.300; -; 2. DR InterPro; IPR011492; DEAD_Flavivir. DR InterPro; IPR000069; Env_glycoprot_M_flavivir. DR InterPro; IPR013755; Flav_gly_cen_dom_subdom1. DR InterPro; IPR001122; Flavi_capsidC. DR InterPro; IPR026470; Flavi_E_Stem/Anchor_dom. DR InterPro; IPR001157; Flavi_NS1. DR InterPro; IPR000752; Flavi_NS2A. DR InterPro; IPR000487; Flavi_NS2B. DR InterPro; IPR000404; Flavi_NS4A. DR InterPro; IPR001528; Flavi_NS4B. DR InterPro; IPR002535; Flavi_propep. DR InterPro; IPR000336; Flavivir/Alphavir_Ig-like. DR InterPro; IPR001850; Flavivirus_NS3_S7. DR InterPro; IPR027287; Flavovir_Ig-like. DR InterPro; IPR014412; Gen_Poly_FLV. DR InterPro; IPR011998; Glycoprot_cen/dimer. DR InterPro; IPR013754; GlyE_dim. DR InterPro; IPR014001; Helicase_ATP-bd. DR InterPro; IPR001650; Helicase_C. DR InterPro; IPR014756; Ig_E-set. DR InterPro; IPR026490; mRNA_cap_0/1_MeTrfase. DR InterPro; IPR027417; P-loop_NTPase. DR InterPro; IPR000208; RNA-dir_pol_flavivirus. DR InterPro; IPR007094; RNA-dir_pol_PSvirus. DR InterPro; IPR002877; rRNA_MeTrfase_FtsJ_dom. DR InterPro; IPR029063; SAM-dependent_MTases-like. DR InterPro; IPR009003; Trypsin-like_Pept_dom. DR Pfam; PF01003; Flavi_capsid; 1. DR Pfam; PF07652; Flavi_DEAD; 1. DR Pfam; PF02832; Flavi_glycop_C; 1. DR Pfam; PF00869; Flavi_glycoprot; 1. DR Pfam; PF01004; Flavi_M; 1. DR Pfam; PF00948; Flavi_NS1; 1. DR Pfam; PF01005; Flavi_NS2A; 1. DR Pfam; PF01002; Flavi_NS2B; 1. DR Pfam; PF01350; Flavi_NS4A; 1. DR Pfam; PF01349; Flavi_NS4B; 1. DR Pfam; PF00972; Flavi_NS5; 1. DR Pfam; PF01570; Flavi_propep; 1. DR Pfam; PF01728; FtsJ; 1. DR Pfam; PF00271; Helicase_C; 1. DR Pfam; PF00949; Peptidase_S7; 1. DR PIRSF; PIRSF003817; Gen_Poly_FLV; 1. DR SMART; SM00487; DEXDc; 1. DR SMART; SM00490; HELICc; 1. DR SUPFAM; SSF50494; SSF50494; 1. DR SUPFAM; SSF52540; SSF52540; 2. DR SUPFAM; SSF53335; SSF53335; 1. DR SUPFAM; SSF56983; SSF56983; 1. DR SUPFAM; SSF81296; SSF81296; 1. DR TIGRFAMs; TIGR04240; flavi_E_stem; 1. DR PROSITE; PS51527; FLAVIVIRUS_NS2B; 1. DR PROSITE; PS51528; FLAVIVIRUS_NS3PRO; 1. DR PROSITE; PS51192; HELICASE_ATP_BIND_1; 1. DR PROSITE; PS51194; HELICASE_CTER; 1. DR PROSITE; PS50507; RDRP_SSRNA_POS; 1. DR PROSITE; PS51591; RNA_CAP01_NS5_MT; 1. PE 3: Inferred from homology; KW ATP-binding; Capsid protein; KW Clathrin-mediated endocytosis of virus by host; Disulfide bond; KW Fusion of virus membrane with host endosomal membrane; KW Fusion of virus membrane with host membrane; Helicase; KW Host endoplasmic reticulum; Host membrane; Host-virus interaction; KW Hydrolase; Membrane; Nucleotide-binding; Nucleotidyltransferase; KW Protease; RNA-binding; RNA-directed RNA polymerase; Serine protease; KW Transferase; Transmembrane; Transmembrane helix; KW Viral attachment to host cell; Viral envelope protein; KW Viral penetration into host cytoplasm; Viral RNA replication; Virion; KW Virus endocytosis by host; Virus entry into host cell. FT CHAIN 1 114 C protein. FT /FTId=PRO_5001181478. FT CHAIN 115 278 PrM protein. FT /FTId=PRO_5001181481. FT CHAIN 279 773 E protein. FT /FTId=PRO_5001181484. FT CHAIN 774 1125 NS1 protein. FT /FTId=PRO_5001181486. FT CHAIN 1126 1343 NS2A protein. FT /FTId=PRO_5001181485. FT CHAIN 1344 1473 NS2B protein. FT /FTId=PRO_5001181487. FT CHAIN 1474 2092 NS3 protein. FT /FTId=PRO_5001181482. FT CHAIN 2093 2242 NS4A protein. FT /FTId=PRO_5001181480. FT CHAIN 2243 2490 NS4B protein. FT /FTId=PRO_5001181483. FT CHAIN 2491 3390 NS5 protein. FT /FTId=PRO_5001181479. SQ SEQUENCE 3390 AA; 377814 MW; 594D3241159AD3DB CRC64; MNNQRKKTGK PSINMLKRVR NRVSTGSQLA KRFSKGLLNG QGPMKLVMAF IAFLRFLAIP PTAGVLARWG TFKKSGAIKV LKGFKKEISN MLSIINKRKK TSLCLMMILP AALAFHLTSR DGEPRMIVGK NERGKSLLFK TASGINMCTL IAMDLGEMCD DTVTYKCPHI TEVEPEDIDC WCNLTSTWVT YGTCNQAGEH RRDKRSVALA PHVGMGLDTR TQTWMSAEGA WRQVEKVETW ALRHPGFTIL ALFLAHYIGT SLTQKVVIFI LLMLVTPSMT MRCVGVGNRD FVEGLSGATW VDVVLEHGGC VTTMAKNKPT LDIELQKTEA TQLATLRKLC IEGKITNITT DSRCPTQGEA VLPEEQDQNY VCKHTYVDRG WGNGCGLFGK GSLVTCAKFQ CLEPIEGKVV QYENLKYTVI ITVHTGDQHQ VGNETQGVTA EITPQASTTE AILPEYGTLG LECSPRTGLD FNEMILLTMK NKAWMVHRQW FFDLPLPWTS GATTETPTWN RKELLVTFKN AHAKKQEVVV LGSQEGAMHT ALTGATEIQN SGGTSIFAGH LKCRLKMDKL ELKGMSYAMC TNTFVLKKEV SETQHGTILI KVEYKGEDAP CKIPFSTEDG QGKAHNGRLI TANPVVTKKE EPVNIEAEPP FGESNIVIGI GDNALKINWY KKGSSIGKMF EATARGARRM AILGDTAWDF GSVGGVLNSL GKMVHQIFGS AYTALFSGVS WVMKIGIGVL LTWIGLNSKN TSMSFSCIAI GIITLYLGAV VQADMGCVIN WKGKELKCGS GIFVTNEVHT WTEQYKFQAD SPKRLATAIA GAWENGVCGI RSTTRMENLL WKQIANELNY ILWENNIKLT VVVGDITGVL EQGKRTLTPQ PMELKYSWKT WGKAKIVTAE TQNSSFIIDG PNTPECPSAS RAWNVWEVED YGFGVFTTNI WLKLREVYTQ LCDHRLMSAA VKDERAVHAD MGYWIESQKN GSWKLEKASL IEVKTCTWPK SHTLWSNGVL ESDMIIPKSL AGPISQHNHR PGYHTQTAGP WHLGKLELDF NYCEGTTVVI TENCGTRGPS LRTTTVSGKL IHEWCCRSCT LPPLRYMGED GCWYGMEIRP ISEKEENMVK SLVSAGSGKV DNCTMGVLCL AILFEDVMRG KFGKKHMIAG VFFTFVLLLS GQITWRDMAH TLIMIGSNAS DRMGMGVTYL ALIATFKIQP FLALGFFLRK LTSRENLLLG VGLAMATTLQ LPEDIEQMAN GIALGLMALK LITQFETYQL WTALISLTCS NTMFTLTVAW RTATLILAGV SLLPVCQSSS MRKTDWLPMA VAAMGVPPLP LFIFSLKDTL KRRSWPLNEG VMAVGLVSIL ASSLLRNDVP MAGPLVAGGL LIACYVITGT SADLTVEKAA DITWEEEAEQ TGVSHNLMTT VDDDGTMRIK DDETENILTV LLKTALTIVS GVFPYSIPAT LLVWHTWQKQ TQRSGVLWDV PSPPETQKAE QEEGVYRIKQ QGIFGKTQVG VGVQKEGVFH TMWHVTRGAV LTYNGKRLEP NWASVKKDLI SYGGGWRLSA QWQKGEEVQV IAVEPGKNPK NFQTMPGTFQ TTTGEIGAIA LDFKPGTSGS PIINREGKVV GLYGNGVVTK NGGYVSGIAQ TNAEPDGPTP ELEEEMFKKR NLTIMDLHPG SGKTRKYLPA IVREAIKRRL RTLILAPTRV VAAEMEEALK GLPIRYQTTA TKSEHTGREI VDLMCHATFT MRLLSPVRVP TYNLIIMDEA HFTDPASIAA RGYISTRVGM GEAAAIFMTA TPPGTADAFP QSNAPIQDEE RDIPERSWNS GNEWITDFAG KTVWFVPSIK AGNDIANCLR KNGKKVIQLS RKTFDTEYQK TKLNDWDFVV TTDISEMGAN FKADRVIDPR RCLKPVILTD GPERVILAGP MPVTAASAAQ RRGRVGRNPQ KENDQYIFTG QPLNNDEDHA HWTEAKMLLD NINTPEGIIP ALFEPEREKS AAIDGEYRLK GESRKTFVEL MRRGDLPVWL AHKVASEGIK YTDRKWCFDG QRNNQILEEN MDVEIWTKEG EKKKLRPRWL DARTYSDPLA LKEFKDFAAG RKSIALDLVT EIGRVPSHLA HRTRNALDNL VMLHTSEHGG RAYRHAVEEL PETMETLLLL GLMILLTGGA MLFLISGKGI GKTSIGLICV IASSGMLWMA EIPLQWIASA IVLEFFMMVL LIPEPEKQRT PQDNQLAYVV IGILTLAAII AANEMGLLET TKRDLGMSKE PGVVSPTSYL DVDLHPASAW TLYAVATTVI TPMLRHTIEN STANVSLAAI ANQAVVLMGL DKGWPISKMD LGVPLLALGC YSQVNPLTLT AAVLLLITHY AIIGPGLQAK ATREAQKRTA AGIMKNPTVD GIMTIDLDPV IYDSKFEKQL GQVMLLVLCA VQLLLMRTSW ALCEALTLAT GPITTLWEGS PGKFWNTTIA VSMANIFRGS YLAGAGLAFS IMKSVGTGKR GTGSQGETLG EKWKKKLNQL SRKEFDLYKK SGITEVDRTE AKEGLKRGEI THHAVSRGSA KLQWFVERNM VIPEGRVIDL GCGRGGWSYY CAGLKKVTEV RGYTKGGPGH EEPVPMSTYG WNIVKLMSGK DVFYLPPEKC DTLLCDIGES SPSPTVEESR TIRVLKMVEP WLKNNQFCIK VLNPYMPTVI EHLERLQRKH GGMLVRNPLS RNSTHEMYWI SNGTGNIVAS VNMVSRLLLN RFTMTHRRPT IEKDVDLGAG TRHVNAEPET PNMDVIGERI KRTKEEHNST WHYDDENPYK TWAYHGSYEV KATGSASSMI NGVVKLLTKP WDVVPMVTQM AMTDTTPFGQ QRVFKEKVDT RTPRSMPGTR RVMGITAEWL WRTLGRNKKP RLCTREEFTK KVRTNAAMGA VFTEENQWDS AKAAVEDEDF WKLVDREREL HKLGKCGSCV YNMMGKREKK LGEFGKAKGS RAIWYMWLGA RYLEFEALGF LNEDHWFSRE NSYSGVEGEG LHKLGYILRD ISKIPGGAMY ADDTAGWDTR ITEDDLHNEE KITQQMDPEH RQLANAIFKL TYQNKVVKVQ RPTPTGTVMD IISRKDQRGS GQVGTYGLNT FTNMEAQLIR QMEGEGVLSK ADLENPHLPE KKITQWLETK GVERLKRMAI SGDDCVVKPI DDRFANALLA LNDMGKVRKD IPQWQPSKGW HDWQQVPFCS HHFHELIMKD GRKLVVPCRP QDELIGRARI SQGAGWSLRE TACLGKAYAQ MWSLMYFHRR DLRLASNAIC SAVPVHWVPT SRTTWSIHAH HQWMTTEDML TVWNRVWIED NPWMEDKTPV TTWENVPYLG KREDQWCGSL IGLTSRATWA QNIPTAIQQV RSLIGNEEFL DYMPSMKRFR KEEESEGAIW //