ID POLG_JAEVN STANDARD; PRT; 1440 AA. AC P14403; P08769; DT 01-JAN-1990 (Rel. 13, Created) DT 01-JAN-1990 (Rel. 13, Last sequence update) DT 24-JAN-2006 (Rel. 49, Last annotation update) DE Genome polyprotein [Contains: Capsid protein C (Core protein); DE Envelope protein M (Matrix protein); Major envelope protein E; DE Nonstructural protein 1 (NS1); Nonstructural protein 2A (NS2A); DE Flavivirin protease NS2B regulatory subunit; Flavivirin protease NS3 DE catalytic subunit (EC 3.4.21.91)] (Fragment). OS Japanese encephalitis virus (strain Nakayama). OC Viruses; ssRNA positive-strand viruses, no DNA stage; Flaviviridae; OC Flavivirus; Japanese encephalitis virus group. OX NCBI_TaxID=11076; RN [1] RP NUCLEOTIDE SEQUENCE [GENOMIC RNA]. RX MEDLINE=87236200; PubMed=3035787; RA McAda P.C., Mason P.W., Schmaljohn C.S., Dalrymple J.M., Mason T.L., RA Fournier M.J.; RT "Partial nucleotide sequence of the Japanese encephalitis virus RT genome."; RL Virology 158:348-360(1987). CC -!- FUNCTION: The small proteins NS2A, NS4A and NS4B are hydrophobic, CC suggesting a possible membrane-related function. NS5 may play a CC role in the viral RNA replication. The NS2B/NS3 protease complex CC processes the viral polyprotein. CC -!- CATALYTIC ACTIVITY: Selective hydrolysis of -Xaa-Xaa-|-Yaa- bonds CC in which each of the Xaa can be either Arg or Lys and Yaa can be CC either Ser or Ala. CC -!- SUBUNIT: NS3 and NS2B form a heterodimer. NS3 is the catalytic CC subunit, whereas NS2B strongly stimulates the latter (By CC similarity). CC -!- PTM: Specific enzymatic cleavages in vivo yield mature proteins CC (By similarity). CC -!- MISCELLANEOUS: The virion of this virus is a nucleocapsid covered CC by a lipoprotein envelope. The envelope contains two proteins: the CC protein M and glycoprotein E. The nucleocapsid is a complex of CC protein C and mRNA. In immature particles, there are 60 CC icosaedrally organized trimeric spikes on the surface. Each spike CC consists of three heterodimers of envelope protein M precursor CC (prM) and envelope protein E (By similarity). CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; M16574; AAA46251.1; -; Genomic_RNA. DR PIR; A27844; GNWVJF. DR HSSP; Q88653; 1OKE. DR InterPro; IPR001122; Flavi_capsidC. DR InterPro; IPR011999; Flavi_glycoE_cen. DR InterPro; IPR000069; Flavi_M. DR InterPro; IPR001157; Flavi_NS1. DR InterPro; IPR000752; Flavi_NS2A. DR InterPro; IPR000487; Flavi_NS2B. DR InterPro; IPR002535; Flavi_propep. DR InterPro; IPR000336; Flv_glyE_Ig-like. DR InterPro; IPR011998; Viral_glycoE_cen. DR Pfam; PF01003; Flavi_capsid; 1. DR Pfam; PF02832; Flavi_glycop_C; 1. DR Pfam; PF00869; Flavi_glycoprot; 1. DR Pfam; PF01004; Flavi_M; 1. DR Pfam; PF00948; Flavi_NS1; 1. DR Pfam; PF01005; Flavi_NS2A; 1. DR Pfam; PF01002; Flavi_NS2B; 1. DR Pfam; PF01570; Flavi_propep; 1. DR ProDom; PD001556; Flavi_glycoprotE; 1. DR ProDom; PD001496; Flavi_NS1; 1. KW ATP-binding; Capsid protein; Core protein; Envelope protein; KW Glycoprotein; Helicase; Hydrolase; Membrane; Nucleotide-binding; KW Polyprotein; Structural protein; Transmembrane. FT CHAIN <1 53 Capsid protein C. FT /FTId=PRO_0000037869. FT PROPEP 54 146 FT /FTId=PRO_0000037870. FT CHAIN 147 222 Envelope protein M. FT /FTId=PRO_0000037871. FT CHAIN 223 794 Major envelope protein E. FT /FTId=PRO_0000037872. FT CHAIN 795 1136 Nonstructural protein 1. FT /FTId=PRO_0000037873. FT CHAIN 1137 1301 Nonstructural protein 2A. FT /FTId=PRO_0000037874. FT CHAIN 1302 1432 Flavivirin protease NS2B regulatory FT subunit. FT /FTId=PRO_0000037875. FT CHAIN 1433 >1440 Flavivirin protease NS3 catalytic FT subunit. FT /FTId=PRO_0000037876. FT CARBOHYD 68 68 N-linked (GlcNAc...) (Potential). FT CARBOHYD 376 376 N-linked (GlcNAc...) (Potential). FT CARBOHYD 852 852 N-linked (GlcNAc...) (Potential). FT CARBOHYD 929 929 N-linked (GlcNAc...) (Potential). FT DISULFID 225 252 By similarity. FT DISULFID 282 338 By similarity. FT DISULFID 296 327 By similarity. FT DISULFID 314 343 By similarity. FT DISULFID 412 509 By similarity. FT DISULFID 526 557 By similarity. FT NON_TER 1 1 FT NON_TER 1440 1440 SQ SEQUENCE 1440 AA; 158185 MW; 4D489A365A3C2E6E CRC64; SVAMKHLTSF KRELGTLIDA VNKRGRKQNK RGGNEGSIMW LASLAVVIAC AGAMKLSNFQ GKLLMTVNNT DIADVIVIPN PSKGENRCWV RAIDVGYMCE DTITYECPKL TMGNDPEDVD CWCDNQEVYV QYGRCTRTRH SKRSRRSVSV QTHGESSLVN KKEAWLDSTK ATRYLMKTEN WIVRNPGYAF LAAILGWMLG SNNGQRRWYF TILLLLVAPA YSFNCLGMGN RDFIEGASGA TWVDLVLEGD SCLTIMANDK PTLDVRMINI EAVQLAEVRS YCYHASVTDI STVARCPTTG EAHNEKRADS SYVCKQGFTD RGWGNGCGLF GKGSIDTCAK FSCTSKAIGR TIQPENIKYE VGIFVHGTTT SENHGNYSAQ VGASQAAKFT VTPNAPSITL KLGDYGEVTL DCEPRSGLNT EAFYVMTVGS KSFLVHREWF HDLALPWTPP SSTAWRNREL LMEFEEAHAT KQSVVALGSQ EGGLHQALAG AIVVEYSSSV KLTSGHLKCR LKMDKLALKG TTYGMCTEKF SFAKNPADTG HGTVVIELSY SGSDGPCKIP IVSVASLNDM TPVGRLVTVN PFVATSSANS KVLVEMEPPF GDSYIVVGRG DKQINHHWHK AGSTLGKAFS TTLKGAQRLA ALGDTAWDFG SIGGVFNSIG KAVHQVFGGA FRTLFGGMSW ITQGLMGALL LWMGVNARDR SIALAFLATG GVLVFLATNV HADTGCAIDI TRKEMRCGSG IFVHNDVEAW VDRYKYLPET PRSLAKIVHK AHKEGVCGVR SVTRLEHQMW EAVRDELNVL LKENAVDLSV VVNKPVGRYR SAPKRLSMTQ EKFEMGWKAW GKSILFAPEL ANSTFVVDGP ETKECPDEHR AWNSIEIEDF GFGITSTRVW LKIREESTDE CDGAIIGTAV KGHVAVHSDL SYWIESRYND TWKLERAVFG EVKSCTWPET HTLWGDGVEE SELIIPHTIA GPKSKHNRRE GYKTQNQGPW DENGIVLDFD YCPGTKVTIT EDCGKRGPSV RTTTDSGKLI TDWCCRSCSL PPLRFRTENG CWYGMEIRPV RHDETTLVRS QVDAFNGEMV DPFQLGLLVM FLATQEVLRK RWTARLTIPA VLGALLVLML GGITYTDLAR YVVLVAAAFA EANSGGDVLH LALIAVFKIQ PAFLVMNMLS TRWTNQENVV LVLGAAFFHL ASVDLQIGVH GILNAAAIAW MIVRAITFPT TSSVTMPVLA LLTPGMRALY LDTYRIILLV IGICSLLQER KKTMAKKKGA VLLGLALTST GWFSPTTIAA GLMVCNPNKK RGWPATEFLS AVGLMFAIVG GLAELDIESM SIPFMLAGLM AVSYVVSGKA TDMWLERAAD ISWEMDAAIT GSSRRLDVKL DDDGDFHLID DPGVPWKVWV LRMSCIGLAA LTPWAIVPAA FGYWLTLKTT KRGGVFWDTP //