ID   POLG_JAEVN     STANDARD;      PRT;  1440 AA.
AC   P14403; P08769;
DT   01-JAN-1990 (Rel. 13, Created)
DT   01-JAN-1990 (Rel. 13, Last sequence update)
DT   05-JUL-2004 (Rel. 44, Last annotation update)
DE   Genome polyprotein [Contains: Capsid protein C (Core protein); Matrix
DE   protein (Envelope glycoprotein M); Major envelope protein E;
DE   Nonstructural protein NS1; Nonstructural protein NS2A; Flavivirin
DE   (EC 3.4.21.91) (NS2B/NS3 proteinase)] (Fragment).
OS   Japanese encephalitis virus (strain Nakayama).
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Flaviviridae;
OC   Flavivirus.
OX   NCBI_TaxID=11076;
RN   [1]
RP   SEQUENCE FROM N.A.
RX   MEDLINE=87236200; PubMed=3035787;
RA   McAda P.C., Mason P.W., Schmaljohn C.S., Dalrymple J.M., Mason T.L.,
RA   Fournier M.J.;
RT   "Partial nucleotide sequence of the Japanese encephalitis virus
RT   genome.";
RL   Virology 158:348-360(1987).
CC   -!- FUNCTION: The small proteins NS2A, NS4A and NS4B are hydrophobic,
CC       suggesting a possible membrane-related function. NS5 may play a
CC       role in the viral RNA replication. NS3 and NS2B form a protease
CC       which processes the viral polyprotein into separate proteins.
CC   -!- CATALYTIC ACTIVITY: Selective hydrolysis of Xaa-Xaa-|-Xbb bonds in
CC       which each of the Xaa can be either Arg or Lys and Xbb can be
CC       either Ser or Ala.
CC   -!- SUBUNIT: The virion of this virus is a nucleocapsid covered by a
CC       lipoprotein envelope. The envelope consists of two proteins:
CC       protein M and glycoprotein E. The nucleocapsid is a complex of
CC       protein C and mRNA. In immature particles, there are 60
CC       icosahedrally organized trimeric spikes on the surface. Each spike
CC       consists of three heterodimers of envelope protein M precursor
CC       (prM) and envelope protein E (By similarity).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; M16574; AAA46251.1; -.
DR   PIR; A27844; GNWVJF.
DR   HSSP; Q88653; 1OKE.
DR   InterPro; IPR001122; Flavi_capsidC.
DR   InterPro; IPR000336; Flavi_glycoprotE.
DR   InterPro; IPR000069; Flavi_M.
DR   InterPro; IPR001157; Flavi_NS1.
DR   InterPro; IPR000752; Flavi_NS2A.
DR   InterPro; IPR000487; Flavi_NS2B.
DR   InterPro; IPR002535; Flavi_propep.
DR   Pfam; PF01003; Flavi_capsid; 1.
DR   Pfam; PF00869; Flavi_glycoprot; 1.
DR   Pfam; PF02832; Flavi_glycop_C; 1.
DR   Pfam; PF01004; Flavi_M; 1.
DR   Pfam; PF00948; Flavi_NS1; 1.
DR   Pfam; PF01005; Flavi_NS2A; 1.
DR   Pfam; PF01002; Flavi_NS2B; 1.
DR   Pfam; PF01570; Flavi_propep; 1.
DR   ProDom; PD001556; Flavi_glycoprotE; 1.
DR   ProDom; PD001496; Flavi_NS1; 1.
KW   Polyprotein; Glycoprotein; Core protein; Coat protein;
KW   Envelope protein; Hydrolase; Helicase; ATP-binding; Transmembrane;
KW   Nonstructural protein.
FT   NON_TER       1      1
FT   CHAIN        <1     53       Capsid protein C.
FT   PROPEP       54    146
FT   CHAIN       147    222       Envelope glycoprotein M.
FT   CHAIN       223    794       Major envelope protein E.
FT   CHAIN       795   1136       Nonstructural protein NS1.
FT   CHAIN      1137   1301       Nonstructural protein NS2A.
FT   CHAIN      1302   1432       Flavivirin protease subunit NS2B.
FT   CHAIN      1433  >1440       Flavivirin protease subunit NS3.
FT   DISULFID    225    252       By similarity.
FT   DISULFID    282    338       By similarity.
FT   DISULFID    296    327       By similarity.
FT   DISULFID    314    343       By similarity.
FT   DISULFID    412    509       By similarity.
FT   DISULFID    526    557       By similarity.
FT   CARBOHYD     68     68       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    376    376       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    852    852       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    929    929       N-linked (GlcNAc...) (Potential).
FT   NON_TER    1440   1440
SQ   SEQUENCE   1440 AA;  158184 MW;  4D489A365A3C2E6E CRC64;
     SVAMKHLTSF KRELGTLIDA VNKRGRKQNK RGGNEGSIMW LASLAVVIAC AGAMKLSNFQ
     GKLLMTVNNT DIADVIVIPN PSKGENRCWV RAIDVGYMCE DTITYECPKL TMGNDPEDVD
     CWCDNQEVYV QYGRCTRTRH SKRSRRSVSV QTHGESSLVN KKEAWLDSTK ATRYLMKTEN
     WIVRNPGYAF LAAILGWMLG SNNGQRRWYF TILLLLVAPA YSFNCLGMGN RDFIEGASGA
     TWVDLVLEGD SCLTIMANDK PTLDVRMINI EAVQLAEVRS YCYHASVTDI STVARCPTTG
     EAHNEKRADS SYVCKQGFTD RGWGNGCGLF GKGSIDTCAK FSCTSKAIGR TIQPENIKYE
     VGIFVHGTTT SENHGNYSAQ VGASQAAKFT VTPNAPSITL KLGDYGEVTL DCEPRSGLNT
     EAFYVMTVGS KSFLVHREWF HDLALPWTPP SSTAWRNREL LMEFEEAHAT KQSVVALGSQ
     EGGLHQALAG AIVVEYSSSV KLTSGHLKCR LKMDKLALKG TTYGMCTEKF SFAKNPADTG
     HGTVVIELSY SGSDGPCKIP IVSVASLNDM TPVGRLVTVN PFVATSSANS KVLVEMEPPF
     GDSYIVVGRG DKQINHHWHK AGSTLGKAFS TTLKGAQRLA ALGDTAWDFG SIGGVFNSIG
     KAVHQVFGGA FRTLFGGMSW ITQGLMGALL LWMGVNARDR SIALAFLATG GVLVFLATNV
     HADTGCAIDI TRKEMRCGSG IFVHNDVEAW VDRYKYLPET PRSLAKIVHK AHKEGVCGVR
     SVTRLEHQMW EAVRDELNVL LKENAVDLSV VVNKPVGRYR SAPKRLSMTQ EKFEMGWKAW
     GKSILFAPEL ANSTFVVDGP ETKECPDEHR AWNSIEIEDF GFGITSTRVW LKIREESTDE
     CDGAIIGTAV KGHVAVHSDL SYWIESRYND TWKLERAVFG EVKSCTWPET HTLWGDGVEE
     SELIIPHTIA GPKSKHNRRE GYKTQNQGPW DENGIVLDFD YCPGTKVTIT EDCGKRGPSV
     RTTTDSGKLI TDWCCRSCSL PPLRFRTENG CWYGMEIRPV RHDETTLVRS QVDAFNGEMV
     DPFQLGLLVM FLATQEVLRK RWTARLTIPA VLGALLVLML GGITYTDLAR YVVLVAAAFA
     EANSGGDVLH LALIAVFKIQ PAFLVMNMLS TRWTNQENVV LVLGAAFFHL ASVDLQIGVH
     GILNAAAIAW MIVRAITFPT TSSVTMPVLA LLTPGMRALY LDTYRIILLV IGICSLLQER
     KKTMAKKKGA VLLGLALTST GWFSPTTIAA GLMVCNPNKK RGWPATEFLS AVGLMFAIVG
     GLAELDIESM SIPFMLAGLM AVSYVVSGKA TDMWLERAAD ISWEMDAAIT GSSRRLDVKL
     DDDGDFHLID DPGVPWKVWV LRMSCIGLAA LTPWAIVPAA FGYWLTLKTT KRGGVFWDTP
//