ID POLG_JAEVN STANDARD; PRT; 1440 AA. AC P14403; P08769; DT 01-JAN-1990 (Rel. 13, Created) DT 01-JAN-1990 (Rel. 13, Last sequence update) DT 16-OCT-2001 (Rel. 40, Last annotation update) DE Genome polyprotein [Contains: Capsid protein C (Core protein); Matrix DE protein (Envelope protein M); Major envelope protein E; Nonstructural DE proteins NS1, NS2A, and NS2B; Protease/helicase (EC 3.4.21.98) (NS3)] DE (Fragment). OS Japanese encephalitis virus (strain Nakayama). OC Viruses; ssRNA positive-strand viruses, no DNA stage; Flaviviridae; OC Flavivirus. OX NCBI_TaxID=11076; RN [1] RP SEQUENCE FROM N.A. RX MEDLINE=87236200; PubMed=3035787; RA McAda P.C., Mason P.W., Schmaljohn C.S., Dalrymple J.M., Mason T.L., RA Fournier M.J.; RT "Partial nucleotide sequence of the Japanese encephalitis virus RT genome."; RL Virology 158:348-360(1987). CC -!- FUNCTION: THE SMALL PROTEINS NS2A, NS2B, NS4A AND NS4B ARE CC HYDROPHOBIC, SUGGESTING A POSSIBLE MEMBRANE-RELATED FUNCTION. CC NS3 AND NS5 MAY PLAY A ROLE IN THE VIRAL RNA REPLICATION. CC -!- CATALYTIC ACTIVITY: Hydrolysis of four peptide bonds in the viral CC precursor polyprotein, commonly with Asp or Glu in the P6 CC position, Cys or Thr in P1 and Ser or Ala in P1'. CC -!- SUBUNIT: THE VIRION OF THIS VIRUS IS A NUCLEOCAPSID COVERED BY A CC LIPOPROTEIN ENVELOPE. THE ENVELOPE CONSISTS OF TWO PROTEINS: CC PROTEIN M AND GLYCOPROTEIN E. THE NUCLEOCAPSID IS A COMPLEX OF CC PROTEIN C AND MRNA. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; M16574; AAA46251.1; -. DR PIR; A27844; GNWVJF. DR HSSP; P14336; 1SVB. DR InterPro; IPR000069; Flavi_M. DR InterPro; IPR001157; Flavi_NS1. DR InterPro; IPR000752; Flavi_NS2A. DR InterPro; IPR000487; Flavi_NS2B. DR InterPro; IPR000336; Flavi_glycoprotE. DR InterPro; IPR002535; Flavi_propep. DR Pfam; PF00869; Flavi_glycoprot; 1. DR Pfam; PF02832; Flavi_glycop_C; 1. DR Pfam; PF01004; Flavi_M; 1. DR Pfam; PF00948; Flavi_NS1; 1. DR Pfam; PF01005; Flavi_NS2A; 1. DR Pfam; PF01002; Flavi_NS2B; 1. DR Pfam; PF01570; Flavi_propep; 1. DR ProDom; PD001496; Flavi_NS1; 1. DR ProDom; PD001556; Flavi_glycoprotE; 1. KW Polyprotein; Glycoprotein; Core protein; Coat protein; KW Envelope protein; Hydrolase; Helicase; ATP-binding; Transmembrane; KW Nonstructural protein. FT NON_TER 1 1 FT CHAIN <1 53 CAPSID PROTEIN C. FT PROPEP 54 146 FT CHAIN 147 222 ENVELOPE GLYCOPROTEIN M. FT CHAIN 223 794 MAJOR ENVELOPE PROTEIN E. FT CHAIN 795 1136 NONSTRUCTURAL PROTEIN NS1. FT CHAIN 1137 1301 NONSTRUCTURAL PROTEIN NS2A. FT CHAIN 1302 1432 NONSTRUCTURAL PROTEIN NS2B. FT CHAIN 1433 >1440 PROTEASE/HELICASE (NS3). FT DISULFID 225 252 BY SIMILARITY. FT DISULFID 282 338 BY SIMILARITY. FT DISULFID 296 327 BY SIMILARITY. FT DISULFID 314 343 BY SIMILARITY. FT DISULFID 412 509 BY SIMILARITY. FT DISULFID 526 557 BY SIMILARITY. FT CARBOHYD 68 68 N-LINKED (GLCNAC...) (POTENTIAL). FT CARBOHYD 376 376 N-LINKED (GLCNAC...) (POTENTIAL). FT CARBOHYD 852 852 N-LINKED (GLCNAC...) (POTENTIAL). FT CARBOHYD 929 929 N-LINKED (GLCNAC...) (POTENTIAL). FT NON_TER 1440 1440 SQ SEQUENCE 1440 AA; 158184 MW; 4D489A365A3C2E6E CRC64; SVAMKHLTSF KRELGTLIDA VNKRGRKQNK RGGNEGSIMW LASLAVVIAC AGAMKLSNFQ GKLLMTVNNT DIADVIVIPN PSKGENRCWV RAIDVGYMCE DTITYECPKL TMGNDPEDVD CWCDNQEVYV QYGRCTRTRH SKRSRRSVSV QTHGESSLVN KKEAWLDSTK ATRYLMKTEN WIVRNPGYAF LAAILGWMLG SNNGQRRWYF TILLLLVAPA YSFNCLGMGN RDFIEGASGA TWVDLVLEGD SCLTIMANDK PTLDVRMINI EAVQLAEVRS YCYHASVTDI STVARCPTTG EAHNEKRADS SYVCKQGFTD RGWGNGCGLF GKGSIDTCAK FSCTSKAIGR TIQPENIKYE VGIFVHGTTT SENHGNYSAQ VGASQAAKFT VTPNAPSITL KLGDYGEVTL DCEPRSGLNT EAFYVMTVGS KSFLVHREWF HDLALPWTPP SSTAWRNREL LMEFEEAHAT KQSVVALGSQ EGGLHQALAG AIVVEYSSSV KLTSGHLKCR LKMDKLALKG TTYGMCTEKF SFAKNPADTG HGTVVIELSY SGSDGPCKIP IVSVASLNDM TPVGRLVTVN PFVATSSANS KVLVEMEPPF GDSYIVVGRG DKQINHHWHK AGSTLGKAFS TTLKGAQRLA ALGDTAWDFG SIGGVFNSIG KAVHQVFGGA FRTLFGGMSW ITQGLMGALL LWMGVNARDR SIALAFLATG GVLVFLATNV HADTGCAIDI TRKEMRCGSG IFVHNDVEAW VDRYKYLPET PRSLAKIVHK AHKEGVCGVR SVTRLEHQMW EAVRDELNVL LKENAVDLSV VVNKPVGRYR SAPKRLSMTQ EKFEMGWKAW GKSILFAPEL ANSTFVVDGP ETKECPDEHR AWNSIEIEDF GFGITSTRVW LKIREESTDE CDGAIIGTAV KGHVAVHSDL SYWIESRYND TWKLERAVFG EVKSCTWPET HTLWGDGVEE SELIIPHTIA GPKSKHNRRE GYKTQNQGPW DENGIVLDFD YCPGTKVTIT EDCGKRGPSV RTTTDSGKLI TDWCCRSCSL PPLRFRTENG CWYGMEIRPV RHDETTLVRS QVDAFNGEMV DPFQLGLLVM FLATQEVLRK RWTARLTIPA VLGALLVLML GGITYTDLAR YVVLVAAAFA EANSGGDVLH LALIAVFKIQ PAFLVMNMLS TRWTNQENVV LVLGAAFFHL ASVDLQIGVH GILNAAAIAW MIVRAITFPT TSSVTMPVLA LLTPGMRALY LDTYRIILLV IGICSLLQER KKTMAKKKGA VLLGLALTST GWFSPTTIAA GLMVCNPNKK RGWPATEFLS AVGLMFAIVG GLAELDIESM SIPFMLAGLM AVSYVVSGKA TDMWLERAAD ISWEMDAAIT GSSRRLDVKL DDDGDFHLID DPGVPWKVWV LRMSCIGLAA LTPWAIVPAA FGYWLTLKTT KRGGVFWDTP //