ID POLG_WNV Reviewed; 3430 AA. AC P06935; DT 01-JAN-1988, integrated into UniProtKB/Swiss-Prot. DT 24-OCT-2003, sequence version 2. DT 02-OCT-2024, entry version 211. DE RecName: Full=Genome polyprotein; DE Contains: DE RecName: Full=Peptide 2k; DE Contains: DE RecName: Full=Capsid protein C; DE AltName: Full=Core protein; DE Contains: DE RecName: Full=Protein prM; DE Contains: DE RecName: Full=Peptide pr; DE Contains: DE RecName: Full=Small envelope protein M; DE AltName: Full=Matrix protein; DE Contains: DE RecName: Full=Envelope protein E; DE Contains: DE RecName: Full=Non-structural protein 1; DE Short=NS1; DE Contains: DE RecName: Full=Non-structural protein 2A; DE Short=NS2A; DE Contains: DE RecName: Full=Serine protease subunit NS2B; DE AltName: Full=Flavivirin protease NS2B regulatory subunit; DE AltName: Full=Non-structural protein 2B; DE Contains: DE RecName: Full=Serine protease/Helicase NS3; DE EC=3.4.21.91; DE EC=3.6.1.15 {ECO:0000250|UniProtKB:Q9Q6P4}; DE EC=3.6.4.13 {ECO:0000250|UniProtKB:Q9Q6P4}; DE AltName: Full=Flavivirin protease NS3 catalytic subunit; DE AltName: Full=Non-structural protein 3; DE Contains: DE RecName: Full=Non-structural protein 4A; DE Short=NS4A; DE Contains: DE RecName: Full=Non-structural protein 4B; DE Short=NS4B; DE Contains: DE RecName: Full=RNA-directed RNA polymerase NS5; DE EC=2.1.1.56 {ECO:0000255|PROSITE-ProRule:PRU00924}; DE EC=2.1.1.57 {ECO:0000255|PROSITE-ProRule:PRU00924}; DE EC=2.7.7.48 {ECO:0000255|PROSITE-ProRule:PRU00539}; DE AltName: Full=NS5; OS West Nile virus (WNV). OC Viruses; Riboviria; Orthornavirae; Kitrinoviricota; Flasuviricetes; OC Amarillovirales; Flaviviridae; Orthoflavivirus; Orthoflavivirus nilense. OX NCBI_TaxID=11082; OH NCBI_TaxID=7158; Aedes. OH NCBI_TaxID=8495; Alligator. OH NCBI_TaxID=34610; Amblyomma variegatum (Tropical bont tick). OH NCBI_TaxID=8782; Aves. OH NCBI_TaxID=53527; Culex. OH NCBI_TaxID=9796; Equus caballus (Horse). OH NCBI_TaxID=9606; Homo sapiens (Human). OH NCBI_TaxID=34627; Hyalomma marginatum. OH NCBI_TaxID=308735; Mansonia uniformis. OH NCBI_TaxID=308737; Mimomyia. OH NCBI_TaxID=9940; Ovis aries (Sheep). OH NCBI_TaxID=34630; Rhipicephalus. OH NCBI_TaxID=34861; Sciurus niger (Eastern fox squirrel). RN [1] RP NUCLEOTIDE SEQUENCE [GENOMIC RNA]. RX PubMed=3753811; DOI=10.1016/0042-6822(86)90082-6; RA Castle E., Leidner U., Nowak T., Wengler G., Wengler G.; RT "Primary structure of the West Nile flavivirus genome region coding for all RT nonstructural proteins."; RL Virology 149:10-26(1986). RN [2] RP SEQUENCE REVISION TO 1908; 2018-2036; 2242 AND 2859-2860. RX PubMed=11277701; DOI=10.1006/viro.2000.0795; RA Yamshchikov V.F., Wengler G., Perelygin A.A., Brinton M.A., Compans R.W.; RT "An infectious clone of the West Nile flavivirus."; RL Virology 281:294-304(2001). RN [3] RP NUCLEOTIDE SEQUENCE [GENOMIC RNA] OF 1-291. RX PubMed=2992152; DOI=10.1016/0042-6822(85)90156-4; RA Castle E., Nowak T., Leidner U., Wengler G., Wengler G.; RT "Sequence analysis of the viral core protein and the membrane-associated RT proteins V1 and NV2 of the flavivirus West Nile virus and of the genome RT sequence for these proteins."; RL Virology 145:227-236(1985). RN [4] RP NUCLEOTIDE SEQUENCE [GENOMIC RNA] OF 255-854. RX PubMed=3855247; DOI=10.1016/0042-6822(85)90129-1; RA Wengler G., Castle E., Leidner U., Nowak T., Wengler G.; RT "Sequence analysis of the membrane protein V3 of the flavivirus West Nile RT virus and of its gene."; RL Virology 147:264-274(1985). RN [5] RP ABSENCE OF GLYCOSYLATION (ENVELOPE PROTEIN E). RX PubMed=2441520; DOI=10.1016/0042-6822(87)90460-0; RA Winkler G., Heinz F.X., Kunz C.; RT "Studies on the glycosylation of flavivirus E proteins and the role of RT carbohydrate in antigenic structure."; RL Virology 159:237-243(1987). RN [6] RP DISULFIDE BONDS (ENVELOPE PROTEIN E). RX PubMed=3811228; DOI=10.1016/0042-6822(87)90443-0; RA Nowak T., Wengler G.; RT "Analysis of disulfides present in the membrane proteins of the West Nile RT flavivirus."; RL Virology 156:127-137(1987). RN [7] RP FUNCTION (ENVELOPE PROTEIN E). RX PubMed=15367621; DOI=10.1128/jvi.78.19.10543-10555.2004; RA Chu J.J., Ng M.L.; RT "Infectious entry of West Nile virus occurs through a clathrin-mediated RT endocytic pathway."; RL J. Virol. 78:10543-10555(2004). RN [8] RP SUBCELLULAR LOCATION (RNA-DIRECTED RNA POLYMERASE NS5). RC STRAIN=E101; RX PubMed=16699025; DOI=10.1128/jvi.01982-05; RA Uchil P.D., Kumar A.V., Satchidanandam V.; RT "Nuclear localization of flavivirus RNA synthesis in infected cells."; RL J. Virol. 80:5451-5464(2006). RN [9] RP FUNCTION (RNA-DIRECTED RNA POLYMERASE NS5), AND MUTAGENESIS OF LYS-2586; RP ASP-2671; LYS-2707 AND GLU-2743. RX PubMed=17267492; DOI=10.1128/jvi.02704-06; RA Zhou Y., Ray D., Zhao Y., Dong H., Ren S., Li Z., Guo Y., Bernard K.A., RA Shi P.-Y., Li H.; RT "Structure and function of flavivirus NS5 methyltransferase."; RL J. Virol. 81:3891-3903(2007). RN [10] RP PROTEOLYTIC PROCESSING (GENOME POLYPROTEIN). RX PubMed=17067286; DOI=10.1042/bj20061136; RA Shiryaev S.A., Kozlov I.A., Ratnikov B.I., Smith J.W., Lebl M., RA Strongin A.Y.; RT "Cleavage preference distinguishes the two-component NS2B-NS3 serine RT proteinases of Dengue and West Nile viruses."; RL Biochem. J. 401:743-752(2007). RN [11] RP INTERACTION WITH HUMAN EXOC1 (CAPSID PROTEIN C), AND SUBCELLULAR LOCATION RP (CAPSID PROTEIN C). RX PubMed=19889084; DOI=10.1111/j.1462-5822.2009.01407.x; RA Bhuvanakantham R., Li J., Tan T.T., Ng M.L.; RT "Human Sec3 protein is a novel transcriptional and translational repressor RT of flavivirus."; RL Cell. Microbiol. 12:453-472(2010). RN [12] RP INTERACTION WITH HUMAN EXOC1 (CAPSID PROTEIN C), FUNCTION (CAPSID PROTEIN RP C), AND MUTAGENESIS OF VAL-14. RX PubMed=23522008; DOI=10.1111/cmi.12143; RA Bhuvanakantham R., Ng M.L.; RT "West Nile virus and dengue virus capsid protein negates the antiviral RT activity of human Sec3 protein through the proteasome pathway."; RL Cell. Microbiol. 15:1688-1706(2013). RN [13] RP INTERACTION WITH HUMAN PAF1 COMPLEX (RNA-DIRECTED RNA POLYMERASE NS5). RX PubMed=30550790; DOI=10.1016/j.cell.2018.11.028; RA Shah P.S., Link N., Jang G.M., Sharp P.P., Zhu T., Swaney D.L., RA Johnson J.R., Von Dollen J., Ramage H.R., Satkamp L., Newton B., RA Huettenhain R., Petit M.J., Baum T., Everitt A., Laufman O., Tassetto M., RA Shales M., Stevenson E., Iglesias G.N., Shokat L., Tripathi S., RA Balasubramaniam V., Webb L.G., Aguirre S., Willsey A.J., Garcia-Sastre A., RA Pollard K.S., Cherry S., Gamarnik A.V., Marazzi I., Taunton J., RA Fernandez-Sesma A., Bellen H.J., Andino R., Krogan N.J.; RT "Comparative Flavivirus-Host Protein Interaction Mapping Reveals Mechanisms RT of Dengue and Zika Virus Pathogenesis."; RL Cell 175:1931-1945(2018). RN [14] RP DOMAIN (RNA-DIRECTED RNA POLYMERASE NS5). RX PubMed=33547379; DOI=10.1038/s41598-021-82751-x; RA Giraud E., Del Val C.O., Caillet-Saguy C., Zehrouni N., Khou C., RA Caillet J., Jacob Y., Pardigon N., Wolff N.; RT "Role of PDZ-binding motif from West Nile virus NS5 protein on viral RT replication."; RL Sci. Rep. 11:3266-3266(2021). RN [15] RP X-RAY CRYSTALLOGRAPHY (1.68 ANGSTROMS) OF 1420-1688. RX PubMed=16532006; DOI=10.1038/nsmb1073; RA Erbel P., Schiering N., D'Arcy A., Renatus M., Kroemer M., Lim S.P., RA Yin Z., Keller T.H., Vasudevan S.G., Hommel U.; RT "Structural basis for the activation of flaviviral NS3 proteases from RT dengue and West Nile virus."; RL Nat. Struct. Mol. Biol. 13:372-373(2006). RN [16] RP X-RAY CRYSTALLOGRAPHY (1.8 ANGSTROMS) OF 1419-1679. RX PubMed=17400917; DOI=10.1110/ps.072753207; RA Aleshin A.E., Shiryaev S.A., Strongin A.Y., Liddington R.C.; RT "Structural evidence for regulation and specificity of flaviviral proteases RT and evolution of the Flaviviridae fold."; RL Protein Sci. 16:795-806(2007). RN [17] RP X-RAY CRYSTALLOGRAPHY (3.0 ANGSTROMS) OF 291-688. RX PubMed=19713934; DOI=10.1038/emboj.2009.245; RA Cherrier M.V., Kaufmann B., Nybakken G.E., Lok S.M., Warren J.T., RA Chen B.R., Nelson C.A., Kostyuchenko V.A., Holdaway H.A., Chipman P.R., RA Kuhn R.J., Diamond M.S., Rossmann M.G., Fremont D.H.; RT "Structural basis for the preferential recognition of immature flaviviruses RT by a fusion-loop antibody."; RL EMBO J. 28:3269-3276(2009). CC -!- FUNCTION: [Capsid protein C]: Plays a role in virus budding by binding CC to the cell membrane and gathering the viral RNA into a nucleocapsid CC that forms the core of a mature virus particle (By similarity). During CC virus entry, may induce genome penetration into the host cytoplasm CC after hemifusion induced by the surface proteins (By similarity). Can CC migrate to the cell nucleus where it modulates host functions (By CC similarity). Overcomes the anti-viral effects of host EXOC1 by CC sequestering and degrading the latter through the proteasome CC degradation pathway (PubMed:23522008). {ECO:0000250|UniProtKB:P17763, CC ECO:0000269|PubMed:23522008}. CC -!- FUNCTION: [Capsid protein C]: Inhibits RNA silencing by interfering CC with host Dicer. {ECO:0000250|UniProtKB:P03314}. CC -!- FUNCTION: [Peptide pr]: Prevents premature fusion activity of envelope CC proteins in trans-Golgi by binding to envelope protein E at pH6.0. CC After virion release in extracellular space, gets dissociated from E CC dimers. {ECO:0000250|UniProtKB:P17763}. CC -!- FUNCTION: [Protein prM]: Acts as a chaperone for envelope protein E CC during intracellular virion assembly by masking and inactivating CC envelope protein E fusion peptide. prM is the only viral peptide CC matured by host furin in the trans-Golgi network probably to avoid CC catastrophic activation of the viral fusion activity in acidic Golgi CC compartment prior to virion release. prM-E cleavage is inefficient, and CC many virions are only partially matured. These uncleaved prM would play CC a role in immune evasion. {ECO:0000250|UniProtKB:P17763}. CC -!- FUNCTION: [Small envelope protein M]: May play a role in virus budding. CC Exerts cytotoxic effects by activating a mitochondrial apoptotic CC pathway through M ectodomain. May display a viroporin activity. CC {ECO:0000250|UniProtKB:P17763}. CC -!- FUNCTION: [Envelope protein E]: Binds to host cell surface receptor and CC mediates fusion between viral and cellular membranes (PubMed:15367621). CC Envelope protein is synthesized in the endoplasmic reticulum in the CC form of heterodimer with protein prM (By similarity). They play a role CC in virion budding in the ER, and the newly formed immature particle is CC covered with 60 spikes composed of heterodimer between precursor prM CC and envelope protein E (By similarity). The virion is transported to CC the Golgi apparatus where the low pH causes dissociation of PrM-E CC heterodimers and formation of E homodimers (By similarity). prM-E CC cleavage is inefficient, and many virions are only partially matured CC (By similarity). These uncleaved prM would play a role in immune CC evasion (By similarity). {ECO:0000250|UniProtKB:P17763}. CC -!- FUNCTION: [Non-structural protein 1]: Involved in immune evasion, CC pathogenesis and viral replication. Once cleaved off the polyprotein, CC is targeted to three destinations: the viral replication cycle, the CC plasma membrane and the extracellular compartment. Essential for viral CC replication. Required for formation of the replication complex and CC recruitment of other non-structural proteins to the ER-derived membrane CC structures. Excreted as a hexameric lipoparticle that plays a role CC against host immune response. Antagonizing the complement function. CC Binds to the host macrophages and dendritic cells. Inhibits signal CC transduction originating from Toll-like receptor 3 (TLR3). CC {ECO:0000250|UniProtKB:Q9Q6P4}. CC -!- FUNCTION: [Non-structural protein 2A]: Component of the viral RNA CC replication complex that functions in virion assembly and antagonizes CC the host alpha/beta interferon antiviral response. CC {ECO:0000250|UniProtKB:P14335}. CC -!- FUNCTION: [Serine protease subunit NS2B]: Required cofactor for the CC serine protease function of NS3. May have membrane-destabilizing CC activity and form viroporins (By similarity). CC {ECO:0000250|UniProtKB:P17763, ECO:0000255|PROSITE-ProRule:PRU00859}. CC -!- FUNCTION: [Serine protease/Helicase NS3]: Displays three enzymatic CC activities: serine protease, NTPase and RNA helicase. NS3 serine CC protease, in association with NS2B, performs its autocleavage and CC cleaves the polyprotein at dibasic sites in the cytoplasm: C-prM, NS2A- CC NS2B, NS2B-NS3, NS3-NS4A, NS4A-2K and NS4B-NS5. NS3 RNA helicase binds CC RNA and unwinds dsRNA in the 3' to 5' direction (By similarity). NS3 CC supports the separation of RNA daughter and template strands during CC viral replication. The helicase part is involved in the inhibition of CC phosphorylation of host STAT1, and thereby inhibition of host type-I CC IFN signaling (By similarity). In addition, NS3 assists the initiation CC of replication by unwinding the RNA secondary structure in the 3' non- CC translated region (NTR). Inhibits STAT2 translocation in the nucleus CC after IFN-alpha treatment (By similarity). CC {ECO:0000250|UniProtKB:P14335, ECO:0000250|UniProtKB:Q9Q6P4, CC ECO:0000255|PROSITE-ProRule:PRU00860}. CC -!- FUNCTION: [Non-structural protein 4A]: Regulates the ATPase activity of CC the NS3 helicase activity. NS4A allows NS3 helicase to conserve energy CC during unwinding. {ECO:0000250|UniProtKB:Q9Q6P4}. CC -!- FUNCTION: [Peptide 2k]: Functions as a signal peptide for NS4B and is CC required for the interferon antagonism activity of the latter. CC {ECO:0000250|UniProtKB:P17763}. CC -!- FUNCTION: [Non-structural protein 4B]: Induces the formation of ER- CC derived membrane vesicles where the viral replication takes place. CC Inhibits interferon (IFN)-induced host STAT1 phosphorylation and CC nuclear translocation, thereby preventing the establishment of cellular CC antiviral state by blocking the IFN-alpha/beta pathway. Inhibits STAT2 CC translocation in the nucleus after IFN-alpha treatment. CC {ECO:0000250|UniProtKB:Q9Q6P4}. CC -!- FUNCTION: [RNA-directed RNA polymerase NS5]: Replicates the viral (+) CC and (-) RNA genome, and performs the capping of genomes in the CC cytoplasm (PubMed:17267492). NS5 methylates viral RNA cap at guanine N- CC 7 and ribose 2'-O positions (PubMed:17267492). Besides its role in RNA CC genome replication, also prevents the establishment of cellular CC antiviral state by blocking the interferon-alpha/beta (IFN-alpha/beta) CC signaling pathway (By similarity). Inhibits host TYK2 and STAT2 CC phosphorylation, thereby preventing activation of JAK-STAT signaling CC pathway (By similarity). {ECO:0000250|UniProtKB:Q9Q6P4, CC ECO:0000269|PubMed:17267492}. CC -!- CATALYTIC ACTIVITY: CC Reaction=Selective hydrolysis of -Xaa-Xaa-|-Yaa- bonds in which each of CC the Xaa can be either Arg or Lys and Yaa can be either Ser or Ala.; CC EC=3.4.21.91; CC -!- CATALYTIC ACTIVITY: CC Reaction=a ribonucleoside 5'-triphosphate + RNA(n) = diphosphate + CC RNA(n+1); Xref=Rhea:RHEA:21248, Rhea:RHEA-COMP:14527, Rhea:RHEA- CC COMP:17342, ChEBI:CHEBI:33019, ChEBI:CHEBI:61557, ChEBI:CHEBI:140395; CC EC=2.7.7.48; Evidence={ECO:0000255|PROSITE-ProRule:PRU00539}; CC -!- CATALYTIC ACTIVITY: CC Reaction=a ribonucleoside 5'-triphosphate + H2O = a ribonucleoside 5'- CC diphosphate + H(+) + phosphate; Xref=Rhea:RHEA:23680, CC ChEBI:CHEBI:15377, ChEBI:CHEBI:15378, ChEBI:CHEBI:43474, CC ChEBI:CHEBI:57930, ChEBI:CHEBI:61557; EC=3.6.1.15; CC -!- CATALYTIC ACTIVITY: CC Reaction=ATP + H2O = ADP + H(+) + phosphate; Xref=Rhea:RHEA:13065, CC ChEBI:CHEBI:15377, ChEBI:CHEBI:15378, ChEBI:CHEBI:30616, CC ChEBI:CHEBI:43474, ChEBI:CHEBI:456216; EC=3.6.4.13; CC Evidence={ECO:0000250|UniProtKB:Q9Q6P4}; CC -!- CATALYTIC ACTIVITY: CC Reaction=a 5'-end (5'-triphosphoguanosine)-ribonucleoside in mRNA + S- CC adenosyl-L-methionine = a 5'-end (N(7)-methyl 5'- CC triphosphoguanosine)-ribonucleoside in mRNA + S-adenosyl-L- CC homocysteine; Xref=Rhea:RHEA:67008, Rhea:RHEA-COMP:17166, Rhea:RHEA- CC COMP:17167, ChEBI:CHEBI:57856, ChEBI:CHEBI:59789, ChEBI:CHEBI:156461, CC ChEBI:CHEBI:167617; EC=2.1.1.56; Evidence={ECO:0000255|PROSITE- CC ProRule:PRU00924}; CC -!- CATALYTIC ACTIVITY: CC Reaction=a 5'-end (N(7)-methyl 5'-triphosphoguanosine)-ribonucleoside CC in mRNA + S-adenosyl-L-methionine = a 5'-end (N(7)-methyl 5'- CC triphosphoguanosine)-(2'-O-methyl-ribonucleoside) in mRNA + H(+) + S- CC adenosyl-L-homocysteine; Xref=Rhea:RHEA:67020, Rhea:RHEA-COMP:17167, CC Rhea:RHEA-COMP:17168, ChEBI:CHEBI:15378, ChEBI:CHEBI:57856, CC ChEBI:CHEBI:59789, ChEBI:CHEBI:156461, ChEBI:CHEBI:167609; CC EC=2.1.1.57; Evidence={ECO:0000255|PROSITE-ProRule:PRU00924}; CC -!- SUBUNIT: [Capsid protein C]: Homodimer (By similarity). Interacts (via CC N-terminus) with host EXOC1 (via C-terminus) (PubMed:19889084, CC PubMed:23522008); this interaction results in EXOC1 degradation through CC the proteasome degradation pathway (PubMed:23522008). Interacts with CC host DDX56; this interaction plays an important role in genomic RNA CC encapsidation (By similarity). {ECO:0000250|UniProtKB:P17763, CC ECO:0000250|UniProtKB:Q9Q6P4, ECO:0000269|PubMed:19889084, CC ECO:0000269|PubMed:23522008}. CC -!- SUBUNIT: [Protein prM]: Forms heterodimers with envelope protein E in CC the endoplasmic reticulum and Golgi (By similarity). CC {ECO:0000250|UniProtKB:P17763}. CC -!- SUBUNIT: [Envelope protein E]: Homodimer; in the endoplasmic reticulum CC and Golgi (By similarity). {ECO:0000250|UniProtKB:P17763}. CC -!- SUBUNIT: [Non-structural protein 1]: Homodimer; Homohexamer when CC secreted (By similarity). Interacts with envelope protein E (By CC similarity). NS1 interacts with NS4B (By similarity). Interacts with CC host complement protein CFH; this interaction leads to the degradation CC of C3 (By similarity). {ECO:0000250|UniProtKB:Q9Q6P4}. CC -!- SUBUNIT: [Non-structural protein 2A]: Interacts (via N-terminus) with CC serine protease NS3. {ECO:0000250|UniProtKB:P03314}. CC -!- SUBUNIT: [Serine protease subunit NS2B]: Forms a heterodimer with CC serine protease NS3 (By similarity). May form homooligomers (By CC similarity). {ECO:0000250|UniProtKB:P17763}. CC -!- SUBUNIT: [Serine protease/Helicase NS3]: Forms a heterodimer with NS2B CC (By similarity). Interacts with NS4B (By similarity). Interacts with CC unphosphorylated RNA-directed RNA polymerase NS5; this interaction CC stimulates RNA-directed RNA polymerase NS5 guanylyltransferase activity CC (By similarity). {ECO:0000250|UniProtKB:P17763}. CC -!- SUBUNIT: [Non-structural protein 4B]: Interacts with Serine CC protease/Helicase NS3 (By similarity). Interacts with NS1 (By CC similarity). {ECO:0000250|UniProtKB:P17763, CC ECO:0000250|UniProtKB:Q9Q6P4}. CC -!- SUBUNIT: [RNA-directed RNA polymerase NS5]: Homodimer (By similarity). CC Interacts with host STAT2; this interaction inhibits the CC phosphorylation of the latter, and, when all viral proteins are present CC (polyprotein), targets STAT2 for degradation (By similarity). Interacts CC with host PAF1 complex (PubMed:30550790). CC {ECO:0000250|UniProtKB:P17763, ECO:0000269|PubMed:30550790}. CC -!- INTERACTION: CC P06935; P05106: ITGB3; Xeno; NbExp=4; IntAct=EBI-981051, EBI-702847; CC PRO_0000037746; Q17NZ6: 5563672; Xeno; NbExp=5; IntAct=EBI-2912469, EBI-2912457; CC -!- SUBCELLULAR LOCATION: [Capsid protein C]: Virion CC {ECO:0000250|UniProtKB:P17763}. Host nucleus CC {ECO:0000250|UniProtKB:P17763}. Host cytoplasm CC {ECO:0000269|PubMed:19889084}. Host cytoplasm, host perinuclear region CC {ECO:0000269|PubMed:19889084}. CC -!- SUBCELLULAR LOCATION: [Peptide pr]: Secreted CC {ECO:0000250|UniProtKB:P17763}. CC -!- SUBCELLULAR LOCATION: [Small envelope protein M]: Virion membrane CC {ECO:0000250|UniProtKB:P03314}; Multi-pass membrane protein CC {ECO:0000250|UniProtKB:P03314}. Host endoplasmic reticulum membrane CC {ECO:0000250|UniProtKB:P03314}; Multi-pass membrane protein CC {ECO:0000255}. Note=ER membrane retention is mediated by the CC transmembrane domains. {ECO:0000250|UniProtKB:P03314}. CC -!- SUBCELLULAR LOCATION: [Envelope protein E]: Virion membrane CC {ECO:0000305}; Multi-pass membrane protein CC {ECO:0000250|UniProtKB:P03314}. Host endoplasmic reticulum membrane CC {ECO:0000250|UniProtKB:P03314}; Multi-pass membrane protein CC {ECO:0000255}. Note=ER membrane retention is mediated by the CC transmembrane domains. {ECO:0000250|UniProtKB:P03314}. CC -!- SUBCELLULAR LOCATION: [Non-structural protein 1]: Secreted CC {ECO:0000250|UniProtKB:P17763}. Host endoplasmic reticulum membrane; CC Peripheral membrane protein; Lumenal side CC {ECO:0000250|UniProtKB:P17763}. Note=Located in RE-derived vesicles CC hosting the replication complex. {ECO:0000250|UniProtKB:Q9Q6P4}. CC -!- SUBCELLULAR LOCATION: [Non-structural protein 2A]: Host endoplasmic CC reticulum membrane {ECO:0000250|UniProtKB:P14335}; Multi-pass membrane CC protein {ECO:0000250|UniProtKB:P17763}. CC -!- SUBCELLULAR LOCATION: [Serine protease subunit NS2B]: Host endoplasmic CC reticulum membrane; Multi-pass membrane protein CC {ECO:0000250|UniProtKB:P17763}. CC -!- SUBCELLULAR LOCATION: [Serine protease/Helicase NS3]: Host endoplasmic CC reticulum membrane {ECO:0000255|PROSITE-ProRule:PRU00860}; Peripheral CC membrane protein {ECO:0000255|PROSITE-ProRule:PRU00860}; Cytoplasmic CC side {ECO:0000255|PROSITE-ProRule:PRU00860}. Note=Remains non- CC covalently associated to serine protease subunit NS2B. CC {ECO:0000255|PROSITE-ProRule:PRU00860}. CC -!- SUBCELLULAR LOCATION: [Non-structural protein 4A]: Host endoplasmic CC reticulum membrane {ECO:0000250|UniProtKB:P14335}; Multi-pass membrane CC protein {ECO:0000250|UniProtKB:P17763}. Note=Located in RE-associated CC vesicles hosting the replication complex. CC {ECO:0000250|UniProtKB:P17763}. CC -!- SUBCELLULAR LOCATION: [Non-structural protein 4B]: Host endoplasmic CC reticulum membrane {ECO:0000250|UniProtKB:P17763}; Multi-pass membrane CC protein {ECO:0000250|UniProtKB:P17763}. Note=Located in RE-derived CC vesicles hosting the replication complex. CC {ECO:0000250|UniProtKB:Q9Q6P4}. CC -!- SUBCELLULAR LOCATION: [RNA-directed RNA polymerase NS5]: Host CC endoplasmic reticulum membrane; Peripheral membrane protein; CC Cytoplasmic side. Host nucleus {ECO:0000269|PubMed:16699025}. Host CC cytoplasm {ECO:0000250|UniProtKB:P14335}. Note=Located in RE-associated CC vesicles hosting the replication complex. NS5 protein is mainly CC localized in the nucleus rather than in ER vesicles (By similarity). CC Shuttles between the cytoplasm and nucleus (By similarity). CC {ECO:0000250|UniProtKB:P14335, ECO:0000250|UniProtKB:P17763}. CC -!- DOMAIN: [Small envelope protein M]: The transmembrane domains contain CC an endoplasmic reticulum retention signal. CC {ECO:0000250|UniProtKB:P17763}. CC -!- DOMAIN: [Envelope protein E]: The transmembrane domains contain an CC endoplasmic reticulum retention signal. {ECO:0000250|UniProtKB:P17763}. CC -!- DOMAIN: [RNA-directed RNA polymerase NS5]: Contains a PDZ-binding motif CC that binds to several PDZ-containing cellular proteins. These CC interactions seem necessary for an optimal viral replication. CC {ECO:0000269|PubMed:33547379}. CC -!- PTM: [Genome polyprotein]: Specific enzymatic cleavages in vivo yield CC mature proteins. Cleavages in the lumen of endoplasmic reticulum are CC performed by host signal peptidase, whereas cleavages in the CC cytoplasmic side are performed by serine protease NS3. Signal cleavage CC at the 2K-4B site requires a prior NS3 protease-mediated cleavage at CC the 4A-2K site. {ECO:0000250|UniProtKB:P17763, CC ECO:0000269|PubMed:17067286}. CC -!- PTM: [Protein prM]: Cleaved in post-Golgi vesicles by a host furin, CC releasing the mature small envelope protein M, and peptide pr. This CC cleavage is incomplete as up to 30% of viral particles still carry CC uncleaved prM. {ECO:0000250|UniProtKB:P17763}. CC -!- PTM: [Envelope protein E]: Not N-glycosylated. CC {ECO:0000269|PubMed:2441520}. CC -!- PTM: [Non-structural protein 1]: N-glycosylated. The excreted form is CC glycosylated and this is required for efficient secretion of the CC protein from infected cells. {ECO:0000250|UniProtKB:P17763}. CC -!- PTM: [Serine protease/Helicase NS3]: Acetylated by host KAT5. CC Acetylation modulates NS3 RNA-binding and unwinding activities and CC plays an important positive role for viral replication. CC {ECO:0000250|UniProtKB:Q32ZE1}. CC -!- PTM: [RNA-directed RNA polymerase NS5]: Phosphorylated on serines CC residues. This phosphorylation may trigger NS5 nuclear localization. CC {ECO:0000250|UniProtKB:P17763}. CC -!- SIMILARITY: In the N-terminal section; belongs to the class I-like SAM- CC binding methyltransferase superfamily. mRNA cap 0-1 NS5-type CC methyltransferase family. {ECO:0000255|PROSITE-ProRule:PRU00924}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; M12294; AAA48498.2; -; Genomic_RNA. DR PIR; A25256; GNWVWV. DR RefSeq; NP_041724.2; NC_001563.2. DR PDB; 2FP7; X-ray; 1.68 A; A=1420-1466, B=1517-1688. DR PDB; 2GGV; X-ray; 1.80 A; A=1419-1525, B=1503-1679. DR PDB; 2IJO; X-ray; 2.30 A; A=1419-1482, B=1502-1685. DR PDB; 2P5P; X-ray; 2.80 A; A/B/C=585-701. DR PDB; 2YOL; X-ray; 3.20 A; A=1420-1465, A=1502-1671. DR PDB; 3E90; X-ray; 2.45 A; A/C=1420-1463, B/D=1502-1685. DR PDB; 3I50; X-ray; 3.00 A; E=291-688. DR PDB; 5IDK; X-ray; 1.50 A; A/B/C=1420-1465. DR PDBsum; 2FP7; -. DR PDBsum; 2GGV; -. DR PDBsum; 2IJO; -. DR PDBsum; 2P5P; -. DR PDBsum; 2YOL; -. DR PDBsum; 3E90; -. DR PDBsum; 3I50; -. DR PDBsum; 5IDK; -. DR BMRB; P06935; -. DR SMR; P06935; -. DR IntAct; P06935; 4. DR BindingDB; P06935; -. DR ChEMBL; CHEMBL5419; -. DR MEROPS; S07.003; -. DR ABCD; P06935; 11 sequenced antibodies. DR GeneID; 912267; -. DR KEGG; vg:912267; -. DR BRENDA; 2.7.7.48; 6687. DR BRENDA; 3.4.21.91; 6687. DR EvolutionaryTrace; P06935; -. DR PRO; PR:P06935; -. DR Proteomes; UP000008600; Genome. DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell. DR GO; GO:0044167; C:host cell endoplasmic reticulum membrane; IEA:UniProtKB-SubCell. DR GO; GO:0042025; C:host cell nucleus; IEA:UniProtKB-SubCell. DR GO; GO:0044220; C:host cell perinuclear region of cytoplasm; IEA:UniProtKB-SubCell. DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW. DR GO; GO:0032993; C:protein-DNA complex; IMP:CAFA. DR GO; GO:1990904; C:ribonucleoprotein complex; IMP:CAFA. DR GO; GO:0019028; C:viral capsid; IEA:UniProtKB-KW. DR GO; GO:0019031; C:viral envelope; IEA:UniProtKB-KW. DR GO; GO:0055036; C:virion membrane; IEA:UniProtKB-SubCell. DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW. DR GO; GO:0016887; F:ATP hydrolysis activity; IEA:RHEA. DR GO; GO:0003677; F:DNA binding; IMP:CAFA. DR GO; GO:1990814; F:DNA/DNA annealing activity; IMP:CAFA. DR GO; GO:0003725; F:double-stranded RNA binding; IEA:InterPro. DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW. DR GO; GO:0004483; F:mRNA (nucleoside-2'-O-)-methyltransferase activity; IDA:UniProtKB. DR GO; GO:0004482; F:mRNA 5'-cap (guanine-N7-)-methyltransferase activity; IEA:UniProtKB-EC. DR GO; GO:0008233; F:peptidase activity; IDA:CACAO. DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro. DR GO; GO:0003723; F:RNA binding; IDA:DisProt. DR GO; GO:0140691; F:RNA folding chaperone; IDA:DisProt. DR GO; GO:0003724; F:RNA helicase activity; IEA:UniProtKB-EC. DR GO; GO:0033592; F:RNA strand annealing activity; IMP:CAFA. DR GO; GO:0003968; F:RNA-dependent RNA polymerase activity; IEA:UniProtKB-KW. DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro. DR GO; GO:0005198; F:structural molecule activity; IEA:InterPro. DR GO; GO:0075512; P:clathrin-dependent endocytosis of virus by host cell; IEA:UniProtKB-KW. DR GO; GO:0039654; P:fusion of virus membrane with host endosome membrane; IEA:UniProtKB-KW. DR GO; GO:0039520; P:induction by virus of host autophagy; IEA:UniProtKB-KW. DR GO; GO:0045070; P:positive regulation of viral genome replication; IMP:CACAO. DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW. DR GO; GO:0106005; P:RNA 5'-cap (guanine-N7)-methylation; IDA:UniProtKB. DR GO; GO:0043489; P:RNA stabilization; IMP:CAFA. DR GO; GO:0033668; P:symbiont-mediated suppression of host apoptosis; IDA:CACAO. DR GO; GO:0039574; P:symbiont-mediated suppression of host JAK-STAT cascade via inhibition of host TYK2 activity; IEA:UniProtKB-KW. DR GO; GO:0039576; P:symbiont-mediated suppression of host JAK-STAT cascade via inhibition of JAK1 activity; IEA:UniProtKB-KW. DR GO; GO:0039563; P:symbiont-mediated suppression of host JAK-STAT cascade via inhibition of STAT1 activity; IEA:UniProtKB-KW. DR GO; GO:0039564; P:symbiont-mediated suppression of host JAK-STAT cascade via inhibition of STAT2 activity; IEA:UniProtKB-KW. DR GO; GO:0039502; P:symbiont-mediated suppression of host type I interferon-mediated signaling pathway; IEA:UniProtKB-KW. DR GO; GO:0039694; P:viral RNA genome replication; IEA:InterPro. DR GO; GO:0019062; P:virion attachment to host cell; IEA:UniProtKB-KW. DR GO; GO:0019049; P:virus-mediated perturbation of host defense response; IEA:UniProtKB-KW. DR CDD; cd20761; capping_2-OMTase_Flaviviridae; 1. DR CDD; cd17931; DEXHc_viral_Ns3; 1. DR CDD; cd12149; Flavi_E_C; 1. DR CDD; cd17038; Flavi_M; 1. DR CDD; cd23204; Flavivirus_RdRp; 1. DR CDD; cd18806; SF2_C_viral; 1. DR DisProt; DP00673; -. DR Gene3D; 1.10.10.930; -; 1. DR Gene3D; 1.10.260.90; -; 1. DR Gene3D; 1.20.1280.260; -; 1. DR Gene3D; 2.40.10.120; -; 2. DR Gene3D; 2.60.40.350; -; 1. DR Gene3D; 1.10.8.970; Flavivirus envelope glycoprotein M-like; 1. DR Gene3D; 2.60.260.50; Flavivirus polyprotein propeptide domain; 1. DR Gene3D; 3.30.70.2840; Flavivirus RNA-directed RNA polymerase, thumb domain; 3. DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 2. DR Gene3D; 2.60.98.10; Tick-borne Encephalitis virus Glycoprotein, domain 1; 1. DR Gene3D; 3.40.50.150; Vaccinia Virus protein VP39; 1. DR Gene3D; 3.30.67.10; Viral Envelope Glycoprotein, domain 2; 1. DR Gene3D; 3.30.387.10; Viral Envelope Glycoprotein, domain 3; 1. DR InterPro; IPR043502; DNA/RNA_pol_sf. DR InterPro; IPR000069; Env_glycoprot_M_flavivir. DR InterPro; IPR038302; Env_glycoprot_M_sf_flavivir. DR InterPro; IPR013755; Flav_gly_cen_dom_subdom1. DR InterPro; IPR001122; Flavi_capsidC. DR InterPro; IPR037172; Flavi_capsidC_sf. DR InterPro; IPR011492; Flavi_DEAD. DR InterPro; IPR027287; Flavi_E_Ig-like. DR InterPro; IPR026470; Flavi_E_Stem/Anchor_dom. DR InterPro; IPR038345; Flavi_E_Stem/Anchor_dom_sf. DR InterPro; IPR011998; Flavi_Glycoprot_E_cen/dimer. DR InterPro; IPR001157; Flavi_NS1. DR InterPro; IPR000752; Flavi_NS2A. DR InterPro; IPR000487; Flavi_NS2B. DR InterPro; IPR001850; Flavi_NS3_S7. DR InterPro; IPR000404; Flavi_NS4A. DR InterPro; IPR001528; Flavi_NS4B. DR InterPro; IPR046811; Flavi_NS5_thumb. DR InterPro; IPR002535; Flavi_propep. DR InterPro; IPR038688; Flavi_propep_sf. DR InterPro; IPR047530; Flavi_RdRp. DR InterPro; IPR000208; Flavi_RdRp_fingers/palm. DR InterPro; IPR000336; Flavivir/Alphavir_Ig-like_sf. DR InterPro; IPR014412; Gen_Poly_FLV. DR InterPro; IPR036253; Glycoprot_cen/dimer_sf. DR InterPro; IPR038055; Glycoprot_E_dimer_dom. DR InterPro; IPR013756; GlyE_cen_dom_subdom2. DR InterPro; IPR014001; Helicase_ATP-bd. DR InterPro; IPR001650; Helicase_C-like. DR InterPro; IPR014756; Ig_E-set. DR InterPro; IPR026490; mRNA_cap_0/1_MeTrfase. DR InterPro; IPR049486; NS3-hel_C_flaviviridae. DR InterPro; IPR027417; P-loop_NTPase. DR InterPro; IPR009003; Peptidase_S1_PA. DR InterPro; IPR007094; RNA-dir_pol_PSvirus. DR InterPro; IPR002877; RNA_MeTrfase_FtsJ_dom. DR InterPro; IPR029063; SAM-dependent_MTases_sf. DR NCBIfam; TIGR04240; flavi_E_stem; 1. DR Pfam; PF20907; Flav_NS3-hel_C; 1. DR Pfam; PF01003; Flavi_capsid; 1. DR Pfam; PF07652; Flavi_DEAD; 1. DR Pfam; PF21659; Flavi_E_stem; 1. DR Pfam; PF02832; Flavi_glycop_C; 1. DR Pfam; PF00869; Flavi_glycoprot; 1. DR Pfam; PF01004; Flavi_M; 1. DR Pfam; PF00948; Flavi_NS1; 1. DR Pfam; PF01005; Flavi_NS2A; 1. DR Pfam; PF01002; Flavi_NS2B; 1. DR Pfam; PF01350; Flavi_NS4A; 1. DR Pfam; PF01349; Flavi_NS4B; 1. DR Pfam; PF00972; Flavi_NS5; 1. DR Pfam; PF20483; Flavi_NS5_thumb; 1. DR Pfam; PF01570; Flavi_propep; 1. DR Pfam; PF01728; FtsJ; 1. DR Pfam; PF00949; Peptidase_S7; 1. DR PIRSF; PIRSF003817; Gen_Poly_FLV; 1. DR SMART; SM00487; DEXDc; 1. DR SMART; SM00490; HELICc; 1. DR SUPFAM; SSF56672; DNA/RNA polymerases; 1. DR SUPFAM; SSF81296; E set domains; 1. DR SUPFAM; SSF101257; Flavivirus capsid protein C; 1. DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 2. DR SUPFAM; SSF53335; S-adenosyl-L-methionine-dependent methyltransferases; 1. DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1. DR SUPFAM; SSF56983; Viral glycoprotein, central and dimerisation domains; 1. DR PROSITE; PS51527; FLAVIVIRUS_NS2B; 1. DR PROSITE; PS51528; FLAVIVIRUS_NS3PRO; 1. DR PROSITE; PS51192; HELICASE_ATP_BIND_1; 1. DR PROSITE; PS51194; HELICASE_CTER; 1. DR PROSITE; PS50507; RDRP_SSRNA_POS; 1. DR PROSITE; PS51591; RNA_CAP01_NS5_MT; 1. PE 1: Evidence at protein level; KW 3D-structure; Acetylation; Activation of host autophagy by virus; KW ATP-binding; Capsid protein; KW Clathrin-mediated endocytosis of virus by host; KW Cleavage on pair of basic residues; Disulfide bond; KW Fusion of virus membrane with host endosomal membrane; KW Fusion of virus membrane with host membrane; Glycoprotein; Helicase; KW Host cytoplasm; Host endoplasmic reticulum; Host membrane; Host nucleus; KW Host-virus interaction; Hydrolase; KW Inhibition of host innate immune response by virus; KW Inhibition of host interferon signaling pathway by virus; KW Inhibition of host JAK1 by virus; Inhibition of host STAT1 by virus; KW Inhibition of host STAT2 by virus; Inhibition of host TYK2 by virus; KW Interferon antiviral system evasion; Membrane; Metal-binding; KW Methyltransferase; mRNA capping; mRNA processing; Multifunctional enzyme; KW Nucleotide-binding; Nucleotidyltransferase; Phosphoprotein; Protease; KW RNA-binding; RNA-directed RNA polymerase; S-adenosyl-L-methionine; KW Secreted; Serine protease; Suppressor of RNA silencing; Transcription; KW Transcription regulation; Transferase; Transmembrane; Transmembrane helix; KW Viral attachment to host cell; Viral envelope protein; Viral immunoevasion; KW Viral penetration into host cytoplasm; Viral RNA replication; Virion; KW Virus endocytosis by host; Virus entry into host cell; Zinc. FT CHAIN 1..3430 FT /note="Genome polyprotein" FT /id="PRO_0000441418" FT CHAIN 1..105 FT /note="Capsid protein C" FT /evidence="ECO:0000269|PubMed:17067286" FT /id="PRO_0000037743" FT PROPEP 106..123 FT /note="ER anchor for the capsid protein C, removed in FT mature form by serine protease NS3" FT /evidence="ECO:0000269|PubMed:17067286" FT /id="PRO_0000037744" FT CHAIN 124..290 FT /note="Protein prM" FT /evidence="ECO:0000269|PubMed:17067286" FT /id="PRO_0000405150" FT CHAIN 124..215 FT /note="Peptide pr" FT /evidence="ECO:0000269|PubMed:17067286" FT /id="PRO_0000405151" FT CHAIN 216..290 FT /note="Small envelope protein M" FT /evidence="ECO:0000269|PubMed:17067286" FT /id="PRO_0000037745" FT CHAIN 291..787 FT /note="Envelope protein E" FT /evidence="ECO:0000269|PubMed:17067286" FT /id="PRO_0000037746" FT CHAIN 788..1139 FT /note="Non-structural protein 1" FT /evidence="ECO:0000269|PubMed:17067286" FT /id="PRO_0000037747" FT CHAIN 1140..1370 FT /note="Non-structural protein 2A" FT /evidence="ECO:0000269|PubMed:17067286" FT /id="PRO_0000037748" FT CHAIN 1371..1501 FT /note="Serine protease subunit NS2B" FT /evidence="ECO:0000269|PubMed:17067286" FT /id="PRO_0000037749" FT CHAIN 1502..2120 FT /note="Serine protease/Helicase NS3" FT /evidence="ECO:0000269|PubMed:17067286" FT /id="PRO_0000037750" FT CHAIN 2121..2246 FT /note="Non-structural protein 4A" FT /evidence="ECO:0000269|PubMed:17067286" FT /id="PRO_0000037751" FT PEPTIDE 2247..2269 FT /note="Peptide 2k" FT /evidence="ECO:0000269|PubMed:17067286" FT /id="PRO_0000405152" FT CHAIN 2270..2525 FT /note="Non-structural protein 4B" FT /evidence="ECO:0000269|PubMed:17067286" FT /id="PRO_0000037752" FT CHAIN 2526..3430 FT /note="RNA-directed RNA polymerase NS5" FT /evidence="ECO:0000269|PubMed:17067286" FT /id="PRO_0000037753" FT TOPO_DOM 2..105 FT /note="Cytoplasmic" FT /evidence="ECO:0000255" FT TRANSMEM 106..126 FT /note="Helical" FT /evidence="ECO:0000255" FT TOPO_DOM 127..248 FT /note="Extracellular" FT /evidence="ECO:0000255" FT TRANSMEM 249..269 FT /note="Helical" FT /evidence="ECO:0000255" FT TOPO_DOM 270..275 FT /note="Cytoplasmic" FT /evidence="ECO:0000255" FT TRANSMEM 276..290 FT /note="Helical" FT /evidence="ECO:0000305" FT TOPO_DOM 291..739 FT /note="Extracellular" FT /evidence="ECO:0000255" FT TRANSMEM 740..760 FT /note="Helical" FT /evidence="ECO:0000255" FT TOPO_DOM 761..766 FT /note="Cytoplasmic" FT /evidence="ECO:0000255" FT TRANSMEM 767..787 FT /note="Helical" FT /evidence="ECO:0000255" FT TOPO_DOM 788..1212 FT /note="Extracellular" FT /evidence="ECO:0000255" FT TRANSMEM 1213..1233 FT /note="Helical" FT /evidence="ECO:0000255" FT TOPO_DOM 1234..1243 FT /note="Cytoplasmic" FT /evidence="ECO:0000255" FT TRANSMEM 1244..1264 FT /note="Helical" FT /evidence="ECO:0000255" FT TOPO_DOM 1265..1278 FT /note="Lumenal" FT /evidence="ECO:0000255" FT TRANSMEM 1279..1299 FT /note="Helical" FT /evidence="ECO:0000255" FT TOPO_DOM 1300..1307 FT /note="Cytoplasmic" FT /evidence="ECO:0000255" FT TRANSMEM 1308..1328 FT /note="Helical" FT /evidence="ECO:0000255" FT TOPO_DOM 1329..1340 FT /note="Lumenal" FT /evidence="ECO:0000255" FT TRANSMEM 1341..1361 FT /note="Helical" FT /evidence="ECO:0000255" FT TOPO_DOM 1362..1371 FT /note="Cytoplasmic" FT /evidence="ECO:0000255" FT TRANSMEM 1372..1392 FT /note="Helical" FT /evidence="ECO:0000255" FT TOPO_DOM 1393..1395 FT /note="Lumenal" FT /evidence="ECO:0000255" FT TRANSMEM 1396..1416 FT /note="Helical" FT /evidence="ECO:0000255" FT TOPO_DOM 1417..1473 FT /note="Cytoplasmic" FT /evidence="ECO:0000255" FT INTRAMEM 1474..1494 FT /note="Helical" FT /evidence="ECO:0000255" FT TOPO_DOM 1495..2170 FT /note="Cytoplasmic" FT /evidence="ECO:0000255" FT TRANSMEM 2171..2191 FT /note="Helical" FT /evidence="ECO:0000255" FT TOPO_DOM 2192..2196 FT /note="Lumenal" FT /evidence="ECO:0000255" FT INTRAMEM 2197..2217 FT /note="Helical" FT /evidence="ECO:0000255" FT TOPO_DOM 2218 FT /note="Lumenal" FT /evidence="ECO:0000255" FT TRANSMEM 2219..2239 FT /note="Helical" FT /evidence="ECO:0000255" FT TOPO_DOM 2240..2254 FT /note="Cytoplasmic" FT /evidence="ECO:0000255" FT TRANSMEM 2255..2275 FT /note="Helical; Note=Signal for NS4B" FT /evidence="ECO:0000255" FT TOPO_DOM 2276..2309 FT /note="Lumenal" FT /evidence="ECO:0000255" FT INTRAMEM 2310..2330 FT /note="Helical" FT /evidence="ECO:0000255" FT TOPO_DOM 2331..2377 FT /note="Lumenal" FT /evidence="ECO:0000255" FT TRANSMEM 2378..2398 FT /note="Helical" FT /evidence="ECO:0000255" FT TOPO_DOM 2399..2441 FT /note="Cytoplasmic" FT /evidence="ECO:0000255" FT TRANSMEM 2442..2462 FT /note="Helical" FT /evidence="ECO:0000255" FT TOPO_DOM 2463..2467 FT /note="Lumenal" FT /evidence="ECO:0000255" FT TRANSMEM 2468..2488 FT /note="Helical" FT /evidence="ECO:0000255" FT TOPO_DOM 2489..3430 FT /note="Cytoplasmic" FT /evidence="ECO:0000255" FT DOMAIN 1502..1679 FT /note="Peptidase S7" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00860" FT DOMAIN 1682..1838 FT /note="Helicase ATP-binding" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00541" FT DOMAIN 1849..2014 FT /note="Helicase C-terminal" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00542" FT DOMAIN 2526..2791 FT /note="mRNA cap 0-1 NS5-type MT" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924" FT DOMAIN 3055..3207 FT /note="RdRp catalytic" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00539" FT REGION 2..15 FT /note="Interaction with host EXOC1" FT /evidence="ECO:0000269|PubMed:19889084" FT REGION 37..72 FT /note="Hydrophobic; homodimerization of capsid protein C" FT /evidence="ECO:0000250|UniProtKB:P29990" FT REGION 388..401 FT /note="Fusion peptide" FT /evidence="ECO:0000250|UniProtKB:P14336" FT REGION 1424..1463 FT /note="Interacts with and activates NS3 protease" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00859" FT REGION 1686..1689 FT /note="Important for RNA-binding" FT /evidence="ECO:0000250|UniProtKB:P14340" FT REGION 2165..2169 FT /note="Regulates the ATPase activity of NS3 helicase" FT /evidence="ECO:0000250|UniProtKB:Q9Q6P4" FT MOTIF 1786..1789 FT /note="DEAH box" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00541" FT MOTIF 2914..2916 FT /note="Nuclear localization signal" FT /evidence="ECO:0000250|UniProtKB:P14335" FT MOTIF 3428..3430 FT /note="PDZ-binding" FT /evidence="ECO:0000269|PubMed:33547379" FT ACT_SITE 1552 FT /note="Charge relay system; for serine protease NS3 FT activity" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00860" FT ACT_SITE 1576 FT /note="Charge relay system; for serine protease NS3 FT activity" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00860" FT ACT_SITE 1636 FT /note="Charge relay system; for serine protease NS3 FT activity" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00860" FT ACT_SITE 2586 FT /note="For 2'-O-MTase activity" FT /evidence="ECO:0000250|UniProtKB:Q6YMS4" FT ACT_SITE 2671 FT /note="For 2'-O-MTase activity" FT /evidence="ECO:0000250|UniProtKB:Q6YMS4" FT ACT_SITE 2707 FT /note="For 2'-O-MTase activity" FT /evidence="ECO:0000250|UniProtKB:Q6YMS4" FT ACT_SITE 2743 FT /note="For 2'-O-MTase activity" FT /evidence="ECO:0000250|UniProtKB:Q6YMS4" FT BINDING 1695..1702 FT /ligand="ATP" FT /ligand_id="ChEBI:CHEBI:30616" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00541" FT BINDING 2581 FT /ligand="S-adenosyl-L-methionine" FT /ligand_id="ChEBI:CHEBI:59789" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924" FT BINDING 2611 FT /ligand="S-adenosyl-L-methionine" FT /ligand_id="ChEBI:CHEBI:59789" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924" FT BINDING 2612 FT /ligand="S-adenosyl-L-methionine" FT /ligand_id="ChEBI:CHEBI:59789" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924" FT BINDING 2629 FT /ligand="S-adenosyl-L-methionine" FT /ligand_id="ChEBI:CHEBI:59789" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924" FT BINDING 2630 FT /ligand="S-adenosyl-L-methionine" FT /ligand_id="ChEBI:CHEBI:59789" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924" FT BINDING 2656 FT /ligand="S-adenosyl-L-methionine" FT /ligand_id="ChEBI:CHEBI:59789" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924" FT BINDING 2657 FT /ligand="S-adenosyl-L-methionine" FT /ligand_id="ChEBI:CHEBI:59789" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924" FT BINDING 2672 FT /ligand="S-adenosyl-L-methionine" FT /ligand_id="ChEBI:CHEBI:59789" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924" FT BINDING 2745 FT /ligand="S-adenosyl-L-methionine" FT /ligand_id="ChEBI:CHEBI:59789" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924" FT BINDING 2965 FT /ligand="Zn(2+)" FT /ligand_id="ChEBI:CHEBI:29105" FT /ligand_label="1" FT /evidence="ECO:0000250|UniProtKB:P14335" FT BINDING 2969 FT /ligand="Zn(2+)" FT /ligand_id="ChEBI:CHEBI:29105" FT /ligand_label="1" FT /evidence="ECO:0000250|UniProtKB:P14335" FT BINDING 2974 FT /ligand="Zn(2+)" FT /ligand_id="ChEBI:CHEBI:29105" FT /ligand_label="1" FT /evidence="ECO:0000250|UniProtKB:P14335" FT BINDING 2977 FT /ligand="Zn(2+)" FT /ligand_id="ChEBI:CHEBI:29105" FT /ligand_label="1" FT /evidence="ECO:0000250|UniProtKB:P14335" FT BINDING 3242 FT /ligand="Zn(2+)" FT /ligand_id="ChEBI:CHEBI:29105" FT /ligand_label="2" FT /evidence="ECO:0000250|UniProtKB:P14335" FT BINDING 3258 FT /ligand="Zn(2+)" FT /ligand_id="ChEBI:CHEBI:29105" FT /ligand_label="2" FT /evidence="ECO:0000250|UniProtKB:P14335" FT BINDING 3377 FT /ligand="Zn(2+)" FT /ligand_id="ChEBI:CHEBI:29105" FT /ligand_label="2" FT /evidence="ECO:0000250|UniProtKB:P14335" FT SITE 105..106 FT /note="Cleavage; by viral protease NS3" FT /evidence="ECO:0000269|PubMed:17067286" FT SITE 123..124 FT /note="Cleavage; by host signal peptidase" FT /evidence="ECO:0000269|PubMed:17067286" FT SITE 215..216 FT /note="Cleavage; by host furin" FT /evidence="ECO:0000303|PubMed:17067286" FT SITE 290..291 FT /note="Cleavage; by host signal peptidase" FT /evidence="ECO:0000303|PubMed:17067286" FT SITE 787..788 FT /note="Cleavage; by host signal peptidase" FT /evidence="ECO:0000303|PubMed:17067286" FT SITE 1139..1140 FT /note="Cleavage; by host" FT /evidence="ECO:0000303|PubMed:17067286" FT SITE 1370..1371 FT /note="Cleavage; by viral protease NS3" FT /evidence="ECO:0000269|PubMed:17067286" FT SITE 1501..1502 FT /note="Cleavage; by autolysis" FT /evidence="ECO:0000269|PubMed:17067286" FT SITE 1959 FT /note="Involved in NS3 ATPase and RTPase activities" FT /evidence="ECO:0000250|UniProtKB:P14335" FT SITE 1962 FT /note="Involved in NS3 ATPase and RTPase activities" FT /evidence="ECO:0000250|UniProtKB:P14335" FT SITE 2120..2121 FT /note="Cleavage; by autolysis" FT /evidence="ECO:0000303|PubMed:17067286" FT SITE 2246..2247 FT /note="Cleavage; by viral protease NS3" FT /evidence="ECO:0000303|PubMed:17067286" FT SITE 2269..2270 FT /note="Cleavage; by host signal peptidase" FT /evidence="ECO:0000303|PubMed:17067286" FT SITE 2525..2526 FT /note="Cleavage; by viral protease NS3" FT /evidence="ECO:0000269|PubMed:17067286" FT SITE 2538 FT /note="mRNA cap binding" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924" FT SITE 2541 FT /note="mRNA cap binding; via carbonyl oxygen" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924" FT SITE 2542 FT /note="mRNA cap binding" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924" FT SITE 2544 FT /note="mRNA cap binding; via carbonyl oxygen" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924" FT SITE 2549 FT /note="mRNA cap binding" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924, FT ECO:0000269|PubMed:17067286" FT SITE 2553 FT /note="mRNA cap binding" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924" FT SITE 2586 FT /note="Essential for 2'-O-methyltransferase activity" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924" FT SITE 2671 FT /note="Essential for 2'-O-methyltransferase and N-7 FT methyltransferase activity" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924" FT SITE 2675 FT /note="mRNA cap binding" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924" FT SITE 2707 FT /note="Essential for 2'-O-methyltransferase activity" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924" FT SITE 2738 FT /note="mRNA cap binding" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924" FT SITE 2740 FT /note="mRNA cap binding" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924" FT SITE 2743 FT /note="Essential for 2'-O-methyltransferase activity" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00924" FT MOD_RES 1890 FT /note="N6-acetyllysine; by host" FT /evidence="ECO:0000250|UniProtKB:Q32ZE1" FT MOD_RES 2581 FT /note="Phosphoserine" FT /evidence="ECO:0000250|UniProtKB:P03314" FT CARBOHYD 138 FT /note="N-linked (GlcNAc...) asparagine; by host" FT /evidence="ECO:0000250|UniProtKB:P14335" FT CARBOHYD 917 FT /note="N-linked (GlcNAc...) asparagine; by host" FT /evidence="ECO:0000250|UniProtKB:Q9Q6P4" FT CARBOHYD 962 FT /note="N-linked (GlcNAc...) asparagine; by host" FT /evidence="ECO:0000250|UniProtKB:Q9Q6P4" FT CARBOHYD 994 FT /note="N-linked (GlcNAc...) asparagine; by host" FT /evidence="ECO:0000250|UniProtKB:Q9Q6P4" FT DISULFID 293..320 FT /evidence="ECO:0000269|PubMed:3811228" FT DISULFID 350..406 FT /evidence="ECO:0000269|PubMed:3811228" FT DISULFID 364..395 FT /evidence="ECO:0000269|PubMed:3811228" FT DISULFID 382..411 FT /evidence="ECO:0000269|PubMed:3811228" FT DISULFID 476..574 FT /evidence="ECO:0000269|PubMed:3811228" FT DISULFID 591..622 FT /evidence="ECO:0000269|PubMed:3811228" FT DISULFID 791..802 FT /evidence="ECO:0000250|UniProtKB:Q9Q6P4" FT DISULFID 842..930 FT /evidence="ECO:0000250|UniProtKB:Q9Q6P4" FT DISULFID 966..1010 FT /evidence="ECO:0000250|UniProtKB:Q9Q6P4" FT DISULFID 1067..1116 FT /evidence="ECO:0000250|UniProtKB:Q9Q6P4" FT DISULFID 1078..1099 FT /evidence="ECO:0000250|UniProtKB:Q9Q6P4" FT DISULFID 1100..1103 FT /evidence="ECO:0000250|UniProtKB:Q9Q6P4" FT MUTAGEN 14 FT /note="V->A: Loss of interaction between capsid protein C FT and host EXOC1." FT /evidence="ECO:0000269|PubMed:23522008" FT MUTAGEN 2586 FT /note="K->A: Complete loss of 2'-O-methyltransferase FT activity. No effect on N-7 methyltransferase activity." FT /evidence="ECO:0000269|PubMed:17267492" FT MUTAGEN 2671 FT /note="D->A: Lethal for the virus. Complete loss of 2'-O FT and N-7 methyltransferase activities." FT /evidence="ECO:0000269|PubMed:17267492" FT MUTAGEN 2707 FT /note="K->A: Complete loss of 2'-O-methyltransferase FT activity. No effect on N-7 methyltransferase activity." FT /evidence="ECO:0000269|PubMed:17267492" FT MUTAGEN 2743 FT /note="E->A: Complete loss of 2'-O-methyltransferase FT activity. No effect on N-7 methyltransferase activity." FT /evidence="ECO:0000269|PubMed:17267492" FT STRAND 595..603 FT /evidence="ECO:0007829|PDB:2P5P" FT STRAND 605..607 FT /evidence="ECO:0007829|PDB:2P5P" FT STRAND 609..615 FT /evidence="ECO:0007829|PDB:2P5P" FT STRAND 621..623 FT /evidence="ECO:0007829|PDB:2P5P" FT STRAND 626..631 FT /evidence="ECO:0007829|PDB:2P5P" FT STRAND 634..637 FT /evidence="ECO:0007829|PDB:2P5P" FT STRAND 639..643 FT /evidence="ECO:0007829|PDB:2P5P" FT STRAND 657..662 FT /evidence="ECO:0007829|PDB:2P5P" FT STRAND 665..673 FT /evidence="ECO:0007829|PDB:2P5P" FT STRAND 679..685 FT /evidence="ECO:0007829|PDB:2P5P" FT STRAND 1422..1428 FT /evidence="ECO:0007829|PDB:5IDK" FT HELIX 1434..1439 FT /evidence="ECO:0007829|PDB:2GGV" FT STRAND 1440..1443 FT /evidence="ECO:0007829|PDB:2GGV" FT STRAND 1444..1449 FT /evidence="ECO:0007829|PDB:5IDK" FT STRAND 1451..1453 FT /evidence="ECO:0007829|PDB:2YOL" FT STRAND 1455..1457 FT /evidence="ECO:0007829|PDB:5IDK" FT STRAND 1503..1505 FT /evidence="ECO:0007829|PDB:5IDK" FT STRAND 1521..1532 FT /evidence="ECO:0007829|PDB:5IDK" FT STRAND 1534..1543 FT /evidence="ECO:0007829|PDB:5IDK" FT STRAND 1546..1549 FT /evidence="ECO:0007829|PDB:5IDK" FT HELIX 1551..1554 FT /evidence="ECO:0007829|PDB:5IDK" FT STRAND 1559..1561 FT /evidence="ECO:0007829|PDB:5IDK" FT STRAND 1564..1566 FT /evidence="ECO:0007829|PDB:5IDK" FT STRAND 1568..1572 FT /evidence="ECO:0007829|PDB:5IDK" FT TURN 1573..1576 FT /evidence="ECO:0007829|PDB:5IDK" FT STRAND 1577..1583 FT /evidence="ECO:0007829|PDB:5IDK" FT STRAND 1592..1594 FT /evidence="ECO:0007829|PDB:5IDK" FT STRAND 1596..1600 FT /evidence="ECO:0007829|PDB:5IDK" FT STRAND 1608..1612 FT /evidence="ECO:0007829|PDB:5IDK" FT STRAND 1615..1618 FT /evidence="ECO:0007829|PDB:5IDK" FT STRAND 1623..1627 FT /evidence="ECO:0007829|PDB:5IDK" FT HELIX 1633..1635 FT /evidence="ECO:0007829|PDB:2FP7" FT STRAND 1639..1641 FT /evidence="ECO:0007829|PDB:5IDK" FT STRAND 1647..1650 FT /evidence="ECO:0007829|PDB:5IDK" FT STRAND 1654..1656 FT /evidence="ECO:0007829|PDB:5IDK" FT STRAND 1662..1665 FT /evidence="ECO:0007829|PDB:5IDK" FT STRAND 1672..1674 FT /evidence="ECO:0007829|PDB:2IJO" SQ SEQUENCE 3430 AA; 380110 MW; 42D71B7CB12DC45B CRC64; MSKKPGGPGK NRAVNMLKRG MPRGLSLIGL KRAMLSLIDG KGPIRFVLAL LAFFRFTAIA PTRAVLDRWR GVNKQTAMKH LLSFKKELGT LTSAINRRST KQKKRGGTAG FTILLGLIAC AGAVTLSNFQ GKVMMTVNAT DVTDVITIPT AAGKNLCIVR AMDVGYLCED TITYECPVLA AGNDPEDIDC WCTKSSVYVR YGRCTKTRHS RRSRRSLTVQ THGESTLANK KGAWLDSTKA TRYLVKTESW ILRNPGYALV AAVIGWMLGS NTMQRVVFAI LLLLVAPAYS FNCLGMSNRD FLEGVSGATW VDLVLEGDSC VTIMSKDKPT IDVKMMNMEA ANLADVRSYC YLASVSDLST RAACPTMGEA HNEKRADPAF VCKQGVVDRG WGNGCGLFGK GSIDTCAKFA CTTKATGWII QKENIKYEVA IFVHGPTTVE SHGKIGATQA GRFSITPSAP SYTLKLGEYG EVTVDCEPRS GIDTSAYYVM SVGEKSFLVH REWFMDLNLP WSSAGSTTWR NRETLMEFEE PHATKQSVVA LGSQEGALHQ ALAGAIPVEF SSNTVKLTSG HLKCRVKMEK LQLKGTTYGV CSKAFKFART PADTGHGTVV LELQYTGTDG PCKVPISSVA SLNDLTPVGR LVTVNPFVSV ATANSKVLIE LEPPFGDSYI VVGRGEQQIN HHWHKSGSSI GKAFTTTLRG AQRLAALGDT AWDFGSVGGV FTSVGKAIHQ VFGGAFRSLF GGMSWITQGL LGALLLWMGI NARDRSIAMT FLAVGGVLLF LSVNVHADTG CAIDIGRQEL RCGSGVFIHN DVEAWMDRYK FYPETPQGLA KIIQKAHAEG VCGLRSVSRL EHQMWEAIKD ELNTLLKENG VDLSVVVEKQ NGMYKAAPKR LAATTEKLEM GWKAWGKSII FAPELANNTF VIDGPETEEC PTANRAWNSM EVEDFGFGLT STRMFLRIRE TNTTECDSKI IGTAVKNNMA VHSDLSYWIE SGLNDTWKLE RAVLGEVKSC TWPETHTLWG DGVLESDLII PITLAGPRSN HNRRPGYKTQ NQGPWDEGRV EIDFDYCPGT TVTISDSCEH RGPAARTTTE SGKLITDWCC RSCTLPPLRF QTENGCWYGM EIRPTRHDEK TLVQSRVNAY NADMIDPFQL GLMVVFLATQ EVLRKRWTAK ISIPAIMLAL LVLVFGGITY TDVLRYVILV GAAFAEANSG GDVVHLALMA TFKIQPVFLV ASFLKARWTN QESILLMLAA AFFQMAYYDA KNVLSWEVPD VLNSLSVAWM ILRAISFTNT SNVVVPLLAL LTPGLKCLNL DVYRILLLMV GVGSLIKEKR SSAAKKKGAC LICLALASTG VFNPMILAAG LMACDPNRKR GWPATEVMTA VGLMFAIVGG LAELDIDSMA IPMTIAGLMF AAFVISGKST DMWIERTADI TWESDAEITG SSERVDVRLD DDGNFQLMND PGAPWKIWML RMACLAISAY TPWAILPSVI GFWITLQYTK RGGVLWDTPS PKEYKKGDTT TGVYRIMTRG LLGSYQAGAG VMVEGVFHTL WHTTKGAALM SGEGRLDPYW GSVKEDRLCY GGPWKLQHKW NGHDEVQMIV VEPGKNVKNV QTKPGVFKTP EGEIGAVTLD YPTGTSGSPI VDKNGDVIGL YGNGVIMPNG SYISAIVQGE RMEEPAPAGF EPEMLRKKQI TVLDLHPGAG KTRKILPQII KEAINKRLRT AVLAPTRVVA AEMSEALRGL PIRYQTSAVH REHSGNEIVD VMCHATLTHR LMSPHRVPNY NLFIMDEAHF TDPASIAARG YIATKVELGE AAAIFMTATP PGTSDPFPES NAPISDMQTE IPDRAWNTGY EWITEYVGKT VWFVPSVKMG NEIALCLQRA GKKVIQLNRK SYETEYPKCK NDDWDFVITT DISEMGANFK ASRVIDSRKS VKPTIIEEGD GRVILGEPSA ITAASAAQRR GRIGRNPSQV GDEYCYGGHT NEDDSNFAHW TEARIMLDNI NMPNGLVAQL YQPEREKVYT MDGEYRLRGE ERKNFLEFLR TADLPVWLAY KVAAAGISYH DRKWCFDGPR TNTILEDNNE VEVITKLGER KILRPRWADA RVYSDHQALK SFKDFASGKR SQIGLVEVLG RMPEHFMVKT WEALDTMYVV ATAEKGGRAH RMALEELPDA LQTIVLIALL SVMSLGVFFL LMQRKGIGKI GLGGVILGAA TFFCWMAEVP GTKIAGMLLL SLLLMIVLIP EPEKQRSQTD NQLAVFLICV LTLVGAVAAN EMGWLDKTKN DIGSLLGHRP EARETTLGVE SFLLDLRPAT AWSLYAVTTA VLTPLLKHLI TSDYINTSLT SINVQASALF TLARGFPFVD VGVSALLLAV GCWGQVTLTV TVTAAALLFC HYAYMVPGWQ AEAMRSAQRR TAAGIMKNVV VDGIVATDVP ELERTTPVMQ KKVGQIILIL VSMAAVVVNP SVRTVREAGI LTTAAAVTLW ENGASSVWNA TTAIGLCHIM RGGWLSCLSI MWTLIKNMEK PGLKRGGAKG RTLGEVWKER LNHMTKEEFT RYRKEAITEV DRSAAKHARR EGNITGGHPV SRGTAKLRWL VERRFLEPVG KVVDLGCGRG GWCYYMATQK RVQEVKGYTK GGPGHEEPQL VQSYGWNIVT MKSGVDVFYR PSEASDTLLC DIGESSSSAE VEEHRTVRVL EMVEDWLHRG PKEFCIKVLC PYMPKVIEKM ETLQRRYGGG LIRNPLSRNS THEMYWVSHA SGNIVHSVNM TSQVLLGRME KKTWKGPQFE EDVNLGSGTR AVGKPLLNSD TSKIKNRIER LKKEYSSTWH QDANHPYRTW NYHGSYEVKP TGSASSLVNG VVRLLSKPWD TITNVTTMAM TDTTPFGQQR VFKEKVDTKA PEPPEGVKYV LNETTNWLWA FLARDKKPRM CSREEFIGKV NSNAALGAMF EEQNQWKNAR EAVEDPKFWE MVDEEREAHL RGECNTCIYN MMGKREKKPG EFGKAKGSRA IWFMWLGARF LEFEALGFLN EDHWLGRKNS GGGVEGLGLQ KLGYILKEVG TKPGGKVYAD DTAGWDTRIT KADLENEAKV LELLDGEHRR LARSIIELTY RHKVVKVMRP AADGKTVMDV ISREDQRGSG QVVTYALNTF TNLAVQLVRM MEGEGVIGPD DVEKLGKGKG PKVRTWLFEN GEERLSRMAV SGDDCVVKPL DDRFATSLHF LNAMSKVRKD IQEWKPSTGW YDWQQVPFCS NHFTELIMKD GRTLVVPCRG QDELIGRARI SPGAGWNVRD TACLAKSYAQ MWLLLYFHRR DLRLMANAIC SAVPANWVPT GRTTWSIHAK GEWMTTEDML AVWNRVWIEE NEWMEDKTPV ERWSDVPYSG KREDIWCGSL IGTRTRATWA ENIHVAINQV RSVIGEEKYV DYMSSLRRYE DTIVVEDTVL //