ID   POLG_BVDVN     STANDARD;      PRT;  3988 AA.
AC   P19711;
DT   01-FEB-1991 (Rel. 17, Created)
DT   01-FEB-1996 (Rel. 33, Last sequence update)
DT   01-FEB-1996 (Rel. 33, Last annotation update)
DE   Genome polyprotein.
OS   Bovine viral diarrhea virus (isolate NADL) (BVDV) (Mucosal disease
OS   virus).
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Flaviviridae;
OC   Pestivirus.
OX   NCBI_TaxID=11100;
RN   [1]
RP   SEQUENCE FROM N.A.
RX   MEDLINE=88265858; PubMed=2838957;
RA   Collett M.S., Larson R., Gold C., Strick D., Anderson D.K.,
RA   Purchio A.F.;
RT   "Molecular cloning and nucleotide sequence of the pestivirus bovine
RT   viral diarrhea virus.";
RL   Virology 165:191-199(1988).
RN   [2]
RP   GENOMIC ORGANIZATION.
RX   MEDLINE=88265859; PubMed=2838958;
RA   Collett M.S., Larson R., Belzer S.K., Retzel E.;
RT   "Proteins encoded by bovine viral diarrhea virus: the genomic
RT   organization of a pestivirus.";
RL   Virology 165:200-208(1988).
CC   -!- FUNCTION: Pestivirus p80 (p125) may be a bifunctional protein
CC       with helicase and protease activity.
CC   -!- PTM: GP116 gives rise to GP62 and GP53; GP62 in turn yields GP48
CC       and GP25.
CC   -!- SIMILARITY: To the HOG cholera virus genome polyprotein.
CC   -!- SIMILARITY: The protease belongs to peptidase family S31.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; M31182; AAA42854.1; -.
DR   PIR; A29198; GNWVBV.
DR   HSSP; P27958; 1A1V.
DR   MEROPS; C53.001; -.
DR   MEROPS; S31.001; -.
DR   InterPro; IPR001410; DEAD.
DR   InterPro; IPR002166; HCV_RdRP.
DR   InterPro; IPR001650; Helicase_C.
DR   InterPro; IPR008751; Peptidase_C53.
DR   InterPro; IPR000280; Peptidase_S31.
DR   InterPro; IPR007095; RNA_pol_DS_PS.
DR   InterPro; IPR007094; RNA_pol_PSvir.
DR   InterPro; IPR001568; RNase_T2.
DR   Pfam; PF00271; helicase_C; 1.
DR   Pfam; PF05550; Peptidase_C53; 1.
DR   Pfam; PF05578; Peptidase_S31; 1.
DR   Pfam; PF00998; Viral_RdRP; 1.
DR   PRINTS; PR00729; CDVENDOPTASE.
DR   SMART; SM00487; DEXDc; 1.
DR   SMART; SM00490; HELICc; 1.
DR   PROSITE; PS00531; RNASE_T2_2; UNKNOWN_1.
KW   Polyprotein; Glycoprotein; Helicase; Serine protease; Hydrolase.
FT   CHAIN         1   ?270       P20 (30KD).
FT   CHAIN      ?271  ?1063       GP116/GP62-GP53 (GLYCOPROTEIN).
FT   CHAIN         ?      ?       GP125/GP54-GP80.
FT   CHAIN         ?   3988       GP133/GP58-GP75.
FT   CARBOHYD    272    272       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    281    281       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    296    296       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    335    335       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    365    365       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    370    370       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    413    413       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    487    487       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    597    597       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    809    809       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    878    878       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    922    922       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    990    990       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD   1357   1357       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD   1419   1419       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD   1451   1451       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD   1803   1803       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD   2224   2224       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD   2307   2307       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD   2584   2584       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD   2772   2772       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD   2981   2981       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD   3778   3778       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD   3867   3867       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD   3883   3883       N-linked (GlcNAc...) (Potential).
SQ   SEQUENCE   3988 AA;  449154 MW;  4474212F338661B8 CRC64;
     MELITNELLY KTYKQKPVGV EEPVYDQAGD PLFGERGAVH PQSTLKLPHK RGERDVPTNL
     ASLPKRGDCR SGNSRGPVSG IYLKPGPLFY QDYKGPVYHR APLELFEEGS MCETTKRIGR
     VTGSDGKLYH IYVCIDGCII IKSATRSYQR VFRWVHNRLD CPLWVTTCSD TKEEGATKKK
     TQKPDRLERG KMKIVPKESE KDSKTKPPDA TIVVEGVKYQ VRKKGKTKSK NTQDGLYHNK
     NKPQESRKKL EKALLAWAII AIVLFQVTMG ENITQWNLQD NGTEGIQRAM FQRGVNRSLH
     GIWPEKICTG VPSHLATDIE LKTIHGMMDA SEKTNYTCCR LQRHEWNKHG WCNWYNIEPW
     ILVMNRTQAN LTEGQPPREC AVTCRYDRAS DLNVVTQARD SPTPLTGCKK GKNFSFAGIL
     MRGPCNFEIA ASDVLFKEHE RISMFQDTTL YLVDGLTNSL EGARQGTAKL TTWLGKQLGI
     LGKKLENKSK TWFGAYAASP YCDVDRKIGY IWYTKNCTPA CLPKNTKIVG PGKFGTNAED
     GKILHEMGGH LSEVLLLSLV VLSDFAPETA SVMYLILHFS IPQSHVDVMD CDKTQLNLTV
     ELTTAEVIPG SVWNLGKYVC IRPNWWPYET TVVLAFEEVS QVVKLVLRAL RDLTRIWNAA
     TTTAFLVCLV KIVRGQMVQG ILWLLLITGV QGHLDCKPEF SYAIAKDERI GQLGAEGLTT
     TWKEYSPGMK LEDTMVIAWC EDGKLMYLQR CTRETRYLAI LHTRALPTSV VFKKLFDGRK
     QEDVVEMNDN FEFGLCPCDA KPIVRGKFNT TLLNGPAFQM VCPIGWTGTV SCTSFNMDTL
     ATTVVRTYRR SKPFPHRQGC ITQKNLGEDL HNCILGGNWT CVPGDQLLYK GGSIESCKWC
     GYQFKESEGL PHYPIGKCKL ENETGYRLVD STSCNREGVA IVPQGTLKCK IGKTTVQVIA
     MDTKLGPMPC RPYEIISSEG PVEKTACTFN YTKTLKNKYF EPRDSYFQQY MLKGEYQYWF
     DLEVTDHHRD YFAESILVVV VALLGGRYVL WLLVTYMVLS EQKALGIQYG SGEVVMMGNL
     LTHNNIEVVT YFLLLYLLLR EESVKKWVLL LYHILVVHPI KSVIVILLMI GDVVKADSGG
     QEYLGKIDLC FTTVVLIVIG LIIARRDPTI VPLVTIMAAL RVTELTHQPG VDIAVAVMTI
     TLLMVSYVTD YFRYKKWLQC ILSLVSAVFL IRSLIYLGRI EMPEVTIPNW RPLTLILLYL
     ISTTIVTRWK VDVAGLLLQC VPILLLVTTL WADFLTLILI LPTYELVKLY YLKTVRTDTE
     RSWLGGIDYT RVDSIYDVDE SGEGVYLFPS RQKAQGNFSI LLPLIKATLI SCVSSKWQLI
     YMSYLTLDFM YYMHRKVIEE ISGGTNIISR LVAALIELNW SMEEEESKGL KKFYLLSGRL
     RNLIIKHKVR NETVASWYGE EEVYGMPKIM TIIKASTLSK SRHCIICTVC EGREWKGGTC
     PKCGRHGKPI TCGMSLADFE ERHYKRIFIR EGNFEGMCSR CQGKHRRFEM DREPKSARYC
     AECNRLHPAE EGDFWAESSM LGLKITYFAL MDGKVYDITE WAGCQRVGIS PDTHRVPCHI
     SFGSRMPFRQ EYNGFVQYTA RGQLFLRNLP VLATKVKMLM VGNLGEEIGN LEHLGWILRG
     PAVCKKITEH EKCHINILDK LTAFFGIMPR GTTPRAPVRF PTSLLKVRRG LETAWAYTHQ
     GGISSVDHVT AGKDLLVCDS MGRTRVVCQS NNRLTDETEY GVKTDSGCPD GARCYVLNPE
     AVNISGSKGA VVHLQKTGGE FTCVTASGTP AFFDLKNLKG WSGLPIFEAS SGRVVGRVKV
     GKNEESKPTK IMSGIQTVSK NRADLTEMVK KITSMNRGDF KQITLATGAG KTTELPKAVI
     EEIGRHKRVL VLIPLRAAAE SVYQYMRLKH PSISFNLRIG DMKEGDMATG ITYASYGYFC
     QMPQPKLRAA MVEYSYIFLD EYHCATPEQL AIIGKIHRFS ESIRVVAMTA TPAGSVTTTG
     QKHPIEEFIA PEVMKGEDLG SQFLDIAGLK IPVDEMKGNM LVFVPTRNMA VEVAKKLKAK
     GYNSGYYYSG EDPANLRVVT SQSPYVIVAT NAIESGVTLP DLDTVIDTGL KCEKRVRVSS
     KIPFIVTGLK RMAVTVGEQA QRRGRVGRVK PGRYYRSQET ATGSKDYHYD LLQAQRYGIE
     DGINVTKSFR EMNYDWSLYE EDSLLITQLE ILNNLLISED LPAAVKNIMA RTDHPEPIQL
     AYNSYEVQVP VLFPKIRNGE VTDTYENYSF LNARKLGEDV PVYIYATEDE DLAVDLLGLD
     WPDPGNQQVV ETGKALKQVT GLSSAENALL VALFGYVGYQ ALSKRHVPMI TDIYTIEDQR
     LEDTTHLQYA PNAIKTDGTE TELKELASGD VEKIMGAISD YAAGGLEFVK SQAEKIKTAP
     LFKENAEAAK GYVQKFIDSL IENKEEIIRY GLWGTHTALY KSIAARLGHE TAFATLVLKW
     LAFGGESVSD HVKQAAVDLV VYYVMNKPSF PGDSETQQEG RRFVASLFIS ALATYTYKTW
     NYHNLSKVVE PALAYLPYAT SALKMFTPTR LESVVILSTT IYKTYLSIRK GKSDGLLGTG
     ISAAMEILSQ NPVSVGISVM LGVGAIAAHN AIESSEQKRT LLMKVFVKNF LDQAATDELV
     KENPEKIIMA LFEAVQTIGN PLRLIYHLYG VYYKGWEAKE LSERTAGRNL FTLIMFEAFE
     LLGMDSQGKI RNLSGNYILD LIYGLHKQIN RGLKKMVLGW APAPFSCDWT PSDERIRLPT
     DNYLRVETRC PCGYEMKAFK NVGGKLTKVE ESGPFLCRNR PGRGPVNYRV TKYYDDNLRE
     IKPVAKLEGQ VEHYYKGVTA KIDYSKGKML LATDKWEVEH GVITRLAKRY TGVGFNGAYL
     GDEPNHRALV ERDCATITKN TVQFLKMKKG CAFTYDLTIS NLTRLIELVH RNNLEEKEIP
     TATVTTWLAY TFVNEDVGTI KPVLGERVIP DPVVDINLQP EVQVDTSEVG ITIIGRETLM
     TTGVTPVLEK VEPDASDNQN SVKIGLDEGN YPGPGIQTHT LTEEIHNRDA RPFIMILGSR
     NSISNRAKTA RNINLYTGND PREIRDLMAA GRMLVVALRD VDPELSEMVD FKGTFLDREA
     LEALSLGQPK PKQVTKEAVR NLIEQKKDVE IPNWFASDDP VFLEVALKND KYYLVGDVGE
     LKDQAKALGA TDQTRIIKEV GSRTYAMKLS SWFLKASNKQ MSLTPLFEEL LLRCPPATKS
     NKGHMASAYQ LAQGNWEPLG CGVHLGTIPA RRVKIHPYEA YLKLKDFIEE EEKKPRVKDT
     VIREHNKWIL KKIRFQGNLN TKKMLNPGKL SEQLDREGRK RNIYNHQIGT IMSSAGIRLE
     KLPIVRAQTD TKTFHEAIRD KIDKSENRQN PELHNKLLEI FHTIAQPTLK HTYGEVTWEQ
     LEAGVNRKGA AGFLEKKNIG EVLDSEKHLV EQLVRDLKAG RKIKYYETAI PKNEKRDVSD
     DWQAGDLVVE KRPRVIQYPE AKTRLAITKV MYNWVKQQPV VIPGYEGKTP LFNIFDKVRK
     EWDSFNEPVA VSFDTKAWDT QVTSKDLQLI GEIQKYYYKK EWHKFIDTIT DHMTEVPVIT
     ADGEVYIRNG QRGSGQPDTS AGNSMLNVLT MMYGFCESTG VPYKSFNRVA RIHVCGDDGF
     LITEKGLGLK FANKGMQILH EAGKPQKITE GEKMKVAYRF EDIEFCSHTP VPVRWSDNTS
     SHMAGRDTAV ILSKMATRLD SSGERGTTAY EKAVAFSFLL MYSWNPLVRR ICLLVLSQQP
     ETDPSKHATY YYKGDPIGAY KDVIGRNLSE LKRTGFEKLA NLNLSLSTLG VWTKHTSKRI
     IQDCVAIGKE EGNWLVKPDR LISSKTGHLY IPDKGFTLQG KHYEQLQLRT ETNPVMGVGT
     ERYKLGPIVN LLLRRLKILL MTAVGVSS
//