ID   A0A7D4WTF8_9ALPC        Unreviewed;      4117 AA.
AC   A0A7D4WTF8;
DT   02-DEC-2020, integrated into UniProtKB/TrEMBL.
DT   02-DEC-2020, sequence version 1.
DT   10-FEB-2021, entry version 2.
DE   RecName: Full=3C-like proteinase {ECO:0000256|ARBA:ARBA00016921};
DE            EC=3.4.19.12 {ECO:0000256|ARBA:ARBA00012759};
DE   AltName: Full=Growth factor-like peptide {ECO:0000256|ARBA:ARBA00021930};
DE   AltName: Full=M-PRO {ECO:0000256|ARBA:ARBA00020069};
DE   AltName: Full=Non-structural protein 1 {ECO:0000256|ARBA:ARBA00016002};
DE   AltName: Full=Non-structural protein 10 {ECO:0000256|ARBA:ARBA00019725};
DE   AltName: Full=Non-structural protein 2 {ECO:0000256|ARBA:ARBA00016256};
DE   AltName: Full=Non-structural protein 3 {ECO:0000256|ARBA:ARBA00016254};
DE   AltName: Full=Non-structural protein 4 {ECO:0000256|ARBA:ARBA00016253};
DE   AltName: Full=Non-structural protein 6 {ECO:0000256|ARBA:ARBA00016249};
DE   AltName: Full=Non-structural protein 7 {ECO:0000256|ARBA:ARBA00016247};
DE   AltName: Full=Non-structural protein 8 {ECO:0000256|ARBA:ARBA00016245};
DE   AltName: Full=Non-structural protein 9 {ECO:0000256|ARBA:ARBA00016242};
DE   AltName: Full=PL1-PRO/PL2-PRO {ECO:0000256|ARBA:ARBA00018156};
DE   AltName: Full=PLP1/PLP2 {ECO:0000256|ARBA:ARBA00019319};
DE   AltName: Full=Papain-like proteinases 1/2 {ECO:0000256|ARBA:ARBA00016127};
DE   AltName: Full=Peptide HD2 {ECO:0000256|ARBA:ARBA00013695};
DE   AltName: Full=nsp5 {ECO:0000256|ARBA:ARBA00020487};
DE   AltName: Full=p12 {ECO:0000256|ARBA:ARBA00015478, ECO:0000256|ARBA:ARBA00016731};
DE   AltName: Full=p195 {ECO:0000256|ARBA:ARBA00014441};
DE   AltName: Full=p23 {ECO:0000256|ARBA:ARBA00015540, ECO:0000256|ARBA:ARBA00016713};
DE   AltName: Full=p34 {ECO:0000256|ARBA:ARBA00016597};
DE   AltName: Full=p5 {ECO:0000256|ARBA:ARBA00013606};
DE   AltName: Full=p87 {ECO:0000256|ARBA:ARBA00014747};
DE   AltName: Full=p9 {ECO:0000256|ARBA:ARBA00013598};
GN   Name=ORF1a {ECO:0000313|EMBL:QKV43822.1};
OS   Porcine epidemic diarrhea virus.
OC   Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
OC   Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae;
OC   Alphacoronavirus; Pedacovirus.
OX   NCBI_TaxID=28295 {ECO:0000313|EMBL:QKV43822.1};
RN   [1] {ECO:0000313|EMBL:QKV43822.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEDV-2118-1-Orense-Covelas {ECO:0000313|EMBL:QKV43822.1};
RA   de Nova P.J., Cortey M., Diaz I., Rubio P., Martin M., Carvajal A.;
RL   Submitted (NOV-2019) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EMBL:QKV43822.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=PEDV-2118-1-Orense-Covelas {ECO:0000313|EMBL:QKV43822.1};
RX   PubMed=32511876;
RA   de Nova P.J.G., Cortey M., Diaz I., Puente H., Rubio P., Martin M.,
RA   Carvajal A.;
RT   "A retrospective study of porcine epidemic diarrhoea virus (PEDV) reveals
RT   the presence of swine enteric coronavirus (SeCoV) since 1993 and the recent
RT   introduction of a recombinant PEDV-SeCoV in Spain.";
RL   Transbound. Emerg. Dis. 0:0-0(2020).
CC   -!- FUNCTION: Nsp7-nsp8 hexadecamer may possibly confer processivity to the
CC       polymerase, maybe by binding to dsRNA or by producing primers utilized
CC       by the latter. {ECO:0000256|ARBA:ARBA00002928}.
CC   -!- FUNCTION: Nsp9 is a ssRNA-binding protein.
CC       {ECO:0000256|ARBA:ARBA00003140}.
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=Thiol-dependent hydrolysis of ester, thioester, amide, peptide
CC         and isopeptide bonds formed by the C-terminal Gly of ubiquitin (a 76-
CC         residue protein attached to proteins as an intracellular targeting
CC         signal).; EC=3.4.19.12; Evidence={ECO:0000256|ARBA:ARBA00000707};
CC   -!- SUBCELLULAR LOCATION: Host cytoplasm, host perinuclear region
CC       {ECO:0000256|ARBA:ARBA00004407}. Host membrane
CC       {ECO:0000256|ARBA:ARBA00004301}; Multi-pass membrane protein
CC       {ECO:0000256|ARBA:ARBA00004301}. Membrane
CC       {ECO:0000256|ARBA:ARBA00004141}; Multi-pass membrane protein
CC       {ECO:0000256|ARBA:ARBA00004141}.
CC   -!- SIMILARITY: Belongs to the coronaviruses polyprotein 1ab family.
CC       {ECO:0000256|ARBA:ARBA00008087}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; MN692788; QKV43822.1; -; Genomic_RNA.
DR   Gene3D; 1.10.150.420; -; 1.
DR   Gene3D; 1.10.1840.10; -; 1.
DR   Gene3D; 1.10.8.370; -; 1.
DR   Gene3D; 2.40.10.10; -; 2.
DR   Gene3D; 2.40.10.250; -; 1.
DR   Gene3D; 2.40.10.290; -; 1.
DR   Gene3D; 3.40.220.10; -; 2.
DR   Gene3D; 3.90.70.90; -; 2.
DR   InterPro; IPR043613; CoV_NSP2_C.
DR   InterPro; IPR043615; CoV_NSP2_N.
DR   InterPro; IPR043611; CoV_NSP3_C.
DR   InterPro; IPR043612; CoV_NSP4_N.
DR   InterPro; IPR043610; CoV_NSP6.
DR   InterPro; IPR002589; Macro_dom.
DR   InterPro; IPR043472; Macro_dom-like.
DR   InterPro; IPR036333; NSP10_sf_CoV.
DR   InterPro; IPR032505; NSP4_C_CoV.
DR   InterPro; IPR038123; NSP4_C_sf_CoV.
DR   InterPro; IPR014828; NSP7_CoV.
DR   InterPro; IPR037204; NSP7_sf_CoV.
DR   InterPro; IPR014829; NSP8_CoV-like.
DR   InterPro; IPR037230; NSP8_sf_CoV.
DR   InterPro; IPR014822; NSP9_CoV.
DR   InterPro; IPR036499; NSP9_sf_CoV.
DR   InterPro; IPR011050; Pectin_lyase_fold/virulence.
DR   InterPro; IPR013016; Peptidase_C16_CoV.
DR   InterPro; IPR008740; Peptidase_C30_CoV.
DR   InterPro; IPR043477; Peptidase_C30_dom3_CoV.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR043503; PLpro_palm_finger_dom_CoV.
DR   InterPro; IPR018995; RNA_synth_NSP10_CoV.
DR   Pfam; PF09401; CoV_NSP10; 1.
DR   Pfam; PF19212; CoV_NSP2_C; 2.
DR   Pfam; PF19211; CoV_NSP2_N; 1.
DR   Pfam; PF19218; CoV_NSP3_C; 1.
DR   Pfam; PF16348; CoV_NSP4_C; 1.
DR   Pfam; PF19217; CoV_NSP4_N; 1.
DR   Pfam; PF19213; CoV_NSP6; 1.
DR   Pfam; PF08716; CoV_NSP7; 1.
DR   Pfam; PF08717; CoV_NSP8; 1.
DR   Pfam; PF08710; CoV_NSP9; 1.
DR   Pfam; PF08715; CoV_peptidase; 2.
DR   Pfam; PF01661; Macro; 1.
DR   Pfam; PF05409; Peptidase_C30; 1.
DR   SMART; SM00506; A1pp; 1.
DR   SUPFAM; SSF101816; SSF101816; 1.
DR   SUPFAM; SSF140367; SSF140367; 1.
DR   SUPFAM; SSF143076; SSF143076; 1.
DR   SUPFAM; SSF144246; SSF144246; 1.
DR   SUPFAM; SSF50494; SSF50494; 1.
DR   SUPFAM; SSF51126; SSF51126; 1.
DR   SUPFAM; SSF52949; SSF52949; 1.
DR   PROSITE; PS51442; M_PRO; 1.
DR   PROSITE; PS51154; MACRO; 1.
DR   PROSITE; PS51124; PEPTIDASE_C16; 2.
PE   3: Inferred from homology;
KW   Activation of host autophagy by virus {ECO:0000256|ARBA:ARBA00023050};
KW   Host cytoplasm {ECO:0000256|ARBA:ARBA00023200};
KW   Host membrane {ECO:0000256|ARBA:ARBA00022870};
KW   Host-virus interaction {ECO:0000256|ARBA:ARBA00022581};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Inhibition of host innate immune response by virus
KW   {ECO:0000256|ARBA:ARBA00022632};
KW   Inhibition of host IRF3 by virus {ECO:0000256|ARBA:ARBA00022931};
KW   Inhibition of host RLR pathway by virus {ECO:0000256|ARBA:ARBA00022482};
KW   Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Modulation of host ubiquitin pathway by viral deubiquitinase
KW   {ECO:0000256|ARBA:ARBA00022876};
KW   Modulation of host ubiquitin pathway by virus
KW   {ECO:0000256|ARBA:ARBA00022662}; Protease {ECO:0000256|ARBA:ARBA00022670};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Ribosomal frameshifting {ECO:0000256|ARBA:ARBA00022758};
KW   RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW   Thiol protease {ECO:0000256|ARBA:ARBA00022807};
KW   Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW   ECO:0000256|SAM:Phobius};
KW   Ubl conjugation pathway {ECO:0000256|ARBA:ARBA00022786};
KW   Viral immunoevasion {ECO:0000256|ARBA:ARBA00023280};
KW   Zinc {ECO:0000256|ARBA:ARBA00022833};
KW   Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT   TRANSMEM        1964..1982
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        2025..2044
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        2104..2125
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        2132..2153
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        2165..2185
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        2528..2546
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        2787..2804
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        2859..2887
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3337..3354
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3361..3381
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3401..3419
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3431..3449
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3469..3493
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3500..3520
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          1057..1296
FT                   /note="Peptidase C16"
FT                   /evidence="ECO:0000259|PROSITE:PS51124"
FT   DOMAIN          1286..1465
FT                   /note="Macro"
FT                   /evidence="ECO:0000259|PROSITE:PS51154"
FT   DOMAIN          1691..1951
FT                   /note="Peptidase C16"
FT                   /evidence="ECO:0000259|PROSITE:PS51124"
FT   DOMAIN          2998..3299
FT                   /note="Peptidase C30"
FT                   /evidence="ECO:0000259|PROSITE:PS51442"
FT   REGION          1012..1039
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   4117 AA;  453053 MW;  12D3BC4AE1DFDE21 CRC64;
     MASNHVTLAF ANDAEISAFG FCTASEAVSY YSEAAASGFM LCRFVSFDLA DTVEGLLPED
     YVMVVVGTTK LSAYVDTFGS RPKNICGWLL FSNCNYFLEE LELTFGRRGG NIVPVDQYMC
     GADGKPVLQE SEWEYTDFFA DSEDGQLNIA GITYVKAWIV ERSDVSYASQ NLTSIKSITY
     CSTYEHTFPD GTAMKVARTP KIKKTVVLSE PLATIYREIG SPFVDNGSDA RSIIKRPVFL
     HAFVKCKCGS CHWTVGDWTS YVSTCCGFKC KPVLVASCSA TPGSVVVTRA GAGTGVKYYN
     NMFLRHVADI DGLAFWRILK VQSKDDLACS GKFLEHHEEG FTDPCYFLND SSIATKLKFD
     ILSGKFSDEV KQAIFAGHVV VGSALVDIVD DALGQPWFIR KLGDLASAAW EQLKAVVRGL
     NLLSDEVVLF GKRLSCATLS IVNGVFEFLA EVPEKLAAAV TVFVNFLNEF FESACDCLKV
     GGKTFNKVGS YVLFDNALVK LVKAKVRGPR QAGVCEVRYT SLVIGSTSKV VSKRVENANV
     NLVVVDEDVT LNTTGRTVVV DGLAFFESDG FYRHLADADV VIEHPVYKSA CELKPVFECD
     PIPDFPMPVA ASVAELCVQT DLLLKNYNTP YKTYSCVVRG DKCCITCTLH FTAPSYMEAA
     ANFVDLCTKN IGTAGFHEFY ITAHEQQDLQ GFVTTCCTMS GFECFMPIIP QCPAVLEEID
     GGSIWRSFIT GLNTMWDFCK HLKVSFGLDG IVVTVARKFK RLGALLAEMY NTYLSTVVEN
     LVLAGVSFKY YATSVPKIVL GCCFHSVKSV LASAFQIPVQ AGVEKFKVFL NCVHPVVPRV
     IETSFVELEE TTFKPPALNG SIAIVDGFAF YYDGTLYYPT DGNSVVPICF KKKGGGDVKF
     SDEVSVKTID PVYKVSLEFE FESETIMAVL NKAVGNCIKV TGGWDDVVEY INVAIEVLKD
     HIDVPKYYIY DEEGGTDPNL PVMVSQWPLN DDTISQDLLD VEVITDAPVD FEGDEVDSSE
     PDKLADVANS EPEDDGLNVA PETNVESGVE EVAATWPFIK VTPSTVTKDP FAFDFASYGG
     LKVLRQSHNN CWVTSTLVQL QLLGIVDDPA MELFSAGRVG PMVRKCYESQ KAILGSLGDV
     SACLESLTKD LHTLKITCSV VCGCGTGERI YEGCAFRMTP TLEPFPYGAC AQCAQVLMHT
     FKSIVGTGIF CRDTTALSLD SLVVKPFCAA AFIGKDSGHY VTNFYDAAMA IDGYGRHQIK
     YDTLNTICVK DVNWTAPFVP DFEPVLEPVV KPFYSYKNVD FYQGDFSDLV KLPCDFVVNA
     ANESLSHGGG IAKAIDVYTK GMWQKCLNDY IPGHGPIKVG RGVMLEALGL KVFNVVGPRK
     GKHAPELLVK AYKSVFANSG VALTPFISVG IFSVPLEESL SAFLACVGDR HCKCFCYSDK
     EREAIINYMD GLVDAIFKDA LVDTTPVQED VQQVSQKPVL PNFEPFRIEG AHAFYECNPE
     GLMSLGADKL VLFTNSNLDF CSVGKCLNNV TGGALLEAIN VFKKSNKTVP AGNCVTFECA
     DMISITMVVL PSDGDANYDK NYARAVVKVS KLKGKLLLAV GDATLYSKLS HLSVVGFVST
     PDDVERFYAN KSVVIKVTED TRSVKAVKVE STVTYGQQIG PCLVNDTVVT DNKPVVADVV
     AKVVPSANWD SHYGFDKAGE FHMLDHTGFA FPSEVVNGRR VLKTTDNNCW VNVTCLQLQF
     ARFRFKSAGL QAMCESYCTG DVAMFVHWLY WLTGVDKGQP SDSENALNML SKYIVSAGSV
     TIERVTHDGC CCSKRVVTAP VVNASVLKLG VEDGLCPHGL NYIDKVVVVK GTTIVVNVGK
     PVVAPSHLFL KGVSYTTFLD NGNCVVGHYT VFDHDTGMVH DGDAFVPGDL NVSPVTNVVI
     SEQTAVVIKD PVKKVELDAT KLLDTMNYAS ERFFSFGDFM SRNLITVFLY ILSILGLCFR
     AFRKRDVKVL AGVPQRTGII LRKSVRYNAK ALGVFFKLKL YWFKVLGKFS LGIYALYALL
     FMTIRFTPIG GPVCDDVVAG YANSSFDKDE YCNSVICKVC LYGYQELSDF SHTQVVWQHL
     RDPLIGNVIP FFYLAFLAIF GGVYVKAITL YFICQYLNIL GVFLGLQQSI WFLQLVPFDV
     FGDEIVVFFI VTRVLMFLKH VFLGCDKASC VACSRSARLK RVPVQTIFQG TSKSFYVHAN
     GGSKFCKKHN FFCLNCDSYG PGCTFINDVI ATEVGNVVKL NVQPTGPATI LIDKVEFSNG
     FYYLYSGDTF WKYNFDITDS KYTCKESLKN CSIITDFIVF NNNGSNVNQV KNACVYFSQM
     LCKPVKLVDS ALLASLSVDF GASLHSAFVS VLSNSFGKDL SSCNDMQDCK STLGFDDVPL
     DTFNAAVAEA HRYDVLLTDM SFNNFTTSYA KPEEKLPVHD IATCMRVGAQ IVINNVFVKD
     SIPVVWLVRD FIALSEETRK YIIRTTKVKG ITFMLTFNDC RMHTTIPTVC IANKKGAGLP
     SFSKVKKFFW SLCLFIVAVF FALSFLDFST QVSIDSDYDF KYIESGQLKT FDNPLSCVHN
     VFSNFDQWHD AKFGFTPVNN PSCPIVVGVS DEARTVPGIP AGVYLAGKTL VFAINTIFGT
     SGLCFDASGV ADKGACIFNS ACTTLSGLGG TAVYCYKNGL VEGAKLYSEL APHSYYKMVD
     GNAVSLPEII SRGFGIRTIR TKAMTYCRVG QCVQSAEGVC FGADRFFVYN AESGSDFVCG
     TGLFTLLMNV ISVFSKTVPV TVLSGQILFN CIIAFAAVAV CFLFTKFKRM FGDMSVGVFT
     VGACTLLNNV SYIVTQNTLG MLGYATLYFL CTKGVRYMWI WHLGFLISYI LIAPWWVLMV
     YAFSAIFEFM PNLFKLKVST QLFEGDKFVG SFENAAAGTF VLDMHAYERL ANSISTEKLR
     QYASTYNKYK YYSGSASEAD YRLACFAHLA KAMMDYASNH NDTLYTPPTV SYNSTLQAGL
     RKMAQPSGVV EKCIVRVCYG NMALNGLWLG DTVMCPRHVI ASSTTSTIDY DYALSVLRLH
     NFSISSGNVF LGVVGVTMRG ALLQIKVNQN NVHTPKYTYR TVRPGESFNI LACYDGAAAG
     VYGVNMRSNY TIRGSFINGA CGSPGYNINN GTVEFCYLHQ LELGSGCHVG SDLDGVMYGG
     YEDQPTLQVE GASSLFTENV LAFLYAALIN GSTWWLSSSR IAVDRFNEWA VHNGMTTVGN
     TDCFSILAAK TGVDVQRLLA SIQSLHKNFG GKQILGYTSL TDEFTTGEVI RQMYGVNLQS
     GYVSRACRNV LLVGSFLTFF WSELVSYTKF FWVNPGYVTP MFACLSLLSS LLMFTLKHKT
     FFFQVFLIPA LIVTSCINLA FDVEVYNYLA EHFDYHVSLM GFNAQGLVNI FVCFVVTILH
     GTYTWRFFNT PVSSVTYVVA LLTAAYNYFY ASDILSCAMT LFASVTGNWF VGAVCYKAAV
     YMALRFPTFV AIFGDIKSVM FCYLVLGYFT CCFYGILYWF NRFFKVSVGV YDYTVSAAEF
     KYMVANGLRA PTGTLDSLLL SAKLIVIGGE RNIKLSSVQS KLTDIKCSNV VLLGCLSSMN
     VSANSTEWAY CVDLHNKINL CNDPEKAQEM LLALLAFFLS KNSAFGLDDL LESYFNDNSM
     LQSVASTYVG LPSYVIYENA RQQYEDAVNN GSPPQLVKQL RHAMNVAKSE FDREASTQRK
     LDRMAEQAAA QMYKEARAVN RKSKVVSAMH SLLFGMLRRL DMSSVDTILN LAKDGVVPLS
     VIPAVSATKL NIVTSDIDSY NRIQREGCVH YAGTIWNIID IKDNDGKVVH VKEVTAQNAE
     SLSWPLVLGC ERIVKLQNNE IIPGKLKQRS IKAEGDGIVG EGKALYNNEG GRTFMYAFIS
     DKPDLRVVKW EFDGGCNTIE LEPPRKFLVD SPNGAQIKYL YFVRNLNTLR RGAVLGYIGA
     TVRLQAGKQT EQAINSSLLT LCAFAVDPAK TYIDAVKSGH KPVGNCVKML ANGSGNGQAV
     TNGVEASTNQ DSYGGASVCL YCRAHVEHPS MDGFCRLKGK YVQVPLGTVD PIRFVLENDV
     CKVCGCWLAN GCTCDRSIMQ STDMAYLNEY GALVQLD
//