ID   Q6JH47_CVHSA Unreviewed; 4382 AA.
AC   Q6JH47;
DT   05-JUL-2004, integrated into UniProtKB/TrEMBL.
DT   05-JUL-2004, sequence version 1.
DT   14-MAY-2014, entry version 74.
DE   SubName: Full=Orf1a polyprotein;
OS   SARS coronavirus Sino1-11.
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Nidovirales;
OC   Coronaviridae; Coronavirinae; Betacoronavirus.
OX   NCBI_TaxID=255730;
RN   [1]
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=Sino1-11;
RA   Jin W.W., Feng J.D., Du Z.L., Hu L.X., Liu Y.X., Liu Y.F., Zhang X.M.,
RA   Gao H., Ning Y., Zhang J.S., Li N., Yin W.D.;
RT   "Variance analysis of nucleic acid sequence of candidate strain Sino1
RT   for identification of SARS inactivated vaccine.";
RL   Submitted (NOV-2003) to the EMBL/GenBank/DDBJ databases.
RN   [2]
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=Sino1-11;
RX   PubMed=16269206; DOI=10.1016/j.vaccine.2004.12.032;
RA   Zhang J., Liu Y., Hu L., Gao Q., Zhang Z., Zhang X., Chen J., Gong X.,
RA   Song L., Liu Y., Li J., Li S., Huang J., Ning Y., Gao H., Qin C.,
RA   Dong X., Wei J., Dong G., Yin W.;
RT   "Preparation and characterization of SARS in-house reference
RT   antiserum.";
RL   Vaccine 23:5666-5669(2005).
RN   [3]
RP   STRUCTURE BY NMR OF 3427-3546.
RX   PubMed=19319935; DOI=10.1002/pro.76;
RA   Zhong N., Zhang S., Xue F., Kang X., Zou P., Chen J., Liang C.,
RA   Rao Z., Jin C., Lou Z., Xia B.;
RT   "C-terminal domain of SARS-CoV main protease can form a 3D domain-
RT   swapped dimer.";
RL   Protein Sci. 18:839-844(2009).
CC   -!- FUNCTION: Nsp7-nsp8 hexadecamer may possibly confer processivity
CC       to the polymerase, maybe by binding to dsRNA or by producing
CC       primers utilized by the latter (By similarity).
CC   -!- FUNCTION: Nsp9 is a ssRNA-binding protein (By similarity).
CC   -!- SUBCELLULAR LOCATION: Host cytoplasm, host perinuclear region (By
CC       similarity).
CC   -!- SUBCELLULAR LOCATION: Host membrane; Multi-pass membrane protein
CC       (By similarity).
CC   -!- SIMILARITY: Contains Macro domain.
CC   -!- SIMILARITY: Contains peptidase C30 domain.
CC   -!- SIMILARITY: Contains peptidase C16 domain.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AY485277; AAR23244.1; -; Genomic_RNA.
DR   PDB; 2K7X; NMR; -; A=3427-3546.
DR   PDBsum; 2K7X; -.
DR   ProteinModelPortal; Q6JH47; -.
DR   SMR; Q6JH47; 13-127, 819-930, 1002-1176, 1331-1469, 1541-1854, 3241-3546, 3837-3910, 3921-4111, 4118-4230, 4240-4362.
DR   EvolutionaryTrace; Q6JH47; -.
DR   GO; GO:0033644; C:host cell membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0044220; C:host cell perinuclear region of cytoplasm; IEA:UniProtKB-SubCell.
DR   GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW.
DR   GO; GO:0004197; F:cysteine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0016817; F:hydrolase activity, acting on acid anhydrides; IEA:InterPro.
DR   GO; GO:0016788; F:hydrolase activity, acting on ester bonds; IEA:InterPro.
DR   GO; GO:0008242; F:omega peptidase activity; IEA:InterPro.
DR   GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR   GO; GO:0003968; F:RNA-directed RNA polymerase activity; IEA:InterPro.
DR   GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR   GO; GO:0039520; P:induction by virus of host autophagy; IEA:UniProtKB-KW.
DR   GO; GO:0019079; P:viral genome replication; IEA:InterPro.
DR   GO; GO:0019082; P:viral protein processing; IEA:InterPro.
DR   InterPro; IPR002589; Macro_dom.
DR   InterPro; IPR021590; NSP1.
DR   InterPro; IPR024375; Nsp3_coronavir.
DR   InterPro; IPR014828; NSP7.
DR   InterPro; IPR014829; NSP8.
DR   InterPro; IPR014822; NSP9.
DR   InterPro; IPR008740; Peptidase_C30.
DR   InterPro; IPR013016; Peptidase_C30/C16.
DR   InterPro; IPR018995; RNA_synth_NSP10_coronavirus.
DR   InterPro; IPR024358; SARS-CoV_Nsp3_N.
DR   InterPro; IPR022733; SARS_polyprot_cleavage.
DR InterPro; IPR009003; Trypsin-like_Pept_dom. DR InterPro; IPR014827; Viral_protease. DR Pfam; PF12379; DUF3655; 1. DR Pfam; PF01661; Macro; 1. DR Pfam; PF11501; Nsp1; 1. DR Pfam; PF09401; NSP10; 1. DR Pfam; PF12124; Nsp3_PL2pro; 1. DR Pfam; PF08716; nsp7; 1. DR Pfam; PF08717; nsp8; 1. DR Pfam; PF08710; nsp9; 1. DR Pfam; PF05409; Peptidase_C30; 1. DR Pfam; PF11633; SUD-M; 1. DR Pfam; PF08715; Viral_protease; 1. DR SUPFAM; SSF101816; SSF101816; 1. DR SUPFAM; SSF144246; SSF144246; 1. DR SUPFAM; SSF50494; SSF50494; 1. DR PROSITE; PS51442; M_PRO; 1. DR PROSITE; PS51154; MACRO; 1. DR PROSITE; PS51124; PEPTIDASE_C16; 1. PE 1: Evidence at protein level; KW 3D-structure; Activation of host autophagy by virus; Host cytoplasm; KW Host membrane; Host-virus interaction; Hydrolase; Membrane; KW Metal-binding; Protease; Repeat; Ribosomal frameshifting; RNA-binding; KW Thiol protease; Transmembrane; Transmembrane helix; Zinc; Zinc-finger. SQ SEQUENCE 4382 AA; 486357 MW; F8E9F8EDD0684B52 CRC64; MESLVLGVNE KTHVQLSLPV LQVRDVLVRG FGDSVEEALS EAREHLKNGT CGLVELEKGV LPQLEQPYVF IKRSDALSTN HGHKVVELVA EMDGIQYGRS GITLGVLVPH VGETPIAYRN VLLRKNGNKG AGGHSYGIDL KSYDLGDELG TDPIEDYEQN WNTKHGSGAL RELTRELNGG AVTRYVDNNF CGPDGYPLDC IKDFLARAGK SMCTLSEQLD YIESKRGVYC CRDHEHEIAW FTERSDKSYE HQTPFEIKSA KKFDTFKGEC PKFVFPLNSK VKVIQPRVEK KKTEGFMGRI RSVYPVASPQ ECNNMHLSTL MKCNHCDEVS WQTCDFLKAT CEHCGTENLV IEGPTTCGYL PTNAVVKMPC PACQDPEIGP EHSVADYHNH SNIETRLRKG GRTRCFGGCV FAYVGCYNKR AYWVPRASAD IGSGHTGITG DNVETLNEDL LEILSRERVN INIVGDFHLN EEVAIILASF SASTSAFIDT IKSLDYKSFK TIVESCGNYK VTKGKPVKGA WNIGQQRSVL TPLCGFPSQA AGVIRSIFAR TLDAANHSIP DLQRAAVTIL DGISEQSLRL VDAMVYTSDL LTNSVIIMAY VTGGLVQQTS QWLSNLLGTT VEKLRPIFEW IEAKLSAGVE FLKDAWEILK FLITGVFDIV KGQIQVASDN IKDCVKCFID VVNKALEMCI DQVTIAGAKL RSLNLGEVFI AQSKGLYRQC IRGKEQLQLL MPLKAPKEVT FLEGDSHDTV LTSEEVVLKN GELEALETPV DSFTNGAIVG TPVCVNGLML LEIKDKEQYC ALSPGLLATN NVFRLKGGAP IKGVTFGEDT VWEVQGYKNV RITFELDERV DKVLNEKCSV YTVESGTEVT EFACVVAEAV VKTLQPVSDL LTNMGIDLDE WSVATFYLFD 
DAGEENFSSR MYCSFYPPDE EEEDDAECEE EEIDETCEHE YGTEDDYQGL PLEFGASAET VRVEEEEEED WLDDTTEQSE IEPEPEPTPE EPVNQFTGYL KLTDNVAIKC VDIVKEAQSA NPMVIVNAAN IHLKHGGGVA GALNKATNGA MQKESDDYIK LNGPLTVGGS CLLSGHNLAK KCLHVVGPNL NAGEDIQLLK AAYENFNSQD ILLAPLLSAG IFGAKPLQSL QVCVQTVRTQ VYIAVNDKAL YEQVVMDYLD NLKPRVEAPK QEEPPNTEDS KTEEKSVVQK PVDVKPKIKA CIDEVTTTLE ETKFLTNKLL LFADINGKLY HDSQNMLRGE DMSFLEKDAP YMVGDVITSG DITCVVIPSK KAGGTTEMLS RALKKVPVDE YITTYPGQGC AGYTLEEAKT ALKKCKSAFY VLPSEAPNAK EEILGTVSWN LREMLAHAEE TRKLMPICMD VRAIMATIQR KYKGIKIQEG IVDYGVRFFF YTSKEPVASI ITKLNSLNEP LVTMPIGYVT HGFNLEEAAR CMRSLKAPAV VSVSSPDAVT TYNGYLTSSS KTSEEHFVET VSLAGSYRDW SYSGQRTELG VEFLKRGDKI VYHTLESPVE FHLDGEVLSL DKLKSLLSLR EVKTIKVFTT VDNTNLHTQL VDMSMTYGQQ FGPTYLDGAD VTKIKPHVNH EGKTFFVLPS DDTLRSEAFE YYHTLDESFL GRYMSALNHT KKWKFPQVGG LTSIKWADNN CYLSSVLLAL QQLEVKFNAP ALQEAYYRAR AGDAANFCAL ILAYSNKTVG ELGDVRETMT HLLQHANLES AKRVLNVVCK HCGQKTTTLT GVEAVMYMGT LSYDNLKTGV SIPCVCGRDA TQYLVQQESS FVMMSAPPAE YKLQQGTFLC ANEYTGNYQC GHYTHITAKE TLYRIDGAHL TKMSEYKGPV TDVFYKETSY TTTIKPVSYK LDGVTYTEIE PKLDGYYKKD NAYYTEQPID LVPTQPLPNA SFDNFKLTCS NTKFADDLNQ MTGFTKPASR ELSVTFFPDL NGDVVAIDYR HYSASFKKGA KLLHKPIVWH INQATTKTTF KPNTWCLRCL WSTKPVDTSN SFEVLAVEDT QGMDNLACES QQPTSEEVVE NPTIQKEVIE CDVKTTEVVG NVILKPSDEG VKVTQELGHE DLMAAYVENT SITIKKPNEL SLALGLKTIA THGIAAINSV PWSKILAYVK PFLGQAAITT SNCAKRLAQR VFNNYMPYVF TLLFQLCTFT KSTNSRIRAS LPTTIAKNSV KSVAKLCLDA GINYVKSPKF SKLFTIAMWL LLLSICLGSL ICVTAAFGVL LSNFGAPSYC NGVRELYLNS SNVTTMDFCE GSFPCSICLS GLDSLDSYPA LETIQVTISS YKLDLTILGL AAEWVLAYML FTKFFYLLGL SAIMQVFFGY FASHFISNSW LMWFIISIVQ MAPVSAMVRM YIFFASFYYI WKSYVHIMDG CTSSTCMMCY KRNRATRVEC TTIVNGMKRS FYVYANGGRG FCKTHNWNCL NCDTFCTGST FISDEVARDL SLQFKRPINP TDQSSYIVDS VAVKNGALHL YFDKAGQKTY ERHPLSHFVN LDNLRANNTK GSLPINVIVF DGKSKCDESA SKSASVYYSQ LMCQPILLLD QALVSDVGDS TEVSVKMFDA YVDTFSATFS VPMEKLKALV ATAHSELAKG VALDGVLSTF VSAARQGVVD TDVDTKDVIE CLKLSHHSDL EVTGDSCNNF MLTYNKVENM TPRDLGACID CNARHINAQV AKSHNVSLIW NVKDYMSLSE QLRKQIRSAA KKNNIPFRLT 
CATTRQVVNV ITTKISLKGG KIVSTCFKLM LKATLLCVLA ALVCYIVMPV HTLSIHDGYT NEIIGYKAIQ DGVTRDIIST DDCFANKHAG FDAWFSQRGG SYKNDKSCPV VAAIITREIG FIVPGLPGTV LRAINGDFLH FLPRVFSAVG NICYTPSKLI EYSDFATSAC VLAAECTIFK DAMGKPVPYC YDTNLLEGSI SYSELRPDTR YVLMDGSIIQ FPNTYLEGSV RVVTTFDAEY CRHGTCERSE VGICLSTSGR WVLNNEHYRA LSGVFCGVDA MNLIANIFTP LVQPVGALDV SASVVAGGII AILVTCAAYY FMKFRRVFGE YNHVVAANAL LFLMSFTILC LVPAYSFLPG VYSVFYLYLT FYFTNDVSFL AHLQWFAMFS PIVPFWITAI YVFCISLKHC HWFFNNYLRK RVMFNGVTFS TFEEAALCTF LLNKEMYLKL RSETLLPLTQ YNRYLALYNK YKYFSGALDT TSYREAACCH LAKALNDFSN SGADVLYQPP QTSITSAVLQ SGFRKMAFPS GKVEGCMVQV TCGTTTLNGL WLDDTVYCPR HVICTAEDML NPNYEDLLIR KANHSFLVQA GNVQLRVIGH SMQNCLLRLK VDTSNPKTPK YKFVRIQPGQ TFSVLACYNG SPSGVYQCAM RPNHTIKGSF LNGSCGSVGF NIDYDCVSFC YMHHMELPTG VHAGTDLEGK FYGPFVDRQT AQAAGTDTTI TLNVLAWLYA AVINGDRWFL NRFTTTLNDF NLVAMKYNYE PLTQDHVDIL GPLSAQTGIA VLDMCAALKE LLQNGMNGRT ILGSTILEDE FTPFDVVRQC SGVTFQGKFK KIVKGTHHWM LLTFLTSLLI LVQSTQWSLF FFVYENAFLP FTLGIMAIAA CAMLLVKHKH AFLCLFLLPS LATVAYFNMV YMPASWVMRI MTWLELADTS LSGYRLKDCV MYASALVLLI LMTARTVYDD AARRVWTLMN VITLVYKVYY GNALDQAISM WALVISVTSN YSGVVTTIMF LARAIVFVCV EYYPLLFITG NTLQCIMLVY CFLGYCCCCY FGLFCLLNRY FRLTLGVYDY LVSTQEFRYM NSQGLLPPKS SIDAFKLNIK LLGIGGKPCI KVATVQSKMS DVKCTSVVLL SVLQQLRVES SSKLWAQCVQ LHNDILLAKD TTEAFEKMVS LLSVLLSMQG AVDINRLCEE MLDNRATLQA IASEFSSLPS YAAYATAQEA YEQAVANGDS EVVLKKLKKS LNVAKSEFDR DAAMQRKLEK MADQAMTQMY KQARSEDKRA KVTSAMQTML FTMLRKLDND ALNNIINNAR DGCVPLNIIP LTTAAKLMVV VPDYGTYKNT CDGNTFTYAS ALWEIQQVVD ADSKIVQLSE INMDNSPNLA WPLIVTALRA NSAVKLQNNE LSPVALRQMS CAAGTTQTAC TDDNALAYYN NSKGGRFVLA LLSDHQDLKW ARFPKSDGTG TIYTELEPPC RFVTDTPKGP KVKYLYFIKG LNNLNRGMVL GSLAATVRLQ AGNATEVPAN STVLSFCAFA VDPAKAYKDY LASGGQPITN CVKMLCTHTG TGQAITVTPE ANMDQESFGG ASCCLYCRCH IDHPNPKGFC DLKGKYVQIP TTCANDPVGF TLRNTVCTVC GMWKGYGCSC DQLREPLMQS ADASTFLNGF AV //