ID   A0A859GSJ7_SARS2        Unreviewed;      7096 AA.
AC   A0A859GSJ7;
DT   29-SEP-2021, integrated into UniProtKB/TrEMBL.
DT   29-SEP-2021, sequence version 1.
DT   29-SEP-2021, entry version 1.
DE   SubName: Full=ORF1ab polyprotein {ECO:0000313|EMBL:QKV25199.1};
GN   Name=ORF1ab {ECO:0000313|EMBL:QKV25199.1};
OS   Severe acute respiratory syndrome coronavirus 2 (2019-nCoV) (SARS-CoV-2).
OC   Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
OC   Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae;
OC   Betacoronavirus; Sarbecovirus.
OX   NCBI_TaxID=2697049 {ECO:0000313|EMBL:QKV25199.1};
OH   NCBI_TaxID=9606; Homo sapiens (Human).
RN   [1] {ECO:0000313|EMBL:QKV25199.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=SARS-CoV-2/human/USA/CT-Yale-014/2020
RC   {ECO:0000313|EMBL:QKV25199.1};
RX   PubMed=32511630;
RA   Fauver J.R., Petrone M.E., Hodcroft E.B., Shioda K., Ehrlich H.Y.,
RA   Watts A.G., Vogels C.B.F., Brito A.F., Alpert T., Muyombwe A., Razeq J.,
RA   Downing R., Cheemarla N.R., Wyllie A.L., Kalinich C.C., Ott I., Quick J.,
RA   Loman N.J., Neugebauer K.M., Greninger A.L., Jerome K.R., Roychoundhury P.,
RA   Xie H., Shrestha L., Huang M.L., Pitzer V.E., Iwasaki A., Omer S.B.,
RA   Khan K., Bogoch I., Martinello R.A., Foxman E.F., Landry M.L., Neher R.A.,
RA   Ko A.I., Grubaugh N.D.;
RT   "Coast-to-coast spread of SARS-CoV-2 in the United States revealed by
RT   genomic epidemiology.";
RL   medRxiv 0:0-0(2020).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; MT635212; QKV25199.1; -; Genomic_RNA.
PE   4: Predicted;
SQ   SEQUENCE   7096 AA;  794070 MW;  25A7FB5E451164C5 CRC64;
     MESLVPGFNE KTHVQLSLPV LQVRDVLVRG FGDSVEEVLS EARQHLKDGT CGLVEVEKGV
     LPQLEQPYVF IKRSDARTAP HGHVMVELVA ELEGIQYGRS GETLGVLVPH VGEIPVAYRK
     VLLRKNGNKG AGGHSYGADL KSFDLGDELG TDPYEDFQEN WNTKHSSGVT RELMRELNGG
     AYTRYVDNNF CGPDGYPLEC IKDLLARAGK ASCTLSEQLD FIDTKRGVYC CREHEHEIAW
     YTERSEKSYE LQTPFEIKLA KKFDIFNGEC PNFVFPLNSI IKTIQPRVEK KKLDGFMGRI
     RSVYPVASPN ECNQMCLSTL MKCDHCGETS WQTGDFVKAT CEFCGTENLT KEGATTCGYL
     PQNAVVKIYC PACHNSEVGP EHSLAEYHNE SGLKTILRKG GRTIAFGGCV FSYVGCHNKC
     AYWVPRASAN IGCNHTGVVG EGSEGLNDNL LEILQKEKVN INIVGDFKLN EEIAIILASF
     SASTSAFVET VKGLDYKAFK QIVESCGNFK VTKGKAKKGA WNIGEQKSIL SPLYAFASEA
     ARVVRSIFSR TLETAQNSVR VLQKAAITIL DGISQYSLRL IDAMMFTSDL ATNNLVVMAY
     ITGGVVQLTS QWLTNIFGTV YEKLKPVLDW LEEKFKEGVE FLRDGWEIVK FISTCACEIV
     GGQIVTCAKE IKESVQTFFK LVNKFLALCA DSIIIGGAKL KALNLGETFV THSKGLYRKC
     VKSREETGLL MPLKAPKEII FLEGETLPTE VLTEEVVLKT GDLQPLEQPT SEAVEAPLVG
     TPVCINGLML LEIKDTEKYC ALAPNMMVTN NTFTLKGGAP TKVTFGDDTV IEVQGYKSVN
     ITFELDERID KVLNEKCSAY TVELGTEVNE FACVVADAVI KTLQPVSELL TPLGIDLDEW
     SMATYYLFDE SGEFKLASHM YCSFYPPDED EEEGDCEEEE FEPSTQYEYG TEDDYQGKPL
     EFGATSAALQ PEEEQEEDWL DDDSQQTVGQ QDGSEDNQTT TIQTIVEVQP QLEMELTPVV
     QTIEVNSFSG YLKLTDNVYI KNADIVEEAK KVKPTVVVNA ANVYLKHGGG VAGALNKATN
     NAMQVESDDY IATNGPLKVG GSCVLSGHNL AKHCLHVVGP NVNKGEDIQL LKSAYENFNQ
     HEVLLAPLLS AGIFGADPIH SLRVCVDTVR TNVYLAVFDK NLYDKLVSSF LEMKSEKQVE
     QKIAEIPKEE VKPFITESKP SVEQRKQDDK KIKACVEEVT TTLEETKFLT ENLLLYIDIN
     GNLHPDSATL VSDIDITFLK KDAPYIVGDV VQEGVLTAVV IPTKKAGGTT EMLAKALRKV
     PTDNYITTYP GQGLNGYTVE EAKTVLKKCK SAFYILPSII SNEKQEILGT VSWNLREMLA
     HAEETRKLMP VCVETKAIVS TIQRKYKGIK IQEGVVDYGA RFYFYTSKTT VASLINTLND
     LNETLVTMPL GYVTHGLNLE EAARYMRSLK VPATVSVSSP DAVTAYNGYL TSSSKTPEEH
     FIETISLAGS YKDWSYSGQS TQLGIEFLKR GDKSVYYTSN PTTFHLDGEV ITFDNLKTLL
     SLREVRTIKV FTTVDNINLH TQVVDMSMTY GQQFGPTYLD GADVTKIKPH NSHEGKTFYV
     LPNDDTLRVE AFEYYHTTDP SFLGRYMSAL NHTKKWKYPQ VNGLTSIKWA DNNCYLATAL
     LTLQQIELKF NPPALQDAYY RARAGEAANF CALILAYCNK TVGELGDVRE TMSYLFQHAN
     LDSCKRVLNV VCKTCGQQQT TLKGVEAVMY MGTLSYEQFK KGVQIPCTCG KQATKYLVQQ
     ESPFVMMSAP PAQYELKHGT FTCASEYTGN YQCGHYKHIT SKETLYCIDG ALLTKSSEYK
     GPITDVFYKE NSYTTTIKPV TYKLDGVVCT EIDPKLDNYY KKDNSYFTEQ PIDLVPNQPY
     PNASFDNFKF VCDNIKFADD LNQLTGYKKP ASRELKVTFF PDLNGDVVAI DYKHYTPSFK
     KGAKLLHKPI VWHVNNATNK ATYKPNTWCI RCLWSTKPVE TSNSFDVLKS EDAQGMENLA
     CEDLKPVSEE VVENPTIQKD VLECNVKTTE VVGDIILKPA NNSLKITEEV GHTDLMAAYV
     DNSSLTIKKP NELSRVLGLK TLATHGLAAV NSVPWDTIAN YAKPFLNKVV STTTNIVTRC
     LNRVCTNYMP YFFTLLLQLC TFTRSTNSRI KASMPTTIAK NTVKSVGKFC LEASFNYLKS
     PNFSKLINII IWFLLLSVCL GSLIYSTAAL GVLMSNLGMP SYCTGYREGY LNSTNVTIAT
     YCTGSIPCSV CLSGLDSLDT YPSLETIQIT ISSFKWDLTA FGLVAEWFLA YILFTRFFYV
     LGLAAIMQLF FSYFAVHFIS NSWLMWLIIN LVQMAPISAM VRMYIFFASF YYVWKSYVHV
     VDGCNSSTCM MCYKRNRATR VECTTIVNGV RRSFYVYANG GKGFCKLHNW NCVNCDTFCA
     GSTFISDEVA RDLSLQFKRP INPTDQSSYI VDSVTVKNGS IHLYFDKAGQ KTYERHSLSH
     FVNLDNLRAN NTKGSLPINV IVFDGKSKCE ESSAKSASVY YSQLMCQPIL LLDQALVSDV
     GDSAEVAVKM FDAYVNTFSS TFNVPMEKLK TLVATAEAEL AKNVSLDNVL STFISAARQG
     FVDSDVETKD VVECLKLSHQ SDIEVTGDSC NNYMLTYNKV ENMTPRDLGA CIDCSARHIN
     AQVAKSHNIA LIWNVKDFMS LSEQLRKQIR SAAKKNNLPF KLTCATTRQV VNVVTTKIAL
     KGGKIVNNWL KQLIKVTLVF LFVAAIFYLI TPVHVMSKHT DFSSEIIGYK AIDGGVTRDI
     ASTDTCFANK HADFDTWFSQ RGGSYTNDKA CPLIAAVITR EVGFVVPGLP GTILRTTNGD
     FLHFLPRVFS AVGNICYTPS KLIEYTDFAT SACVLAAECT IFKDASGKPV PYCYDTNVLE
     GSVAYESLRP DTRYVLMDGS IIQFPNTYLE GSVRVVTTFD SEYCRHGTCE RSEAGVCVST
     SGRWVLNNDY YRSLPGVFCG VDAVNLLTNM FTPLIQPIGA LDISASIVAG GIVAIVVTCL
     AYYFMRFRRA FGEYSHVVAF NTLLFLMSFT VLCLTPVYSF LPGVYSVIYL YLTFYLTNDV
     SFLAHIQWMV MFTPLVPFWI TIAYIICIST KHFYWFFSNY LKRRVVFNGV SFSTFEEAAL
     CTFLLNKEMY LKLRSDVLLP LTQYNRYLAL YNKYKYFSGA MDTTSYREAA CCHLAKALND
     FSNSGSDVLY QPPQTSITSA VLQSGFRKMA FPSGKVEGCM VQVTCGTTTL NGLWLDDVVY
     CPRHVICTSE DMLNPNYEDL LIRKSNHNFL VQAGNVQLRV IGHSMQNCVL KLKVDTANPK
     TPKYKFVRIQ PGQTFSVLAC YNGSPSGVYQ CAMRPNFTIK GSFLNGSCGS VGFNIDYDCV
     SFCYMHHMEL PTGVHAGTDL EGNFYGPFVD RQTAQAAGTD TTITVNVLAW LYAAVINGDR
     WFLNRFTTTL NDFNLVAMKY NYEPLTQDHV DILGPLSAQT GIAVLDMCAS LKELLQNGMN
     GRTILGSALL EDEFTPFDVV RQCSGVTFQS AVKRTIKGTH HWLLLTILTS LLVLVQSTQW
     SLFFFLYENA FLPFAMGIIA MSAFAMMFVK HKHAFLCLFL LPSLATVAYF NMVYMPASWV
     MRIMTWLDMV DTSLSGFKLK DCVMYASAVV LLILMTARTV YDDGARRVWT LMNVLTLVYK
     VYYGNALDQA ISMWALIISV TSNYSGVVTT VMFLARGIVF MCVEYCPIFF ITGNTLQCIM
     LVYCFLGYFC TCYFGLFCLL NRYFRLTLGV YDYLVSTQEF RYMNSQGLLP PKNSIDAFKL
     NIKLLGVGGK PCIKVATVQS KMSDVKCTSV VLLSVLQQLR VESSSKLWAQ CVQLHNDILL
     AKDTTEAFEK MVSLLSVLLS MQGAVDINKL CEEMLDNRAT LQAIASEFSS LPSYAAFATA
     QEAYEQAVAN GDSEVVLKKL KKSLNVAKSE FDRDAAMQRK LEKMADQAMT QMYKQARSED
     KRAKVTSAMQ TMLFTMLRKL DNDALNNIIN NARDGCVPLN IIPLTTAAKL MVVIPDYNTY
     KNTCDGTTFT YASALWEIQQ VVDADSKIVQ LSEISMDNSP NLAWPLIVTA LRANSAVKLQ
     NNELSPVALR QMSCAAGTTQ TACTDDNALA YYNTTKGGRF VLALLSDLQD LKWARFPKSD
     GTGTIYTELE PPCRFVTDTP KGPKVKYLYF IKGLNNLNRG MVLGSLAATV RLQAGNATEV
     PANSTVLSFC AFAVDAAKAY KDYLASGGQP ITNCVKMLCT HTGTGQAITV TPEANMDQES
     FGGASCCLYC RCHIDHPNPK GFCDLKGKYV QIPTTCANDP VGFTLKNTVC TVCGMWKGYG
     CSCDQLREPM LQSADAQSFL NRVCGVSAAR LTPCGTGTST DVVYRAFDIY NDKVAGFAKF
     LKTNCCRFQE KDEDDNLIDS YFVVKRHTFS NYQHEETIYN LLKDCPAVAK HDFFKFRIDG
     DMVPHISRQR LTKYTMADLV YALRHFDEGN CDTLKEILVT YNCCDDDYFN KKDWYDFVEN
     PDILRVYANL GERVRQALLK TVQFCDAMRN AGIVGVLTLD NQDLNGNWYD FGDFIQTTPG
     SGVPVVDSYY SLLMPILTLT RALTAESHVD TDLTKPYIKW DLLKYDFTEE RLKLFDRYFK
     YWDQTYHPNC VNCLDDRCIL HCANFNVLFS TVFPLTSFGP LVRKIFVDGV PFVVSTGYHF
     RELGVVHNQD VNLHSSRLSF KELLVYAADP AMHAASGNLL LDKRTTCFSV AALTNNVAFQ
     TVKPGNFNKD FYDFAVSKGF FKEGSSVELK HFFFAQDGNA AISDYDYYRY NLPTMCDIRQ
     LLFVVEVVDK YFDCYDGGCI NANQVIVNNL DKSAGFPFNK WGKARLYYDS MSYEDQDALF
     AYTKRNVIPT ITQMNLKYAI SAKNRARTVA GVSICSTMTN RQFHQKLLKS IAATRGATVV
     IGTSKFYGGW HNMLKTVYSD VENPHLMGWD YPKCDRAMPN MLRIMASLVL ARKHTTCCSL
     SHRFYRLANE CAQVLSEMVM CGGSLYVKPG GTSSGDATTA YANSVFNICQ AVTANVNALL
     STDGNKIADK YVRNLQHRLY ECLYRNRDVD TDFVNEFYAY LRKHFSMMIL SDDAVVCFNS
     TYASQGLVAS IKNFKSVLYY QNNVFMSEAK CWTETDLTKG PHEFCSQHTM LVKQGDDYVY
     LPYPDPSRIL GAGCFVDDIV KTDGTLMIER FVSLAIDAYP LTKHPNQEYA DVFHLYLQYI
     RKLHDELTGH MLDMYSVMLT NDNTSRYWEP EFYEAMYTPH TVLQAVGACV LCNSQTSLRC
     GACIRRPFLC CKCCYDHVIS TSHKLVLSVN PYVCNAPGCD VTDVTQLYLG GMSYYCKSHK
     PPISFPLCAN GQVFGLYKNT CVGSDNVTDF NAIATCDWTN AGDYILANTC TERLKLFAAE
     TLKATEETFK LSYGIATVRE VLSDRELHLS WEVGKPRPPL NRNYVFTGYR VTKNSKVQIG
     EYTFEKGDYG DAVVYRGTTT YKLNVGDYFV LTSHTVMPLS APTLVPQEHY VRITGLYPTL
     NISDEFSSNV ANYQKVGMQK YSTLQGPPGT GKSHFAIGLA LYYPSARIVY TACSHAAVDA
     LCEKALKYLP IDKCSRIIPA RARVECFDKF KVNSTLEQYV FCTVNALPET TADIVVFDEI
     SMATNYDLSV VNARLRAKHY VYIGDPAQLP APRTLLTKGT LEPEYFNSVC RLMKTIGPDM
     FLGTCRRCPA EIVDTVSALV YDNKLKAHKD KSAQCFKMFY KGVITHDVSS AINRPQIGVV
     REFLTRNPAW RKAVFISPYN SQNAVASKIL GLPTQTVDSS QGSEYDYVIF TQTTETAHSC
     NVNRFNVAIT RAKVGILCIM SDRDLYDKLQ FTSLEIPRRN VATLQAENVT GLFKDCSKVI
     TGLHPTQAPT HLSVDTKFKT EGLCVDIPGI PKDMTYRRLI SMMGFKMNYQ VNGYPNMFIT
     REEAIRHVRA WIGFDVEGCH ATREAVGTNL PLQLGFSTGV NLVAVPTGYV DTPNNTDFSR
     VSAKPPPGDQ FKHLIPLMYK GLPWNVVRIK IVQMLSDTLK NLSDRVVFVL WAHGFELTSM
     KYFVKIGPER TCCLCDRRAT CFSTASDTYA CWHHSIGFDY VYNPFMIDVQ QWGFTGNLQS
     NHDLYCQVHG NAHVASCDAI MTRCLAVHEC FVKRVDWTIE YPIIGDELKI NAACRKVQHM
     VVKAALLADK FPVLHDIGNP KAIKCVPQAD VEWKFYDAQP CSDKAYKIEE LFYSYATHSD
     KFTDGVCLFW NCNVDRYPAN SIVCRFDTRV LSNLNLPXXX XXXXXXXXXX XXXXXXXXXX
     XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX
     XXXXXXXXXX XXXXXXKQFD TYNLWNTFTR LQSLENVAFN VVNKGHFDGQ QGEVPVSIIN
     NTVYTKVDGV DVELFENKTT LPVNVAFELW AKRNIKPVPE VKILNNLGVD IAANTVIWDY
     KRDAPAHIST IGVCSMTDIA KKPTETICAP LTVFFDGRVD GQVDLFRNAR NGVLITEGSV
     KGLQPSVGPK QASLNGVTLI GEAVKTQFNY YKKVDGVVQQ LPETYFTQSR NLQEFKPRSQ
     MEIDFLELAM DEFIERYKLE GYAFEHIVYG DFSHSQLGGL HLLIGLAKRF KESPFELEDF
     IPMDSTVKNY FITDAQTGSS KCVCSVIDLL LDDFVEIIKS QDLSVVSKVV KVTIDYTEIS
     FMLWCKDGHV ETFYPKLQSS QAWQPGVAMP NLYKMQRMLL EKCDLQNYGD SATLPKGIMM
     NVAKYTQLCQ YLNTLTLAVP YNMRVIHFGA GSDKGVAPGT AVLRQWLPTG TLLVDSDLND
     FVSDADSTLI GDCATVHTAN KWDLIISDMY DPKTKNVTKE NDSKEGFFTY ICGFIQQKLA
     LXGSVAIKIT EHSWNADLYK LMGHFAWWTA FVTNVNASSS EAFLIGCNYL GKPREQIDGY
     VMHANYIFWR NTNPIQLSSY SLFDMSKFPL KLRGTAVMSL KEGQINDMIL SLLSKGRLII
     RENNRVVISS DVLVNN
//