ID   W8ZQY9_ECOLX            Unreviewed;       641 AA.
AC   W8ZQY9;
DT   14-MAY-2014, integrated into UniProtKB/TrEMBL.
DT   14-MAY-2014, sequence version 1.
DT   07-OCT-2020, entry version 24.
DE   SubName: Full=Head-to-tail joining protein W (GpW) from bacteriophage origin {ECO:0000313|EMBL:CDN81628.1};
DE   SubName: Full=Phage terminase large subunit family protein {ECO:0000313|EMBL:MTT22026.1};
GN   ORFNames=EC958_1365 {ECO:0000313|EMBL:CDN81628.1}, GJV31_23140
GN   {ECO:0000313|EMBL:MTT22026.1};
OS   Escherichia coli O25b:H4-ST131.
OC   Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
OC   Enterobacteriaceae; Escherichia.
OX   NCBI_TaxID=941322 {ECO:0000313|EMBL:CDN81628.1, ECO:0000313|Proteomes:UP000032727};
RN   [1] {ECO:0000313|EMBL:CDN81628.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=EC958 {ECO:0000313|EMBL:CDN81628.1};
RA   Beatson S.;
RL   Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EMBL:CDN81628.1, ECO:0000313|Proteomes:UP000032727}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=EC958 {ECO:0000313|EMBL:CDN81628.1,
RC   ECO:0000313|Proteomes:UP000032727};
RX   PubMed=25126841; DOI=10.1371/journal.pone.0104400;
RA   Forde B.M., Ben Zakour N.L., Stanton-Cook M., Phan M.D., Totsika M.,
RA   Peters K.M., Chan K.G., Schembri M.A., Upton M., Beatson S.A.;
RT   "The complete genome sequence of Escherichia coli EC958: a high quality
RT   reference sequence for the globally disseminated multidrug resistant E.
RT   coli O25b:H4-ST131 clone.";
RL   PLoS ONE 9:e104400-e104400(2014).
RN   [3] {ECO:0000313|EMBL:MTT22026.1, ECO:0000313|Proteomes:UP000429998}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=TYBEAR H45 {ECO:0000313|EMBL:MTT22026.1,
RC   ECO:0000313|Proteomes:UP000429998};
RA   Yang Y., Sommers C., Adenipekun E.O., Jackson C.R., Woodley T.A.,
RA   Barrett J.B., Hiott L.M., Frye J.G., Liu Y.;
RT   "Draft genomic sequence of three E. coli ST131 isolated from patients in
RT   Lagos, Nigeria.";
RL   Submitted (NOV-2019) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; HG941718; CDN81628.1; -; Genomic_DNA.
DR   EMBL; WMHN01000037; MTT22026.1; -; Genomic_DNA.
DR   RefSeq; WP_001027261.1; NZ_WMHN01000037.1.
DR   EnsemblBacteria; CDN81628; CDN81628; EC958_1365.
DR   KEGG; ecos:EC958_1365; -.
DR   HOGENOM; CLU_023850_3_0_6; -.
DR   KO; K21512; -.
DR   Proteomes; UP000032727; Chromosome.
DR   Proteomes; UP000429998; Unassembled WGS sequence.
DR   InterPro; IPR008866; Phage_lambda_GpA.
DR   Pfam; PF05876; Terminase_GpA; 1.
PE   4: Predicted;
SQ   SEQUENCE   641 AA;  73237 MW;  A40299BBDCAC078A CRC64;
     MNISNSQVNR LRHFVRAGLR SLFRPEPQTA VEWADANYYL PKESAYQEGR WETLPFQRAI
     MNAMGSDYIR EVNVVKSARV GYSKMLLGVY AYFIEHKQRN TLIWLPTDGD AENFMKTHVE
     PTIRDIPSLL ALAPWYGKKH RDNTLTMKRF SNGRGFWCLG GKAAKNYREK SVDVAGYDEL
     AAFDEDIEQE GSPTFLGDKR IEGSVWPKSI RGSTPKVRGT CQIERAASES PHCMRFHVAC
     PHCGEEQYLK FGDKETPFGL KWTPDDPSSV FYLCEHNACV IRQQELDFTD ARYICEKTGI
     WTRDGILWFS SSGEEIEPPD SVTFHIWTAY SPFTTWVQIV KDWMKTKGDT GKRKTFVNTT
     LGETWEAKIG ERPDAEVMAE RKEHYSAPVP DRVAYLTAGI DSQLDRYEMR VWGWGPGEES
     WLIDRQIIMG RHDDEQTLLH VDEAINKTYT RRNGAEMSVS RICWDIGGID PTIVYERSKK
     HGLFRVIPIK GASVYGKPVA SMPRKRNKNG VYLTEIGTDT AKEQIYNRFT LTPEGDEPLP
     GAVHFPNNPD IFDLTEAQQL TAEEQVEKWV DGRKKILWDS KKRRNEALDC FVYALAALRI
     SISRWQLDLS ALLASLQEED GAATNKKTLA DYARALSGED E
//