ID   W8ZQY9_ECOLX            Unreviewed;       641 AA.
AC   W8ZQY9;
DT   14-MAY-2014, integrated into UniProtKB/TrEMBL.
DT   14-MAY-2014, sequence version 1.
DT   11-JUN-2014, entry version 2.
DE   SubName: Full=Head-to-tail joining protein W (GpW) from bacteriophage origin;
GN   ORFNames=EC958_1365;
OS   Escherichia coli O25b:H4-ST131 str. EC958.
OC   Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales;
OC   Enterobacteriaceae; Escherichia.
OX   NCBI_TaxID=941322;
RN   [1]
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=EC958;
RX   PubMed=22053197; DOI=10.1371/journal.pone.0026578;
RA   Totsika M., Beatson S.A., Sarkar S., Phan M.D., Petty N.K.,
RA   Bachmann N., Szubert M., Sidjabat H.E., Paterson D.L., Upton M.,
RA   Schembri M.A.;
RT   "Insights into a Multidrug Resistant Escherichia coli Pathogen of the
RT   Globally Disseminated ST131 Lineage: Genome Analysis and Virulence
RT   Mechanisms.";
RL   PLoS ONE 6:e26578-e26578(2011).
RN   [2]
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=EC958;
RA   Beatson S.;
RL   Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; HG941718; CDN81628.1; -; Genomic_DNA.
DR   InterPro; IPR008866; Phage_lambda_GpA.
DR   Pfam; PF05876; Terminase_GpA; 1.
PE   4: Predicted;
SQ   SEQUENCE   641 AA;  73237 MW;  A40299BBDCAC078A CRC64;
     MNISNSQVNR LRHFVRAGLR SLFRPEPQTA VEWADANYYL PKESAYQEGR WETLPFQRAI
     MNAMGSDYIR EVNVVKSARV GYSKMLLGVY AYFIEHKQRN TLIWLPTDGD AENFMKTHVE
     PTIRDIPSLL ALAPWYGKKH RDNTLTMKRF SNGRGFWCLG GKAAKNYREK SVDVAGYDEL
     AAFDEDIEQE GSPTFLGDKR IEGSVWPKSI RGSTPKVRGT CQIERAASES PHCMRFHVAC
     PHCGEEQYLK FGDKETPFGL KWTPDDPSSV FYLCEHNACV IRQQELDFTD ARYICEKTGI
     WTRDGILWFS SSGEEIEPPD SVTFHIWTAY SPFTTWVQIV KDWMKTKGDT GKRKTFVNTT
     LGETWEAKIG ERPDAEVMAE RKEHYSAPVP DRVAYLTAGI DSQLDRYEMR VWGWGPGEES
     WLIDRQIIMG RHDDEQTLLH VDEAINKTYT RRNGAEMSVS RICWDIGGID PTIVYERSKK
     HGLFRVIPIK GASVYGKPVA SMPRKRNKNG VYLTEIGTDT AKEQIYNRFT LTPEGDEPLP
     GAVHFPNNPD IFDLTEAQQL TAEEQVEKWV DGRKKILWDS KKRRNEALDC FVYALAALRI
     SISRWQLDLS ALLASLQEED GAATNKKTLA DYARALSGED E
//