ID   W8ZQY9_ECOLX            Unreviewed;       641 AA.
AC   W8ZQY9;
DT   14-MAY-2014, integrated into UniProtKB/TrEMBL.
DT   14-MAY-2014, sequence version 1.
DT   01-OCT-2014, entry version 3.
DE   SubName: Full=Head-to-tail joining protein W (GpW) from bacteriophage origin {ECO:0000313|EMBL:CDN81628.1};
GN   ORFNames=EC958_1365 {ECO:0000313|EMBL:CDN81628.1};
OS   Escherichia coli O25b:H4-ST131 str. EC958.
OC   Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales;
OC   Enterobacteriaceae; Escherichia.
OX   NCBI_TaxID=941322 {ECO:0000313|EMBL:CDN81628.1};
RN   [1] {ECO:0000313|EMBL:CDN81628.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=EC958 {ECO:0000313|EMBL:CDN81628.1};
RX   PubMed=22053197; DOI=10.1371/journal.pone.0026578;
RA   Totsika M., Beatson S.A., Sarkar S., Phan M.D., Petty N.K.,
RA   Bachmann N., Szubert M., Sidjabat H.E., Paterson D.L., Upton M.,
RA   Schembri M.A.;
RT   "Insights into a Multidrug Resistant Escherichia coli Pathogen of the
RT   Globally Disseminated ST131 Lineage: Genome Analysis and Virulence
RT   Mechanisms.";
RL   PLoS ONE 6:e26578-e26578(2011).
RN   [2] {ECO:0000313|EMBL:CDN81628.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=EC958 {ECO:0000313|EMBL:CDN81628.1};
RA   Beatson S.;
RL   Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; HG941718; CDN81628.1; -; Genomic_DNA.
DR   InterPro; IPR008866; Phage_lambda_GpA.
DR   Pfam; PF05876; Terminase_GpA; 1.
PE   4: Predicted;
SQ   SEQUENCE   641 AA;  73237 MW;  A40299BBDCAC078A CRC64;
     MNISNSQVNR LRHFVRAGLR SLFRPEPQTA VEWADANYYL PKESAYQEGR WETLPFQRAI
     MNAMGSDYIR EVNVVKSARV GYSKMLLGVY AYFIEHKQRN TLIWLPTDGD AENFMKTHVE
     PTIRDIPSLL ALAPWYGKKH RDNTLTMKRF SNGRGFWCLG GKAAKNYREK SVDVAGYDEL
     AAFDEDIEQE GSPTFLGDKR IEGSVWPKSI RGSTPKVRGT CQIERAASES PHCMRFHVAC
     PHCGEEQYLK FGDKETPFGL KWTPDDPSSV FYLCEHNACV IRQQELDFTD ARYICEKTGI
     WTRDGILWFS SSGEEIEPPD SVTFHIWTAY SPFTTWVQIV KDWMKTKGDT GKRKTFVNTT
     LGETWEAKIG ERPDAEVMAE RKEHYSAPVP DRVAYLTAGI DSQLDRYEMR VWGWGPGEES
     WLIDRQIIMG RHDDEQTLLH VDEAINKTYT RRNGAEMSVS RICWDIGGID PTIVYERSKK
     HGLFRVIPIK GASVYGKPVA SMPRKRNKNG VYLTEIGTDT AKEQIYNRFT LTPEGDEPLP
     GAVHFPNNPD IFDLTEAQQL TAEEQVEKWV DGRKKILWDS KKRRNEALDC FVYALAALRI
     SISRWQLDLS ALLASLQEED GAATNKKTLA DYARALSGED E
//