ID   Q19079 PRELIMINARY; PRT; 302 AA.
AC   Q19079;
DT   01-NOV-1996 (TrEMBLrel. 01, Created)
DT   01-NOV-1996 (TrEMBLrel. 01, Last sequence update)
DT   01-JUN-2000 (TrEMBLrel. 14, Last annotation update)
DE   COSMID EGAP7.
GN   EGAP7.1.
OS   Caenorhabditis elegans.
OC   Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea;
OC   Rhabditidae; Peloderinae; Caenorhabditis.
OX   NCBI_TaxID=6239;
RN   [1]
RP   SEQUENCE FROM N.A.
RC   STRAIN=BRISTOL N2;
RX   MEDLINE=94150718; PubMed=7906398;
RA   Wilson R., Ainscough R., Anderson K., Baynes C., Berks M.,
RA   Bonfield J., Burton J., Connell M., Copsey T., Cooper J., Coulson A.,
RA   Craxton M., Dear S., Du Z., Durbin R., Favello A., Fulton L.,
RA   Gardner A., Green P., Hawkins T., Hillier L., Jier M., Johnston L.,
RA   Jones M., Kershaw J., Kirsten J., Laister N., Latreille P.,
RA   Lightning J., Lloyd C., Mcmurray A., Mortimore B., O'Callaghan M.,
RA   Parsons J., Percy C., Rifken L., Roopra A., Saunders D., Shownkeen R.,
RA   Smaldon N., Smith A., Sonnhammer E., Staden R., Sulston J.,
RA   Thierry-Mieg J., Thomas K., Vaudin M., Vaughan K., Waterston R.,
RA   Watson A., Weinstock L., Wilkinson-Sproat J., Wohldman P.;
RT   "2.2 Mb of contiguous nucleotide sequence from chromosome III of C.
RT   elegans.";
RL   Nature 368:32-38(1994).
RN   [2]
RP   SEQUENCE FROM N.A.
RC   STRAIN=BRISTOL N2;
RA   Miller N.;
RL   Submitted (MAY-1996) to the EMBL/GenBank/DDBJ databases.
RN   [3]
RP   SEQUENCE FROM N.A.
RC   STRAIN=BRISTOL N2;
RA   Waterston R.;
RL   Submitted (MAY-1996) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; U58736; AAB00598.1; -.
DR   INTERPRO; IPR000087; -.
DR   INTERPRO; IPR002486; -.
DR   PFAM; PF01391; Collagen; 2.
DR   PFAM; PF01484; Col_cuticle_N; 1.
SQ   SEQUENCE 302 AA; 29616 MW; 3191CAB5A23BBAEE CRC64;
     MEQKCAPRRS LRLLAIASAT LAIVSMLATV IIVPLVYNHV QHLQSVMNSE VDFCKTRSRD
     LWREMVTVQS ATGGIPARTA RRTRRDNYGA QPIAANPPSS AAGSCCTCQV GPPGPPGPPG
     RDGRPGAPGR PGNPGPPGRD GALLPGPPPK PPCQKCPPGP PGPAGPPGPK GLPGPQGDAG
     TSGQDGVPGL PGPPGPSGPQ GAPGVPGEKG PTGEPGKVIN GAPPGPPGPP GPPGPQGPPG
     PPGKDGQPGK AGPPGLPGDP GEKGSDGLPG PHGGTGPRGP PGQPGSCDHC PPPRTGPGYA
     RR
//