ID CO1A2_SCESX Reviewed; 413 AA. AC C0HLI6; DT 13-NOV-2019, integrated into UniProtKB/Swiss-Prot. DT 13-NOV-2019, sequence version 1. DT 13-NOV-2019, entry version 1. DE RecName: Full=Collagen alpha-2(I) chain {ECO:0000303|PubMed:31171860}; DE AltName: Full=Alpha-2 type I collagen {ECO:0000250|UniProtKB:P08123}; DE Flags: Fragments; OS Scelidotherium sp. (strain SLP-2019) (South American ground sloth). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Xenarthra; Pilosa; Folivora; Mylodontidae; OC Scelidotherium; unclassified Scelidotherium. OX NCBI_TaxID=2546665 {ECO:0000303|PubMed:31171860}; RN [1] {ECO:0000305} RP PROTEIN SEQUENCE, TISSUE SPECIFICITY, AND IDENTIFICATION BY MASS RP SPECTROMETRY. RC TISSUE=Bone {ECO:0000303|PubMed:31171860}; RX PubMed=31171860; DOI=10.1038/s41559-019-0909-z; RA Presslee S., Slater G.J., Pujos F., Forasiepi A.M., Fischer R., RA Molloy K., Mackie M., Olsen J.V., Kramarz A., Taglioretti M., RA Scaglia F., Lezcano M., Lanata J.L., Southon J., Feranec R., Bloch J., RA Hajduk A., Martin F.M., Salas Gismondi R., Reguero M., de Muizon C., RA Greenwood A., Chait B.T., Penkman K., Collins M., MacPhee R.D.E.; RT "Palaeoproteomics resolves sloth relationships."; RL Nat. Ecol. Evol. 0:0-0(2019). CC -!- FUNCTION: Type I collagen is a member of group I collagen CC (fibrillar forming collagen). {ECO:0000305}. CC -!- SUBUNIT: Trimers of one alpha 2(I) and two alpha 1(I) chains. CC {ECO:0000305}. CC -!- SUBCELLULAR LOCATION: Secreted. Secreted, extracellular space. CC Secreted, extracellular space, extracellular matrix {ECO:0000305}. CC -!- TISSUE SPECIFICITY: Expressed in bones. CC {ECO:0000269|PubMed:31171860}. CC -!- PTM: Prolines at the third position of the tripeptide repeating CC unit (G-X-Y) are hydroxylated in some or all of the chains. CC {ECO:0000250|UniProtKB:P08123}. CC -!- MISCELLANEOUS: These protein fragments were extracted from an CC ancient phalanx bone collected at Arroyo Del Moro in Argentina. CC {ECO:0000269|PubMed:31171860}. CC -!- SIMILARITY: Belongs to the fibrillar collagen family. CC {ECO:0000305}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- PE 1: Evidence at protein level; KW Direct protein sequencing; Extinct organism protein; KW Extracellular matrix; Glycoprotein; Hydroxylation; Secreted. FT CHAIN 1 413 Collagen alpha-2(I) chain. FT /FTId=PRO_0000448461. FT MOD_RES 7 7 4-hydroxyproline. FT {ECO:0000250|UniProtKB:P08123}. FT MOD_RES 10 10 4-hydroxyproline. FT {ECO:0000250|UniProtKB:P08123}. FT MOD_RES 21 21 4-hydroxyproline. FT {ECO:0000250|UniProtKB:P08123}. FT MOD_RES 27 27 4-hydroxyproline. FT {ECO:0000250|UniProtKB:P08123}. FT MOD_RES 82 82 5-hydroxylysine; alternate. FT {ECO:0000250|UniProtKB:P08123}. FT MOD_RES 313 313 4-hydroxyproline. FT {ECO:0000250|UniProtKB:P08123}. FT MOD_RES 316 316 4-hydroxyproline. FT {ECO:0000250|UniProtKB:P08123}. FT CARBOHYD 82 82 O-linked (Gal...) hydroxylysine; FT alternate. FT {ECO:0000250|UniProtKB:P08123}. FT UNSURE 6 6 L or I. {ECO:0000303|PubMed:31171860}. FT UNSURE 14 14 L or I. {ECO:0000303|PubMed:31171860}. FT UNSURE 78 78 L or I. {ECO:0000303|PubMed:31171860}. FT UNSURE 102 102 L or I. {ECO:0000303|PubMed:31171860}. FT UNSURE 151 151 L or I. {ECO:0000303|PubMed:31171860}. FT UNSURE 170 170 L or I. {ECO:0000303|PubMed:31171860}. FT UNSURE 188 188 L or I. {ECO:0000303|PubMed:31171860}. FT UNSURE 197 197 L or I. {ECO:0000303|PubMed:31171860}. FT UNSURE 206 206 L or I. {ECO:0000303|PubMed:31171860}. FT UNSURE 216 216 L or I. {ECO:0000303|PubMed:31171860}. FT UNSURE 270 270 L or I. {ECO:0000303|PubMed:31171860}. FT UNSURE 279 279 L or I. {ECO:0000303|PubMed:31171860}. FT UNSURE 318 318 L or I. {ECO:0000303|PubMed:31171860}. FT UNSURE 324 324 L or I. {ECO:0000303|PubMed:31171860}. FT UNSURE 342 342 L or I. {ECO:0000303|PubMed:31171860}. FT UNSURE 386 386 L or I. {ECO:0000303|PubMed:31171860}. FT UNSURE 407 407 L or I. {ECO:0000303|PubMed:31171860}. FT NON_CONS 12 13 {ECO:0000303|PubMed:31171860}. FT NON_CONS 53 54 {ECO:0000303|PubMed:31171860}. FT NON_CONS 61 62 {ECO:0000303|PubMed:31171860}. FT NON_CONS 82 83 {ECO:0000303|PubMed:31171860}. FT NON_CONS 159 160 {ECO:0000303|PubMed:31171860}. FT NON_CONS 210 211 {ECO:0000303|PubMed:31171860}. FT NON_CONS 378 379 {ECO:0000303|PubMed:31171860}. FT NON_TER 1 1 {ECO:0000303|PubMed:31171860}. FT NON_TER 413 413 {ECO:0000303|PubMed:31171860}. SQ SEQUENCE 413 AA; 37111 MW; BD8184C4CA899B13 CRC64; FDFSFLPQPP QEGLMGPRGP PGASGAPGPQ GFQGPAGEPG EPGQTGPAGA RGPGPPGKAG EGVVGPQGAR GFPGTPGLPG FKGEPGAPGE NGTPGQTGAR GLPGERGRVG APGPAGSRGS DGSVGPVGPA GPIGSAGPPG FPGAPGPKGE LGPVGNTGPG PAGPRGEQGL PGVSGPVGPP GNPGANGLTG AKGAAGLPGV AGAPGLPGPR TGARGLVGEP GPAGSKGESG GKGEPGSAGP QGPPGSSGEE GKRGPSGESG STGPTGPPGL RGGPGSRGLP GADGRAGVIG PAGARGASGP AGVRGPSGDT GRPGEPGLMG ARGLPGSPGN VGPAGKEGPA GLPGIDGRPG PIGPAGARGE AGNIGFPGPK GPAGDPGKGE KGHAGLAGNR GAPGPDGNNG AQGPPGLQGV QGG //