ID A0A0G2K4H0_RAT Unreviewed; 797 AA. AC A0A0G2K4H0; DT 28-JUN-2023, integrated into UniProtKB/TrEMBL. DT 28-JUN-2023, sequence version 1. DT 08-NOV-2023, entry version 3. DE SubName: Full=Thrombospondin, type I, domain 1 (Predicted), isoform CRA_b {ECO:0000313|EMBL:EDM08994.1}; DE SubName: Full=Thsd1 protein {ECO:0000313|EMBL:AAI62035.1}; GN Name=Thsd1 {ECO:0000313|EMBL:AAI62035.1, ECO:0000313|RGD:1306998}; GN Synonyms=Thsd1_predicted {ECO:0000313|EMBL:EDM08994.1}; GN ORFNames=rCG_43190 {ECO:0000313|EMBL:EDM08994.1}; OS Rattus norvegicus (Rat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; OC Murinae; Rattus. OX NCBI_TaxID=10116 {ECO:0000313|EMBL:AAI62035.1}; RN [1] {ECO:0000313|EMBL:AAI62035.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RC TISSUE=Lung {ECO:0000313|EMBL:AAI62035.1}; RX PubMed=15489334; DOI=10.1101/gr.2596504; RG The MGC Project Team; RA Gerhard D.S., Wagner L., Feingold E.A., Shenmen C.M., Grouse L.H., RA Schuler G., Klein S.L., Old S., Rasooly R., Good P., Guyer M., Peck A.M., RA Derge J.G., Lipman D., Collins F.S., Jang W., Sherry S., Feolo M., RA Misquitta L., Lee E., Rotmistrovsky K., Greenhut S.F., Schaefer C.F., RA Buetow K., Bonner T.I., Haussler D., Kent J., Kiekhaus M., Furey T., RA Brent M., Prange C., Schreiber K., Shapiro N., Bhat N.K., Hopkins R.F., RA Hsie F., Driscoll T., Soares M.B., Casavant T.L., Scheetz T.E., RA Brown-stein M.J., Usdin T.B., Toshiyuki S., Carninci P., Piao Y., RA Dudekula D.B., Ko M.S., Kawakami K., Suzuki Y., Sugano S., Gruber C.E., RA Smith M.R., Simmons B., Moore T., Waterman R., Johnson S.L., Ruan Y., RA Wei C.L., Mathavan S., Gunaratne P.H., Wu J., Garcia A.M., Hulyk S.W., RA Fuh E., Yuan Y., Sneed A., Kowis C., Hodgson A., Muzny D.M., McPherson J., RA Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., RA Sanchez A., Whiting M., Madari A., Young A.C., Wetherby K.D., Granite S.J., RA Kwong P.N., Brinkley C.P., Pearson R.L., Bouffard G.G., Blakesly R.W., RA Green E.D., Dickson M.C., Rodriguez A.C., Grimwood J., Schmutz J., RA Myers R.M., Butterfield Y.S., Griffith M., Griffith O.L., Krzywinski M.I., RA Liao N., Morin R., Morrin R., Palmquist D., Petrescu A.S., Skalska U., RA Smailus D.E., Stott J.M., Schnerch A., Schein J.E., Jones S.J., Holt R.A., RA Baross A., Marra M.A., Clifton S., Makowski K.A., Bosak S., Malek J.; RT "The status, quality, and expansion of the NIH full-length cDNA project: RT the Mammalian Gene Collection (MGC)."; RL Genome Res. 14:2121-2127(2004). RN [2] {ECO:0000313|EMBL:EDM08994.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=BN {ECO:0000313|EMBL:EDM08994.1}; RX PubMed=15632090; DOI=10.1101/gr.2889405; RA Florea L., Di Francesco V., Miller J., Turner R., Yao A., Harris M., RA Walenz B., Mobarry C., Merkulov G.V., Charlab R., Dew I., Deng Z., RA Istrail S., Li P., Sutton G.; RT "Gene and alternative splicing annotation with AIR."; RL Genome Res. 15:54-66(2005). RN [3] {ECO:0000313|EMBL:EDM08994.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=BN {ECO:0000313|EMBL:EDM08994.1}; RA Mural R.J., Li P.W., Adams M.D., Amanatides P.G., Baden-Tillson H., RA Barnstead M., Chin S.H., Dew I., Evans C.A., Ferriera S., Flanigan M., RA Fosler C., Glodek A., Gu Z., Holt R.A., Jennings D., Kraft C.L., Lu F., RA Nguyen T., Nusskern D.R., Pfannkoch C.M., Sitter C., Sutton G.G., RA Venter J.C., Wang Z., Woodage T., Zheng X.H., Zhong F.; RL Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases. RN [4] {ECO:0007829|PubMed:22673903} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=22673903; RA Lundby A., Secher A., Lage K., Nordsborg N.B., Dmytriyev A., Lundby C., RA Olsen J.V.; RT "Quantitative maps of protein phosphorylation sites across 14 different rat RT organs and tissues."; RL Nat. Commun. 3:876-876(2012). CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; BC162035; AAI62035.1; -; mRNA. DR EMBL; CH473970; EDM08994.1; -; Genomic_DNA. DR RefSeq; XP_006253419.1; XM_006253357.3. DR GeneID; 364630; -. DR AGR; RGD:1306998; -. DR RGD; 1306998; Thsd1. DR OrthoDB; 5301340at2759; -. DR Proteomes; UP000234681; Chromosome 16. DR Bgee; ENSRNOG00000012108; Expressed in lung and 19 other tissues. DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW. DR InterPro; IPR038877; THSD1. DR PANTHER; PTHR16311; THROMBOSPONDIN TYPE I DOMAIN-CONTAINING 1; 1. DR PANTHER; PTHR16311:SF3; THROMBOSPONDIN TYPE-1 DOMAIN-CONTAINING PROTEIN 1; 1. PE 1: Evidence at protein level; KW Membrane {ECO:0000256|SAM:Phobius}; Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1..22 FT /evidence="ECO:0000256|SAM:SignalP" FT CHAIN 23..797 FT /evidence="ECO:0000256|SAM:SignalP" FT /id="PRO_5039970053" FT TRANSMEM 356..378 FT /note="Helical" FT /evidence="ECO:0000256|SAM:Phobius" FT REGION 414..459 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 532..722 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 558..578 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 579..607 FT /note="Basic and acidic residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 629..649 FT /note="Basic and acidic residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" SQ SEQUENCE 797 AA; 88671 MW; 935E590EA2642E23 CRC64; MKPMLKDFSN LLLVVLCDYV LGEADYLVLR EPVHVALSDR TVSVGFHYLG DVNGTLRNVT VMLWEANTNR TLTTKYLLTN QAQGTLQFEC FYFKEAGDYW FVMIPEETDN GTRVPLWEKS ASLKVEWPVF HIDLNRTAKA AEGTFQVGIF TSQPLCLFPV DKPDLLVDVI FKDSLPEART SLGQPLEIRA SKRTRLTQGQ WVEFGCAPLG VEAYVTVMLR LLGQDSVIAS TGPIDLAQKF GYKLMMAPEV RCESVLEVMV LPPPCVFVQG VLAVYKEAPK APEERTFQMA ENRLPLGERR SVFNCTLFDV GKNKYCFNFG ILKKSHFSAK GCMLIQRNIV FRPPDPSPDP EKYNNVVTVT GISLCLFIIF ATVLITLWRR FGRAPKCSTP ARHNSIHSPG FRKNSDEENI CELSEPRGSF SDAGDGPRGS PGDTGIPLTY RCSASAPPED EASGSESFQS NAQKIIPPLF SYRLAQQQLK EMKKKGLTET TKVYHVSQSP LTDTVVDATA SPPLDLECPE EAAASKFRIK SPFLDQPGAG PGERPPSRLD GVLPPPSCAV SPSQTLIRKS QMRSTGGRDG SSERGHCRSS LFRRTASFHE TKQARPFRER SLSALTPRQA PAYSSRMRTW EQMEDRGRPP SRSAHLLPER PEHFQGAAGR ASSPLGPLSK SYTVGHPRRK PDPGDRQAGL VAGAAAEKME PHRAHRGPSP SHRSASRKQT SPVFLKDSYQ KVSQLSPSHF RKDKCQSFPI HPEFAFYDNT SFRLTEAEQR MLDLPGYFGS NEEDETTSTL SVEKLVI //