ID   H0Y300_HUMAN            Unreviewed;       442 AA.
AC   H0Y300;
DT   22-FEB-2012, integrated into UniProtKB/TrEMBL.
DT   19-FEB-2014, sequence version 4.
DT   07-JUN-2017, entry version 47.
DE   SubName: Full=Haptoglobin {ECO:0000313|Ensembl:ENSP00000350406};
GN   Name=HP {ECO:0000313|Ensembl:ENSP00000350406};
OS   Homo sapiens (Human).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
OC   Catarrhini; Hominidae; Homo.
OX   NCBI_TaxID=9606 {ECO:0000313|Ensembl:ENSP00000350406, ECO:0000313|Proteomes:UP000005640};
RN   [1] {ECO:0000313|Ensembl:ENSP00000350406, ECO:0000313|Proteomes:UP000005640}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=15616553; DOI=10.1038/nature03187;
RA   Martin J., Han C., Gordon L.A., Terry A., Prabhakar S., She X.,
RA   Xie G., Hellsten U., Chan Y.M., Altherr M., Couronne O., Aerts A.,
RA   Bajorek E., Black S., Blumer H., Branscomb E., Brown N.C., Bruno W.J.,
RA   Buckingham J.M., Callen D.F., Campbell C.S., Campbell M.L.,
RA   Campbell E.W., Caoile C., Challacombe J.F., Chasteen L.A.,
RA   Chertkov O., Chi H.C., Christensen M., Clark L.M., Cohn J.D.,
RA   Denys M., Detter J.C., Dickson M., Dimitrijevic-Bussod M., Escobar J.,
RA   Fawcett J.J., Flowers D., Fotopulos D., Glavina T., Gomez M.,
RA   Gonzales E., Goodstein D., Goodwin L.A., Grady D.L., Grigoriev I.,
RA   Groza M., Hammon N., Hawkins T., Haydu L., Hildebrand C.E., Huang W.,
RA   Israni S., Jett J., Jewett P.B., Kadner K., Kimball H., Kobayashi A.,
RA   Krawczyk M.-C., Leyba T., Longmire J.L., Lopez F., Lou Y., Lowry S.,
RA   Ludeman T., Manohar C.F., Mark G.A., McMurray K.L., Meincke L.J.,
RA   Morgan J., Moyzis R.K., Mundt M.O., Munk A.C., Nandkeshwar R.D.,
RA   Pitluck S., Pollard M., Predki P., Parson-Quintana B., Ramirez L.,
RA   Rash S., Retterer J., Ricke D.O., Robinson D.L., Rodriguez A.,
RA   Salamov A., Saunders E.H., Scott D., Shough T., Stallings R.L.,
RA   Stalvey M., Sutherland R.D., Tapia R., Tesmer J.G., Thayer N.,
RA   Thompson L.S., Tice H., Torney D.C., Tran-Gyamfi M., Tsai M.,
RA   Ulanovsky L.E., Ustaszewska A., Vo N., White P.S., Williams A.L.,
RA   Wills P.L., Wu J.-R., Wu K., Yang J., DeJong P., Bruce D.,
RA   Doggett N.A., Deaven L., Schmutz J., Grimwood J., Richardson P.,
RA   Rokhsar D.S., Eichler E.E., Gilna P., Lucas S.M., Myers R.M.,
RA   Rubin E.M., Pennacchio L.A.;
RT   "The sequence and analysis of duplication-rich human chromosome 16.";
RL   Nature 432:988-994(2004).
RN   [2] {ECO:0000213|PubMed:21269460}
RP   IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RX   PubMed=21269460; DOI=10.1186/1752-0509-5-17;
RA   Burkard T.R., Planyavsky M., Kaupe I., Breitwieser F.P.,
RA   Burckstummer T., Bennett K.L., Superti-Furga G., Colinge J.;
RT   "Initial characterization of the human central proteome.";
RL   BMC Syst. Biol. 5:17-17(2011).
RN   [3] {ECO:0000313|Ensembl:ENSP00000350406}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (JAN-2012) to UniProtKB.
RN   [4] {ECO:0000213|PubMed:24275569}
RP   IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RX   PubMed=24275569; DOI=10.1016/j.jprot.2013.11.014;
RA   Bian Y., Song C., Cheng K., Dong M., Wang F., Huang J., Sun D.,
RA   Wang L., Ye M., Zou H.;
RT   "An enzyme assisted RP-RPLC approach for in-depth analysis of human
RT   liver phosphoproteome.";
RL   J. Proteomics 96:253-262(2014).
CC   -!- SIMILARITY: Belongs to the peptidase S1 family.
CC       {ECO:0000256|SAAS:SAAS00597082}.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation
CC       of feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00302}.
CC   -!- CAUTION: The sequence shown here is derived from an Ensembl
CC       automatic analysis pipeline and should be considered as
CC       preliminary data. {ECO:0000313|Ensembl:ENSP00000350406}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AC009087; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   ProteinModelPortal; H0Y300; -.
DR   SMR; H0Y300; -.
DR   PeptideAtlas; H0Y300; -.
DR   PRIDE; H0Y300; -.
DR   Ensembl; ENST00000357763; ENSP00000350406; ENSG00000257017.
DR   UCSC; uc059wyd.1; human.
DR   HGNC; HGNC:5141; HP.
DR   OpenTargets; ENSG00000257017; -.
DR   GeneTree; ENSGT00760000118890; -.
DR   OMA; NPVDQVQ; -.
DR   OrthoDB; EOG091G099M; -.
DR   Proteomes; UP000005640; Chromosome 16.
DR   Bgee; ENSG00000257017; -.
DR   ExpressionAtlas; H0Y300; baseline and differential.
DR   GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR   GO; GO:0030492; F:hemoglobin binding; IEA:InterPro.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   CDD; cd00033; CCP; 2.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   InterPro; IPR008292; Haptoglobin.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR000436; Sushi_SCR_CCP_dom.
DR   InterPro; IPR001254; Trypsin_dom.
DR   PANTHER; PTHR24256:SF389; PTHR24256:SF389; 1.
DR   Pfam; PF00089; Trypsin; 1.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF50494; SSF50494; 1.
DR   SUPFAM; SSF57535; SSF57535; 1.
DR   PROSITE; PS50923; SUSHI; 1.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
PE   1: Evidence at protein level;
KW   Complete proteome {ECO:0000313|Proteomes:UP000005640};
KW   Disulfide bond {ECO:0000256|SAAS:SAAS00597225};
KW   Proteomics identification {ECO:0000213|MaxQB:H0Y300,
KW   ECO:0000213|PeptideAtlas:H0Y300};
KW   Reference proteome {ECO:0000313|Proteomes:UP000005640};
KW   Signal {ECO:0000256|SAM:SignalP};
KW   Sushi {ECO:0000256|PROSITE-ProRule:PRU00302}.
FT   SIGNAL        1     18       {ECO:0000256|SAM:SignalP}.
FT   CHAIN        19    442       {ECO:0000256|SAM:SignalP}.
FT                                /FTId=PRO_5003545834.
FT   DOMAIN       31     88       Sushi (CCP/SCR). {ECO:0000259|PROSITE:
FT                                PS50923}.
FT   DOMAIN      198    440       Peptidase S1. {ECO:0000259|PROSITE:
FT                                PS50240}.
SQ   SEQUENCE   442 AA;  49106 MW;  E0FE40771335015F CRC64;
     MSALGAVIAL LLWGQLFAVD SGNDVTDIAD DGCPKPPEIA HGYVEHSVRY QCKNYYKLRT
     EGDGVYTLND KKQWINKAVG DKLPECEADD GCPKPPEIAH GYVEHSVRYQ CKNYYKLRTE
     GDGEAMPCSL PHVNLRVTFT SPADVGKEGM LMMMSPSPRV YTLNNEKQWI NKAVGDKLPE
     CEAVCGKPKN PANPVQRILG GHLDAKGSFP WQAKMVSHHN LTTGATLINE QWLLTTAKNL
     FLNHSENATA KDIAPTLTLY VGKKQLVEIE KVVLHPNYSQ VDIGLIKLKQ KVSVNERVMP
     ICLPSKDYAE VGRVGYVSGW GRNANFKFTD HLKYVMLPVA DQDQCIRHYE GSTVPEKKTP
     KSPVGVQPIL NEHTFCAGMS KYQEDTCYGD AGSAFAVHDL EEDTWYATGI LSFDKSCAVA
     EYGVYVKVTS IQDWVQKTIA EN
//