ID J3QR68_HUMAN Unreviewed; 404 AA. AC J3QR68; DT 03-OCT-2012, integrated into UniProtKB/TrEMBL. DT 03-OCT-2012, sequence version 1. DT 24-JUL-2024, entry version 75. DE SubName: Full=Haptoglobin {ECO:0000313|Ensembl:ENSP00000464070.1}; DE Flags: Fragment; GN Name=HP {ECO:0000313|Ensembl:ENSP00000464070.1}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; OC Homo. OX NCBI_TaxID=9606 {ECO:0000313|Ensembl:ENSP00000464070.1, ECO:0000313|Proteomes:UP000005640}; RN [1] {ECO:0000313|Proteomes:UP000005640} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15616553; DOI=10.1038/nature03187; RA Martin J., Han C., Gordon L.A., Terry A., Prabhakar S., She X., Xie G., RA Hellsten U., Chan Y.M., Altherr M., Couronne O., Aerts A., Bajorek E., RA Black S., Blumer H., Branscomb E., Brown N.C., Bruno W.J., Buckingham J.M., RA Callen D.F., Campbell C.S., Campbell M.L., Campbell E.W., Caoile C., RA Challacombe J.F., Chasteen L.A., Chertkov O., Chi H.C., Christensen M., RA Clark L.M., Cohn J.D., Denys M., Detter J.C., Dickson M., RA Dimitrijevic-Bussod M., Escobar J., Fawcett J.J., Flowers D., Fotopulos D., RA Glavina T., Gomez M., Gonzales E., Goodstein D., Goodwin L.A., Grady D.L., RA Grigoriev I., Groza M., Hammon N., Hawkins T., Haydu L., Hildebrand C.E., RA Huang W., Israni S., Jett J., Jewett P.B., Kadner K., Kimball H., RA Kobayashi A., Krawczyk M.-C., Leyba T., Longmire J.L., Lopez F., Lou Y., RA Lowry S., Ludeman T., Manohar C.F., Mark G.A., McMurray K.L., Meincke L.J., RA Morgan J., Moyzis R.K., Mundt M.O., Munk A.C., Nandkeshwar R.D., RA Pitluck S., Pollard M., Predki P., Parson-Quintana B., Ramirez L., Rash S., RA Retterer J., Ricke D.O., Robinson D.L., Rodriguez A., Salamov A., RA Saunders E.H., Scott D., Shough T., Stallings R.L., Stalvey M., RA Sutherland R.D., Tapia R., Tesmer J.G., Thayer N., Thompson L.S., Tice H., RA Torney D.C., Tran-Gyamfi M., Tsai M., Ulanovsky L.E., Ustaszewska A., RA Vo N., White P.S., Williams A.L., Wills P.L., Wu J.-R., Wu K., Yang J., RA DeJong P., Bruce D., Doggett N.A., Deaven L., Schmutz J., Grimwood J., RA Richardson P., Rokhsar D.S., Eichler E.E., Gilna P., Lucas S.M., RA Myers R.M., Rubin E.M., Pennacchio L.A.; RT "The sequence and analysis of duplication-rich human chromosome 16."; RL Nature 432:988-994(2004). RN [2] {ECO:0007829|PubMed:21269460} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=21269460; DOI=10.1186/1752-0509-5-17; RA Burkard T.R., Planyavsky M., Kaupe I., Breitwieser F.P., Burckstummer T., RA Bennett K.L., Superti-Furga G., Colinge J.; RT "Initial characterization of the human central proteome."; RL BMC Syst. Biol. 5:17-17(2011). RN [3] {ECO:0007829|PubMed:24275569} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=24275569; DOI=10.1016/j.jprot.2013.11.014; RA Bian Y., Song C., Cheng K., Dong M., Wang F., Huang J., Sun D., Wang L., RA Ye M., Zou H.; RT "An enzyme assisted RP-RPLC approach for in-depth analysis of human liver RT phosphoproteome."; RL J. Proteomics 96:253-262(2014). RN [4] {ECO:0000313|Ensembl:ENSP00000464070.1} RP IDENTIFICATION. RG Ensembl; RL Submitted (APR-2024) to UniProtKB. CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00302}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; AC009087; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR MassIVE; J3QR68; -. DR Antibodypedia; 48338; 1292 antibodies from 45 providers. DR Ensembl; ENST00000567185.7; ENSP00000464070.1; ENSG00000257017.10. DR UCSC; uc059wyk.1; human. DR HGNC; HGNC:5141; HP. DR VEuPathDB; HostDB:ENSG00000257017; -. DR GeneTree; ENSGT00940000159903; -. DR ChiTaRS; HP; human. DR Proteomes; UP000005640; Chromosome 16. DR Bgee; ENSG00000257017; Expressed in pericardium and 167 other cell types or tissues. DR ExpressionAtlas; J3QR68; baseline and differential. DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro. DR GO; GO:0006508; P:proteolysis; IEA:InterPro. DR CDD; cd00033; CCP; 2. DR CDD; cd00190; Tryp_SPc; 1. DR Gene3D; 2.10.70.10; Complement Module, domain 1; 2. DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 2. DR InterPro; IPR009003; Peptidase_S1_PA. DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin. DR InterPro; IPR001314; Peptidase_S1A. DR InterPro; IPR035976; Sushi/SCR/CCP_sf. DR InterPro; IPR000436; Sushi_SCR_CCP_dom. DR InterPro; IPR001254; Trypsin_dom. DR PANTHER; PTHR24255; COMPLEMENT COMPONENT 1, S SUBCOMPONENT-RELATED; 1. DR PANTHER; PTHR24255:SF30; HAPTOGLOBIN; 1. DR Pfam; PF00089; Trypsin; 1. DR PRINTS; PR00722; CHYMOTRYPSIN. DR SMART; SM00032; CCP; 2. DR SMART; SM00020; Tryp_SPc; 1. DR SUPFAM; SSF57535; Complement control module/SCR domain; 2. DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1. DR PROSITE; PS50923; SUSHI; 2. DR PROSITE; PS50240; TRYPSIN_DOM; 1. PE 1: Evidence at protein level; KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157}; KW Proteomics identification {ECO:0007829|PeptideAtlas:J3QR68, KW ECO:0007829|ProteomicsDB:J3QR68}; KW Reference proteome {ECO:0000313|Proteomes:UP000005640}; KW Signal {ECO:0000256|SAM:SignalP}; KW Sushi {ECO:0000256|ARBA:ARBA00022659, ECO:0000256|PROSITE- KW ProRule:PRU00302}. FT SIGNAL 1..17 FT /evidence="ECO:0000256|SAM:SignalP" FT CHAIN 18..404 FT /evidence="ECO:0000256|SAM:SignalP" FT /id="PRO_5003776716" FT DOMAIN 30..87 FT /note="Sushi" FT /evidence="ECO:0000259|PROSITE:PS50923" FT DOMAIN 89..145 FT /note="Sushi" FT /evidence="ECO:0000259|PROSITE:PS50923" FT DOMAIN 160..402 FT /note="Peptidase S1" FT /evidence="ECO:0000259|PROSITE:PS50240" FT NON_TER 1 FT /evidence="ECO:0000313|Ensembl:ENSP00000464070.1" SQ SEQUENCE 404 AA; 45041 MW; FED9C71B77EBBF64 CRC64; XALGAVIALL LWGQLFAVDS GNDVTDIADD GCPKPPEIAH GYVEHSVRYQ CKNYYKLRTE GDGVYTLNDK KQWINKAVGD KLPECEADDG CPKPPEIAHG YVEHSVRYQC KNYYKLRTEG DVYTLNNEKQ WINKAVGDKL PECEAVCGKP KNPANPVQRI LGGHLDAKGS FPWQAKMVSH HNLTTGATLI NEQWLLTTAK NLFLNHSENA TAKDIAPTLT LYVGKKQLVE IEKVVLHPNY SQVDIGLIKL KQKVSVNERV MPICLPSKDY AEVGRVGYVS GWGRNANFKF TDHLKYVMLP VADQDQCIRH YEGSTVPEKK TPKSPVGVQP ILNEHTFCAG MSKYQEDTCY GDAGSAFAVH DLEEDTWYAT GILSFDKSCA VAEYGVYVKV TSIQDWVQKT IAEN //