ID J3QR68_HUMAN Unreviewed; 404 AA. AC J3QR68; DT 03-OCT-2012, integrated into UniProtKB/TrEMBL. DT 03-OCT-2012, sequence version 1. DT 17-JUN-2020, entry version 56. DE SubName: Full=Haptoglobin {ECO:0000313|Ensembl:ENSP00000464070}; DE Flags: Fragment; GN Name=HP {ECO:0000313|Ensembl:ENSP00000464070}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; OC Homo. OX NCBI_TaxID=9606 {ECO:0000313|Ensembl:ENSP00000464070, ECO:0000313|Proteomes:UP000005640}; RN [1] {ECO:0000313|Ensembl:ENSP00000464070, ECO:0000313|Proteomes:UP000005640} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15616553; DOI=10.1038/nature03187; RA Martin J., Han C., Gordon L.A., Terry A., Prabhakar S., She X., Xie G., RA Hellsten U., Chan Y.M., Altherr M., Couronne O., Aerts A., Bajorek E., RA Black S., Blumer H., Branscomb E., Brown N.C., Bruno W.J., Buckingham J.M., RA Callen D.F., Campbell C.S., Campbell M.L., Campbell E.W., Caoile C., RA Challacombe J.F., Chasteen L.A., Chertkov O., Chi H.C., Christensen M., RA Clark L.M., Cohn J.D., Denys M., Detter J.C., Dickson M., RA Dimitrijevic-Bussod M., Escobar J., Fawcett J.J., Flowers D., Fotopulos D., RA Glavina T., Gomez M., Gonzales E., Goodstein D., Goodwin L.A., Grady D.L., RA Grigoriev I., Groza M., Hammon N., Hawkins T., Haydu L., Hildebrand C.E., RA Huang W., Israni S., Jett J., Jewett P.B., Kadner K., Kimball H., RA Kobayashi A., Krawczyk M.-C., Leyba T., Longmire J.L., Lopez F., Lou Y., RA Lowry S., Ludeman T., Manohar C.F., Mark G.A., McMurray K.L., Meincke L.J., RA Morgan J., Moyzis R.K., Mundt M.O., Munk A.C., Nandkeshwar R.D., RA Pitluck S., Pollard M., Predki P., Parson-Quintana B., Ramirez L., Rash S., RA Retterer J., Ricke D.O., Robinson D.L., Rodriguez A., Salamov A., RA Saunders E.H., Scott D., Shough T., Stallings R.L., Stalvey M., RA Sutherland R.D., Tapia R., Tesmer J.G., Thayer N., Thompson L.S., Tice H., RA Torney D.C., Tran-Gyamfi M., Tsai M., Ulanovsky L.E., Ustaszewska A., RA Vo N., White P.S., Williams A.L., Wills P.L., Wu J.-R., Wu K., Yang J., RA DeJong P., Bruce D., Doggett N.A., Deaven L., Schmutz J., Grimwood J., RA Richardson P., Rokhsar D.S., Eichler E.E., Gilna P., Lucas S.M., RA Myers R.M., Rubin E.M., Pennacchio L.A.; RT "The sequence and analysis of duplication-rich human chromosome 16."; RL Nature 432:988-994(2004). RN [2] {ECO:0000213|PubMed:21269460} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=21269460; DOI=10.1186/1752-0509-5-17; RA Burkard T.R., Planyavsky M., Kaupe I., Breitwieser F.P., Burckstummer T., RA Bennett K.L., Superti-Furga G., Colinge J.; RT "Initial characterization of the human central proteome."; RL BMC Syst. Biol. 5:17-17(2011). RN [3] {ECO:0000313|Ensembl:ENSP00000464070} RP IDENTIFICATION. RG Ensembl; RL Submitted (AUG-2012) to UniProtKB. RN [4] {ECO:0000213|PubMed:24275569} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=24275569; DOI=10.1016/j.jprot.2013.11.014; RA Bian Y., Song C., Cheng K., Dong M., Wang F., Huang J., Sun D., Wang L., RA Ye M., Zou H.; RT "An enzyme assisted RP-RPLC approach for in-depth analysis of human liver RT phosphoproteome."; RL J. Proteomics 96:253-262(2014). CC -!- SIMILARITY: Belongs to the peptidase S1 family. CC {ECO:0000256|SAAS:SAAS00886988}. CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00302}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; AC009087; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR jPOST; J3QR68; -. DR PeptideAtlas; J3QR68; -. DR PRIDE; J3QR68; -. DR Antibodypedia; 48338; 1134 antibodies. DR Ensembl; ENST00000567185; ENSP00000464070; ENSG00000257017. DR UCSC; uc059wyk.1; human. DR EuPathDB; HostDB:ENSG00000257017.8; -. DR HGNC; HGNC:5141; HP. DR OpenTargets; ENSG00000257017; -. DR eggNOG; KOG3627; Eukaryota. DR eggNOG; COG5640; LUCA. DR GeneTree; ENSGT00940000159903; -. DR ChiTaRS; HP; human. DR Proteomes; UP000005640; Chromosome 16. DR Bgee; ENSG00000257017; Expressed in liver and 209 other tissues. DR ExpressionAtlas; J3QR68; baseline and differential. DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro. DR CDD; cd00033; CCP; 2. DR CDD; cd00190; Tryp_SPc; 1. DR InterPro; IPR009003; Peptidase_S1_PA. DR InterPro; IPR001314; Peptidase_S1A. DR InterPro; IPR035976; Sushi/SCR/CCP_sf. DR InterPro; IPR000436; Sushi_SCR_CCP_dom. DR InterPro; IPR001254; Trypsin_dom. DR Pfam; PF00089; Trypsin; 1. DR PRINTS; PR00722; CHYMOTRYPSIN. DR SMART; SM00032; CCP; 2. DR SMART; SM00020; Tryp_SPc; 1. DR SUPFAM; SSF50494; SSF50494; 1. DR SUPFAM; SSF57535; SSF57535; 2. DR PROSITE; PS50923; SUSHI; 2. DR PROSITE; PS50240; TRYPSIN_DOM; 1. PE 1: Evidence at protein level; KW Disulfide bond {ECO:0000256|SAAS:SAAS00887447}; KW Proteomics identification {ECO:0000213|MaxQB:J3QR68, KW ECO:0000213|PeptideAtlas:J3QR68, ECO:0000213|ProteomicsDB:J3QR68}; KW Reference proteome {ECO:0000313|Proteomes:UP000005640}; KW Signal {ECO:0000256|SAM:SignalP}; KW Sushi {ECO:0000256|PROSITE-ProRule:PRU00302}. FT SIGNAL 1..17 FT /evidence="ECO:0000256|SAM:SignalP" FT CHAIN 18..404 FT /evidence="ECO:0000256|SAM:SignalP" FT /id="PRO_5003776716" FT DOMAIN 30..87 FT /note="Sushi" FT /evidence="ECO:0000259|PROSITE:PS50923" FT DOMAIN 89..145 FT /note="Sushi" FT /evidence="ECO:0000259|PROSITE:PS50923" FT DOMAIN 160..402 FT /note="Peptidase S1" FT /evidence="ECO:0000259|PROSITE:PS50240" FT NON_TER 1 FT /evidence="ECO:0000313|Ensembl:ENSP00000464070" SQ SEQUENCE 404 AA; 45041 MW; FED9C71B77EBBF64 CRC64; XALGAVIALL LWGQLFAVDS GNDVTDIADD GCPKPPEIAH GYVEHSVRYQ CKNYYKLRTE GDGVYTLNDK KQWINKAVGD KLPECEADDG CPKPPEIAHG YVEHSVRYQC KNYYKLRTEG DVYTLNNEKQ WINKAVGDKL PECEAVCGKP KNPANPVQRI LGGHLDAKGS FPWQAKMVSH HNLTTGATLI NEQWLLTTAK NLFLNHSENA TAKDIAPTLT LYVGKKQLVE IEKVVLHPNY SQVDIGLIKL KQKVSVNERV MPICLPSKDY AEVGRVGYVS GWGRNANFKF TDHLKYVMLP VADQDQCIRH YEGSTVPEKK TPKSPVGVQP ILNEHTFCAG MSKYQEDTCY GDAGSAFAVH DLEEDTWYAT GILSFDKSCA VAEYGVYVKV TSIQDWVQKT IAEN //