ID J3QR68_HUMAN Unreviewed; 404 AA. AC J3QR68; DT 03-OCT-2012, integrated into UniProtKB/TrEMBL. DT 03-OCT-2012, sequence version 1. DT 15-FEB-2017, entry version 35. DE SubName: Full=Haptoglobin {ECO:0000313|Ensembl:ENSP00000464070}; DE Flags: Fragment; GN Name=HP {ECO:0000313|Ensembl:ENSP00000464070}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606 {ECO:0000313|Ensembl:ENSP00000464070, ECO:0000313|Proteomes:UP000005640}; RN [1] {ECO:0000313|Ensembl:ENSP00000464070, ECO:0000313|Proteomes:UP000005640} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15616553; DOI=10.1038/nature03187; RA Martin J., Han C., Gordon L.A., Terry A., Prabhakar S., She X., RA Xie G., Hellsten U., Chan Y.M., Altherr M., Couronne O., Aerts A., RA Bajorek E., Black S., Blumer H., Branscomb E., Brown N.C., Bruno W.J., RA Buckingham J.M., Callen D.F., Campbell C.S., Campbell M.L., RA Campbell E.W., Caoile C., Challacombe J.F., Chasteen L.A., RA Chertkov O., Chi H.C., Christensen M., Clark L.M., Cohn J.D., RA Denys M., Detter J.C., Dickson M., Dimitrijevic-Bussod M., Escobar J., RA Fawcett J.J., Flowers D., Fotopulos D., Glavina T., Gomez M., RA Gonzales E., Goodstein D., Goodwin L.A., Grady D.L., Grigoriev I., RA Groza M., Hammon N., Hawkins T., Haydu L., Hildebrand C.E., Huang W., RA Israni S., Jett J., Jewett P.B., Kadner K., Kimball H., Kobayashi A., RA Krawczyk M.-C., Leyba T., Longmire J.L., Lopez F., Lou Y., Lowry S., RA Ludeman T., Manohar C.F., Mark G.A., McMurray K.L., Meincke L.J., RA Morgan J., Moyzis R.K., Mundt M.O., Munk A.C., Nandkeshwar R.D., RA Pitluck S., Pollard M., Predki P., Parson-Quintana B., Ramirez L., RA Rash S., Retterer J., Ricke D.O., Robinson D.L., Rodriguez A., RA Salamov A., Saunders E.H., Scott D., Shough T., Stallings R.L., RA Stalvey M., Sutherland R.D., Tapia R., Tesmer J.G., Thayer N., RA Thompson L.S., Tice H., Torney D.C., Tran-Gyamfi M., Tsai M., RA Ulanovsky L.E., Ustaszewska A., Vo N., White P.S., Williams A.L., RA Wills P.L., Wu J.-R., Wu K., Yang J., DeJong P., Bruce D., RA Doggett N.A., Deaven L., Schmutz J., Grimwood J., Richardson P., RA Rokhsar D.S., Eichler E.E., Gilna P., Lucas S.M., Myers R.M., RA Rubin E.M., Pennacchio L.A.; RT "The sequence and analysis of duplication-rich human chromosome 16."; RL Nature 432:988-994(2004). RN [2] {ECO:0000213|PubMed:21269460} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=21269460; DOI=10.1186/1752-0509-5-17; RA Burkard T.R., Planyavsky M., Kaupe I., Breitwieser F.P., RA Burckstummer T., Bennett K.L., Superti-Furga G., Colinge J.; RT "Initial characterization of the human central proteome."; RL BMC Syst. Biol. 5:17-17(2011). RN [3] {ECO:0000313|Ensembl:ENSP00000464070} RP IDENTIFICATION. RG Ensembl; RL Submitted (AUG-2012) to UniProtKB. RN [4] {ECO:0000213|PubMed:24275569} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=24275569; DOI=10.1016/j.jprot.2013.11.014; RA Bian Y., Song C., Cheng K., Dong M., Wang F., Huang J., Sun D., RA Wang L., Ye M., Zou H.; RT "An enzyme assisted RP-RPLC approach for in-depth analysis of human RT liver phosphoproteome."; RL J. Proteomics 96:253-262(2014). CC -!- SIMILARITY: Belongs to the peptidase S1 family. CC {ECO:0000256|SAAS:SAAS00597082}. CC -!- CAUTION: Lacks conserved residue(s) required for the propagation CC of feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00302}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSP00000464070}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; AC009087; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR ProteinModelPortal; J3QR68; -. DR STRING; 9606.ENSP00000348170; -. DR PaxDb; J3QR68; -. DR PeptideAtlas; J3QR68; -. DR PRIDE; J3QR68; -. DR Ensembl; ENST00000567185; ENSP00000464070; ENSG00000257017. DR UCSC; uc059wyk.1; human. DR HGNC; HGNC:5141; HP. DR OpenTargets; ENSG00000257017; -. DR eggNOG; KOG3627; Eukaryota. DR eggNOG; COG5640; LUCA. DR GeneTree; ENSGT00760000118890; -. DR Proteomes; UP000005640; Chromosome 16. DR ExpressionAtlas; J3QR68; baseline and differential. DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro. DR CDD; cd00033; CCP; 2. DR CDD; cd00190; Tryp_SPc; 1. DR InterPro; IPR009003; Peptidase_S1_PA. DR InterPro; IPR001314; Peptidase_S1A. DR InterPro; IPR000436; Sushi_SCR_CCP_dom. DR InterPro; IPR001254; Trypsin_dom. DR Pfam; PF00089; Trypsin; 1. DR PRINTS; PR00722; CHYMOTRYPSIN. DR SMART; SM00032; CCP; 2. DR SMART; SM00020; Tryp_SPc; 1. DR SUPFAM; SSF50494; SSF50494; 1. DR SUPFAM; SSF57535; SSF57535; 2. DR PROSITE; PS50923; SUSHI; 2. DR PROSITE; PS50240; TRYPSIN_DOM; 1. PE 1: Evidence at protein level; KW Complete proteome {ECO:0000313|Proteomes:UP000005640}; KW Disulfide bond {ECO:0000256|SAAS:SAAS00597225}; KW Proteomics identification {ECO:0000213|MaxQB:J3QR68, KW ECO:0000213|PeptideAtlas:J3QR68}; KW Reference proteome {ECO:0000313|Proteomes:UP000005640}; KW Signal {ECO:0000256|SAM:SignalP}; KW Sushi {ECO:0000256|PROSITE-ProRule:PRU00302}. FT SIGNAL 1 17 {ECO:0000256|SAM:SignalP}. FT CHAIN 18 404 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003776716. FT DOMAIN 30 87 Sushi (CCP/SCR). {ECO:0000259|PROSITE: FT PS50923}. FT DOMAIN 89 145 Sushi (CCP/SCR). {ECO:0000259|PROSITE: FT PS50923}. FT DOMAIN 160 402 Peptidase S1. {ECO:0000259|PROSITE: FT PS50240}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSP00000464070}. SQ SEQUENCE 404 AA; 45041 MW; FED9C71B77EBBF64 CRC64; XALGAVIALL LWGQLFAVDS GNDVTDIADD GCPKPPEIAH GYVEHSVRYQ CKNYYKLRTE GDGVYTLNDK KQWINKAVGD KLPECEADDG CPKPPEIAHG YVEHSVRYQC KNYYKLRTEG DVYTLNNEKQ WINKAVGDKL PECEAVCGKP KNPANPVQRI LGGHLDAKGS FPWQAKMVSH HNLTTGATLI NEQWLLTTAK NLFLNHSENA TAKDIAPTLT LYVGKKQLVE IEKVVLHPNY SQVDIGLIKL KQKVSVNERV MPICLPSKDY AEVGRVGYVS GWGRNANFKF TDHLKYVMLP VADQDQCIRH YEGSTVPEKK TPKSPVGVQP ILNEHTFCAG MSKYQEDTCY GDAGSAFAVH DLEEDTWYAT GILSFDKSCA VAEYGVYVKV TSIQDWVQKT IAEN //