ID LRP4_HUMAN Reviewed; 1950 AA. AC O75096; DT 13-APR-2004, integrated into UniProtKB/Swiss-Prot. DT 10-MAY-2004, sequence version 2. DT 05-MAY-2009, entry version 71. DE RecName: Full=Low-density lipoprotein receptor-related protein 4; DE AltName: Full=Multiple epidermal growth factor-like domains 7; DE Flags: Precursor; GN Name=LRP4; Synonyms=KIAA0816, MEGF7; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=16554811; DOI=10.1038/nature04632; RA Taylor T.D., Noguchi H., Totoki Y., Toyoda A., Kuroki Y., Dewar K., RA Lloyd C., Itoh T., Takeda T., Kim D.-W., She X., Barlow K.F., RA Bloom T., Bruford E., Chang J.L., Cuomo C.A., Eichler E., RA FitzGerald M.G., Jaffe D.B., LaButti K., Nicol R., Park H.-S., RA Seaman C., Sougnez C., Yang X., Zimmer A.R., Zody M.C., Birren B.W., RA Nusbaum C., Fujiyama A., Hattori M., Rogers J., Lander E.S., RA Sakaki Y.; RT "Human chromosome 11 DNA sequence and analysis including novel gene RT identification."; RL Nature 440:497-500(2006). RN [2] RP NUCLEOTIDE SEQUENCE [MRNA] OF 375-1950. RC TISSUE=Brain; RX MEDLINE=98360089; PubMed=9693030; DOI=10.1006/geno.1998.5341; RA Nakayama M., Nakajima D., Nagase T., Nomura N., Seki N., Ohara O.; RT "Identification of high-molecular-weight proteins with multiple EGF- RT like motifs by motif-trap screening."; RL Genomics 51:27-34(1998). RN [3] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1764-1950. RC TISSUE=Muscle; RX PubMed=15489334; DOI=10.1101/gr.2596504; RG The MGC Project Team; RT "The status, quality, and expansion of the NIH full-length cDNA RT project: the Mammalian Gene Collection (MGC)."; RL Genome Res. 14:2121-2127(2004). CC -!- FUNCTION: Potential cell surface endocytic receptor, which binds CC and internalizes extracellular ligands for degradation by CC lysosomes. CC -!- SUBCELLULAR LOCATION: Membrane; Single-pass type I membrane CC protein (Potential). CC -!- TISSUE SPECIFICITY: Expressed in several regions of the brain. CC -!- SIMILARITY: Belongs to the LDLR family. CC -!- SIMILARITY: Contains 3 EGF-like domains. CC -!- SIMILARITY: Contains 8 LDL-receptor class A domains. CC -!- SIMILARITY: Contains 20 LDL-receptor class B repeats. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; AC021573; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AB011540; BAA32468.1; -; mRNA. DR EMBL; BC037360; AAH37360.1; -; mRNA. DR EMBL; BC041048; AAH41048.1; -; mRNA. DR IPI; IPI00306851; -. DR UniGene; Hs.4930; -. DR HSSP; P00736; 1APQ. DR IntAct; O75096; 2. DR PhosphoSite; O75096; -. DR PRIDE; O75096; -. DR Ensembl; ENSG00000134569; Homo sapiens. DR GeneCards; GC11M046834; -. DR HGNC; HGNC:6696; LRP4. DR HPA; HPA011934; -. DR HPA; HPA012300; -. DR MIM; 604270; gene. DR PharmGKB; PA30454; -. DR HOGENOM; O75096; -. DR HOVERGEN; O75096; -. DR OMA; O75096; NVISLDY. DR ArrayExpress; O75096; -. DR Bgee; O75096; -. DR CleanEx; HS_LRP4; -. DR GermOnline; ENSG00000134569; Homo sapiens. DR GO; GO:0016021; C:integral to membrane; IEA:UniProtKB-KW. DR GO; GO:0005509; F:calcium ion binding; NAS:UniProtKB. DR GO; GO:0004872; F:receptor activity; IEA:UniProtKB-KW. DR GO; GO:0006897; P:endocytosis; IEA:UniProtKB-KW. DR InterPro; IPR011042; 6-blade_b-propeller_TolB-like. DR InterPro; IPR006209; EGF. DR InterPro; IPR006210; EGF-like. DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_CS. DR InterPro; IPR001881; EGF_Ca_bd. DR InterPro; IPR013091; EGF_Ca_bd_2. DR InterPro; IPR018097; EGF_Ca_bd_CS. DR InterPro; IPR013032; EGF_like_reg_CS. DR InterPro; IPR002172; LDL_rcpt_classA_cys-rich. DR InterPro; IPR000033; LDLR. DR Gene3D; G3DSA:2.120.10.30; 6-blade_b-propeller_TolB-like; 4. DR Gene3D; G3DSA:4.10.400.10; LDL_rcpt_classA_cys-rich; 8. DR Pfam; PF00008; EGF; 3. DR Pfam; PF07645; EGF_CA; 1. DR Pfam; PF00057; Ldl_recept_a; 8. DR Pfam; PF00058; Ldl_recept_b; 19. DR PRINTS; PR00261; LDLRECEPTOR. DR SMART; SM00181; EGF; 4. DR SMART; SM00179; EGF_CA; 1. DR SMART; SM00192; LDLa; 8. DR SMART; SM00135; LY; 20. DR PROSITE; PS00010; ASX_HYDROXYL; 1. DR PROSITE; PS00022; EGF_1; FALSE_NEG. DR PROSITE; PS01186; EGF_2; 3. DR PROSITE; PS50026; EGF_3; FALSE_NEG. DR PROSITE; PS01187; EGF_CA; 1. DR PROSITE; PS01209; LDLRA_1; 8. DR PROSITE; PS50068; LDLRA_2; 8. DR PROSITE; PS51120; LDLRB; 20. PE 2: Evidence at transcript level; KW Calcium; Disulfide bond; EGF-like domain; Endocytosis; Glycoprotein; KW Membrane; Receptor; Repeat; Signal; Transmembrane. FT SIGNAL 1 20 Potential. FT CHAIN 21 1950 Low-density lipoprotein receptor-related FT protein 4. FT /FTId=PRO_0000017325. FT TOPO_DOM 21 1768 Extracellular (Potential). FT TRANSMEM 1769 1791 Potential. FT TOPO_DOM 1792 1950 Cytoplasmic (Potential). FT DOMAIN 71 112 LDL-receptor class A 1. FT DOMAIN 115 151 LDL-receptor class A 2. FT DOMAIN 154 189 LDL-receptor class A 3. FT DOMAIN 192 228 LDL-receptor class A 4. FT DOMAIN 235 271 LDL-receptor class A 5. FT DOMAIN 275 311 LDL-receptor class A 6. FT DOMAIN 314 350 LDL-receptor class A 7. FT DOMAIN 356 395 LDL-receptor class A 8. FT DOMAIN 399 439 EGF-like 1; calcium-binding (Potential). FT DOMAIN 440 479 EGF-like 2; calcium-binding (Potential). FT REPEAT 525 567 LDL-receptor class B 1. FT REPEAT 568 610 LDL-receptor class B 2. FT REPEAT 611 654 LDL-receptor class B 3. FT REPEAT 655 697 LDL-receptor class B 4. FT REPEAT 698 738 LDL-receptor class B 5. FT DOMAIN 743 782 EGF-like 3. FT REPEAT 830 872 LDL-receptor class B 6. FT REPEAT 873 915 LDL-receptor class B 7. FT REPEAT 916 959 LDL-receptor class B 8. FT REPEAT 960 1001 LDL-receptor class B 9. FT REPEAT 1002 1043 LDL-receptor class B 10. FT REPEAT 1138 1180 LDL-receptor class B 11. FT REPEAT 1181 1223 LDL-receptor class B 12. FT REPEAT 1224 1267 LDL-receptor class B 13. FT REPEAT 1268 1308 LDL-receptor class B 14. FT REPEAT 1309 1351 LDL-receptor class B 15. FT REPEAT 1442 1484 LDL-receptor class B 16. FT REPEAT 1485 1527 LDL-receptor class B 17. FT REPEAT 1528 1571 LDL-receptor class B 18. FT REPEAT 1572 1613 LDL-receptor class B 19. FT REPEAT 1614 1655 LDL-receptor class B 20. FT MOTIF 1811 1814 Endocytosis signal (Potential). FT CARBOHYD 309 309 N-linked (GlcNAc...) (Potential). FT CARBOHYD 543 543 N-linked (GlcNAc...) (Potential). FT CARBOHYD 764 764 N-linked (GlcNAc...) (Potential). FT CARBOHYD 946 946 N-linked (GlcNAc...) (Potential). FT CARBOHYD 1122 1122 N-linked (GlcNAc...) (Potential). FT CARBOHYD 1460 1460 N-linked (GlcNAc...) (Potential). FT CARBOHYD 1512 1512 N-linked (GlcNAc...) (Potential). FT DISULFID 72 89 By similarity. FT DISULFID 79 102 By similarity. FT DISULFID 96 111 By similarity. FT DISULFID 116 128 By similarity. FT DISULFID 123 141 By similarity. FT DISULFID 135 150 By similarity. FT DISULFID 155 167 By similarity. FT DISULFID 162 180 By similarity. FT DISULFID 174 188 By similarity. FT DISULFID 193 205 By similarity. FT DISULFID 200 218 By similarity. FT DISULFID 212 227 By similarity. FT DISULFID 236 248 By similarity. FT DISULFID 243 261 By similarity. FT DISULFID 255 270 By similarity. FT DISULFID 276 288 By similarity. FT DISULFID 283 301 By similarity. FT DISULFID 295 310 By similarity. FT DISULFID 315 327 By similarity. FT DISULFID 322 340 By similarity. FT DISULFID 334 349 By similarity. FT DISULFID 357 369 By similarity. FT DISULFID 364 382 By similarity. FT DISULFID 376 394 By similarity. FT DISULFID 403 414 By similarity. FT DISULFID 410 423 By similarity. FT DISULFID 425 438 By similarity. FT DISULFID 444 454 By similarity. FT DISULFID 450 463 By similarity. FT DISULFID 465 478 By similarity. FT DISULFID 747 758 By similarity. FT DISULFID 754 767 By similarity. FT DISULFID 769 781 By similarity. FT CONFLICT 1131 1131 I -> V (in Ref. 2; BAA32468). FT CONFLICT 1599 1599 S -> G (in Ref. 2; BAA32468). FT CONFLICT 1691 1691 R -> Q (in Ref. 2; BAA32468). FT CONFLICT 1907 1907 T -> M (in Ref. 2; BAA32468). SQ SEQUENCE 1950 AA; 215965 MW; D298624D70B2A287 CRC64; MRRQWGALLL GALLCAHAVA LGLRAGERTR SGPGSSSPSG GISGGASAGS GLGRGAGLGR GAGLASSPEC ACGRSHFTCA VSALGECTCI PAQWQCDGDN DCGDHSDEDG CILPTCSPLD FHCDNGKCIR RSWVCDGDND CEDDSDEQDC PPRECEEDEF PCQNGYCIRS LWHCDGDNDC GDNSDEQCDM RKCSDKEFRC SDGSCIAEHW YCDGDTDCKD GSDEENCPSA VPAPPCNLEE FQCAYGRCIL DIYHCDGDDD CGDWSDESDC SSHQPCRSGE FMCDSGLCIN AGWRCDGDAD CDDQSDERNC TTSMCTAEQF RCHSGRCVRL SWRCDGEDDC ADNSDEENCE NTGSPQCALD QFLCWNGRCI GQRKLCNGVN DCGDNSDESP QQNCRPRTGE ENCNVNNGGC AQKCQMVRGA VQCTCHTGYR LTEDGHTCQD VNECAEEGYC SQGCTNSEGA FQCWCETGYE LRPDRRSCKA LGPEPVLLFA NRIDIRQVLP HRSEYTLLLN NLENAIALDF HHRRELVFWS DVTLDRILRA NLNGSNVEEV VSTGLESPGG LAVDWVHDKL YWTDSGTSRI EVANLDGAHR KVLLWQNLEK PRAIALHPME GTIYWTDWGN TPRIEASSMD GSGRRIIADT HLFWPNGLTI DYAGRRMYWV DAKHHVIERA NLDGSHRKAV ISQGLPHPFA ITVFEDSLYW TDWHTKSINS ANKFTGKNQE IIRNKLHFPM DIHTLHPQRQ PAGKNRCGDN NGGCTHLCLP SGQNYTCACP TGFRKISSHA CAQSLDKFLL FARRMDIRRI SFDTEDLSDD VIPLADVRSA VALDWDSRDD HVYWTDVSTD TISRAKWDGT GQEVVVDTSL ESPAGLAIDW VTNKLYWTDA GTDRIEVANT DGSMRTVLIW ENLDRPRDIV VEPMGGYMYW TDWGASPKIE RAGMDASGRQ VIISSNLTWP NGLAIDYGSQ RLYWADAGMK TIEFAGLDGS KRKVLIGSQL PHPFGLTLYG ERIYWTDWQT KSIQSADRLT GLDRETLQEN LENLMDIHVF HRRRPPVSTP CAMENGGCSH LCLRSPNPSG FSCTCPTGIN LLSDGKTCSP GMNSFLIFAR RIDIRMVSLD IPYFADVVVP INITMKNTIA IGVDPQEGKV YWSDSTLHRI SRANLDGSQH EDIITTGLQT TDGLAVDAIG RKVYWTDTGT NRIEVGNLDG SMRKVLVWQN LDSPRAIVLY HEMGFMYWTD WGENAKLERS GMDGSDRAVL INNNLGWPNG LTVDKASSQL LWADAHTERI EAADLNGANR HTLVSPVQHP YGLTLLDSYI YWTDWQTRSI HRADKGTGSN VILVRSNLPG LMDMQAVDRA QPLGFNKCGS RNGGCSHLCL PRPSGFSCAC PTGIQLKGDG KTCDPSPETY LLFSSRGSIR RISLDTSDHT DVHVPVPELN NVISLDYDSV DGKVYYTDVF LDVIRRADLN GSNMETVIGR GLKTTDGLAV DWVARNLYWT DTGRNTIEAS RLDGSCRKVL INNSLDEPRA IAVFPRKGYL FWTDWGHIAK IERANLDGSE RKVLINTDLG WPNGLTLDYD TRRIYWVDAH LDRIESADLN GKLRQVLVSH VSHPFALTQQ DRWIYWTDWQ TKSIQRVDKY SGRNKETVLA NVEGLMDIIV VSPQRQTGTN ACGVNNGGCT HLCFARASDF VCACPDEPDS RPCSLVPGLV PPAPRATGMS EKSPVLPNTP PTTLYSSTTR TRTSLEEVEG RCSERDARLG LCARSNDAVP AAPGEGLHIS YAIGGLLSIL LILVVIAALM LYRHKKSKFT DPGMGNLTYS NPSYRTSTQE VKIEAIPKPA MYNQLCYKKE GGPDHNYTKE KIKIVEGICL LSGDDAEWDD LKQLRSSRGG LLRDHVCMKT DTVSIQASSG SLDDTETEQL LQEEQSECSS VHTAATPERR GSLPDTGWKH ERKLSSESQV //