ID   LRP4_HUMAN     STANDARD;      PRT;  1950 AA.
AC   O75096;
DT   05-JUL-2004 (Rel. 44, Created)
DT   05-JUL-2004 (Rel. 44, Last sequence update)
DT   05-JUL-2004 (Rel. 44, Last annotation update)
DE   Low-density lipoprotein receptor-related protein 4 precursor (Multiple
DE   epidermal growth factor-like domains 7).
GN   Name=LRP4; Synonyms=KIAA0816, MEGF7;
OS   Homo sapiens (Human).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX   NCBI_TaxID=9606;
RN   [1]
RP   CONCEPTUAL TRANSLATION.
RA   Blatter M.-C.;
RL   Unpublished observations (MAR-2004).
RN   [2]
RP   SEQUENCE OF 375-1950 FROM N.A.
RC   TISSUE=Brain;
RX   MEDLINE=98360089; PubMed=9693030; DOI=10.1006/geno.1998.5341;
RA   Nakayama M., Nakajima D., Nagase T., Nomura N., Seki N., Ohara O.;
RT   "Identification of high-molecular-weight proteins with multiple EGF-
RT   like motifs by motif-trap screening.";
RL   Genomics 51:27-34(1998).
RN   [3]
RP   SEQUENCE OF 1764-1950 FROM N.A.
RC   TISSUE=Muscle;
RX   MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,
RA   Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,
RA   Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,
RA   Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,
RA   Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,
RA   Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,
RA   Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,
RA   Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,
RA   Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,
RA   Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,
RA   Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,
RA   Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,
RA   Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,
RA   Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,
RA   Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,
RA   Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human
RT   and mouse cDNA sequences.";
RL   Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).
CC   -!- FUNCTION: Potential cell surface endocytic receptor, which binds
CC       and internalizes extracellular ligands for degradation by
CC       lysosomes.
CC   -!- SUBCELLULAR LOCATION: Type I membrane protein (Potential).
CC   -!- TISSUE SPECIFICITY: Expressed in several regions of the brain.
CC   -!- SIMILARITY: Belongs to the LDLR family.
CC   -!- SIMILARITY: Contains 3 EGF-like domains.
CC   -!- SIMILARITY: Contains 8 LDL-receptor class A domains.
CC   -!- SIMILARITY: Contains 19 LDL-receptor class B domains.
CC   -!- CAUTION: The sequence has been constructed according to mouse and
CC       rat sequences. The N-terminus part differs from mouse and rat
CC       sequences, but is confirmed by EST.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AC021573; -; NOT_ANNOTATED_CDS.
DR   EMBL; AB011540; BAA32468.1; -.
DR   EMBL; BC037360; AAH37360.1; -.
DR   EMBL; BC041048; AAH41048.1; -.
DR   HSSP; P00736; 1APQ.
DR   IntAct; O75096; -.
DR   Genew; HGNC:6696; LRP4.
DR   H-InvDB; HIX0009607; -.
DR   MIM; 604270; -.
DR   GO; GO:0005509; F:calcium ion binding; NAS.
DR   InterPro; IPR000152; Asx_hydroxyl_S.
DR   InterPro; IPR001881; EGF_Ca.
DR   InterPro; IPR006209; EGF_like.
DR   InterPro; IPR002172; LDL_receptor_A.
DR   InterPro; IPR000033; Ldl_receptor_rep.
DR   Pfam; PF00008; EGF; 5.
DR   Pfam; PF00057; Ldl_recept_a; 1.
DR   Pfam; PF00058; Ldl_recept_b; 19.
DR   SMART; SM00181; EGF; 7.
DR   SMART; SM00179; EGF_CA; 3.
DR   SMART; SM00192; LDLa; 8.
DR   SMART; SM00135; LY; 20.
DR   PROSITE; PS00010; ASX_HYDROXYL; 1.
DR   PROSITE; PS00022; EGF_1; FALSE_NEG.
DR   PROSITE; PS01186; EGF_2; 3.
DR   PROSITE; PS50026; EGF_3; FALSE_NEG.
DR   PROSITE; PS01187; EGF_CA; 1.
DR   PROSITE; PS01209; LDLRA_1; 8.
DR   PROSITE; PS50068; LDLRA_2; 8.
KW   Calcium-binding; EGF-like domain; Endocytosis; Glycoprotein; Receptor;
KW   Repeat; Signal; Transmembrane.
FT   SIGNAL        1     20       Potential.
FT   CHAIN        21   1950       Low-density lipoprotein receptor-related
FT                                protein 4.
FT   DOMAIN       21   1768       Extracellular (Potential).
FT   TRANSMEM   1769   1791       Potential.
FT   DOMAIN     1792   1950       Cytoplasmic (Potential).
FT   DOMAIN       71    112       LDL-receptor class A 1.
FT   DOMAIN      115    151       LDL-receptor class A 2.
FT   DOMAIN      154    189       LDL-receptor class A 3.
FT   DOMAIN      192    228       LDL-receptor class A 4.
FT   DOMAIN      235    271       LDL-receptor class A 5.
FT   DOMAIN      275    311       LDL-receptor class A 6.
FT   DOMAIN      314    350       LDL-receptor class A 7.
FT   DOMAIN      356    395       LDL-receptor class A 8.
FT   DOMAIN      399    439       EGF-like 1, calcium-binding (Potential).
FT   DOMAIN      440    479       EGF-like 2, calcium-binding (Potential).
FT   DOMAIN      525    566       LDL-receptor class B 1.
FT   DOMAIN      568    609       LDL-receptor class B 2.
FT   DOMAIN      611    653       LDL-receptor class B 3.
FT   DOMAIN      655    696       LDL-receptor class B 4.
FT   DOMAIN      698    738       LDL-receptor class B 5.
FT   DOMAIN      743    782       EGF-like 3.
FT   DOMAIN      830    871       LDL-receptor class B 6.
FT   DOMAIN      873    914       LDL-receptor class B 7.
FT   DOMAIN      916    958       LDL-receptor class B 8.
FT   DOMAIN      960   1001       LDL-receptor class B 9.
FT   DOMAIN     1002   1042       LDL-receptor class B 10.
FT   DOMAIN     1138   1179       LDL-receptor class B 11.
FT   DOMAIN     1181   1222       LDL-receptor class B 12.
FT   DOMAIN     1224   1266       LDL-receptor class B 13.
FT   DOMAIN     1268   1308       LDL-receptor class B 14.
FT   DOMAIN     1309   1350       LDL-receptor class B 15.
FT   DOMAIN     1442   1483       LDL-receptor class B 16.
FT   DOMAIN     1485   1526       LDL-receptor class B 17.
FT   DOMAIN     1528   1570       LDL-receptor class B 18.
FT   DOMAIN     1572   1617       LDL-receptor class B 19.
FT   SITE       1811   1814       Endocytosis signal (Potential).
FT   DISULFID     72     89       By similarity.
FT   DISULFID     79    102       By similarity.
FT   DISULFID     96    111       By similarity.
FT   DISULFID    116    128       By similarity.
FT   DISULFID    123    141       By similarity.
FT   DISULFID    135    150       By similarity.
FT   DISULFID    155    167       By similarity.
FT   DISULFID    162    180       By similarity.
FT   DISULFID    174    188       By similarity.
FT   DISULFID    193    205       By similarity.
FT   DISULFID    200    218       By similarity.
FT   DISULFID    212    227       By similarity.
FT   DISULFID    236    248       By similarity.
FT   DISULFID    243    261       By similarity.
FT   DISULFID    255    270       By similarity.
FT   DISULFID    276    288       By similarity.
FT   DISULFID    283    301       By similarity.
FT   DISULFID    295    310       By similarity.
FT   DISULFID    315    327       By similarity.
FT   DISULFID    322    340       By similarity.
FT   DISULFID    334    349       By similarity.
FT   DISULFID    357    369       By similarity.
FT   DISULFID    364    382       By similarity.
FT   DISULFID    376    394       By similarity.
FT   DISULFID    403    414       By similarity.
FT   DISULFID    410    423       By similarity.
FT   DISULFID    425    438       By similarity.
FT   DISULFID    444    454       By similarity.
FT   DISULFID    450    463       By similarity.
FT   DISULFID    465    478       By similarity.
FT   DISULFID    747    758       By similarity.
FT   DISULFID    754    767       By similarity.
FT   DISULFID    769    781       By similarity.
FT   CARBOHYD    309    309       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    543    543       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    764    764       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    946    946       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD   1122   1122       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD   1460   1460       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD   1512   1512       N-linked (GlcNAc...) (Potential).
FT   CONFLICT   1131   1131       I -> V (in Ref. 2).
FT   CONFLICT   1599   1599       S -> G (in Ref. 2).
FT   CONFLICT   1691   1691       R -> Q (in Ref. 2).
FT   CONFLICT   1907   1907       T -> M (in Ref. 2).
SQ   SEQUENCE   1950 AA;  215963 MW;  D298624D70B2A287 CRC64;
     MRRQWGALLL GALLCAHAVA LGLRAGERTR SGPGSSSPSG GISGGASAGS GLGRGAGLGR
     GAGLASSPEC ACGRSHFTCA VSALGECTCI PAQWQCDGDN DCGDHSDEDG CILPTCSPLD
     FHCDNGKCIR RSWVCDGDND CEDDSDEQDC PPRECEEDEF PCQNGYCIRS LWHCDGDNDC
     GDNSDEQCDM RKCSDKEFRC SDGSCIAEHW YCDGDTDCKD GSDEENCPSA VPAPPCNLEE
     FQCAYGRCIL DIYHCDGDDD CGDWSDESDC SSHQPCRSGE FMCDSGLCIN AGWRCDGDAD
     CDDQSDERNC TTSMCTAEQF RCHSGRCVRL SWRCDGEDDC ADNSDEENCE NTGSPQCALD
     QFLCWNGRCI GQRKLCNGVN DCGDNSDESP QQNCRPRTGE ENCNVNNGGC AQKCQMVRGA
     VQCTCHTGYR LTEDGHTCQD VNECAEEGYC SQGCTNSEGA FQCWCETGYE LRPDRRSCKA
     LGPEPVLLFA NRIDIRQVLP HRSEYTLLLN NLENAIALDF HHRRELVFWS DVTLDRILRA
     NLNGSNVEEV VSTGLESPGG LAVDWVHDKL YWTDSGTSRI EVANLDGAHR KVLLWQNLEK
     PRAIALHPME GTIYWTDWGN TPRIEASSMD GSGRRIIADT HLFWPNGLTI DYAGRRMYWV
     DAKHHVIERA NLDGSHRKAV ISQGLPHPFA ITVFEDSLYW TDWHTKSINS ANKFTGKNQE
     IIRNKLHFPM DIHTLHPQRQ PAGKNRCGDN NGGCTHLCLP SGQNYTCACP TGFRKISSHA
     CAQSLDKFLL FARRMDIRRI SFDTEDLSDD VIPLADVRSA VALDWDSRDD HVYWTDVSTD
     TISRAKWDGT GQEVVVDTSL ESPAGLAIDW VTNKLYWTDA GTDRIEVANT DGSMRTVLIW
     ENLDRPRDIV VEPMGGYMYW TDWGASPKIE RAGMDASGRQ VIISSNLTWP NGLAIDYGSQ
     RLYWADAGMK TIEFAGLDGS KRKVLIGSQL PHPFGLTLYG ERIYWTDWQT KSIQSADRLT
     GLDRETLQEN LENLMDIHVF HRRRPPVSTP CAMENGGCSH LCLRSPNPSG FSCTCPTGIN
     LLSDGKTCSP GMNSFLIFAR RIDIRMVSLD IPYFADVVVP INITMKNTIA IGVDPQEGKV
     YWSDSTLHRI SRANLDGSQH EDIITTGLQT TDGLAVDAIG RKVYWTDTGT NRIEVGNLDG
     SMRKVLVWQN LDSPRAIVLY HEMGFMYWTD WGENAKLERS GMDGSDRAVL INNNLGWPNG
     LTVDKASSQL LWADAHTERI EAADLNGANR HTLVSPVQHP YGLTLLDSYI YWTDWQTRSI
     HRADKGTGSN VILVRSNLPG LMDMQAVDRA QPLGFNKCGS RNGGCSHLCL PRPSGFSCAC
     PTGIQLKGDG KTCDPSPETY LLFSSRGSIR RISLDTSDHT DVHVPVPELN NVISLDYDSV
     DGKVYYTDVF LDVIRRADLN GSNMETVIGR GLKTTDGLAV DWVARNLYWT DTGRNTIEAS
     RLDGSCRKVL INNSLDEPRA IAVFPRKGYL FWTDWGHIAK IERANLDGSE RKVLINTDLG
     WPNGLTLDYD TRRIYWVDAH LDRIESADLN GKLRQVLVSH VSHPFALTQQ DRWIYWTDWQ
     TKSIQRVDKY SGRNKETVLA NVEGLMDIIV VSPQRQTGTN ACGVNNGGCT HLCFARASDF
     VCACPDEPDS RPCSLVPGLV PPAPRATGMS EKSPVLPNTP PTTLYSSTTR TRTSLEEVEG
     RCSERDARLG LCARSNDAVP AAPGEGLHIS YAIGGLLSIL LILVVIAALM LYRHKKSKFT
     DPGMGNLTYS NPSYRTSTQE VKIEAIPKPA MYNQLCYKKE GGPDHNYTKE KIKIVEGICL
     LSGDDAEWDD LKQLRSSRGG LLRDHVCMKT DTVSIQASSG SLDDTETEQL LQEEQSECSS
     VHTAATPERR GSLPDTGWKH ERKLSSESQV
//