ID FNDC1_HUMAN Reviewed; 1836 AA. AC Q4ZHG4; Q5JPI0; Q5VU31; Q5VU32; Q5VXX4; Q70CQ6; Q96JG1; DT 17-APR-2007, integrated into UniProtKB/Swiss-Prot. DT 17-APR-2007, sequence version 2. DT 10-JUL-2007, entry version 22. DE Fibronectin type III domain-containing protein 1 (Expressed in DE synovial lining protein) (Activation-associated cDNA protein). GN Name=FNDC1; Synonyms=FNDC2, KIAA1866, MEL4B3; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606; RN [1] RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), AND VARIANT LYS-1446. RX MEDLINE=98368331; PubMed=9704633; RX DOI=10.1002/1529-0131(199808)41:8<1356::AID-ART4>3.0.CO;2-X; RA Seki T., Selby J., Haupl T., Winchester R.; RT "Use of differential subtraction method to identify genes that RT characterize the phenotype of cultured rheumatoid arthritis RT synoviocytes."; RL Arthritis Rheum. 41:1356-1364(1998). RN [2] RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 4), AND INDUCTION BY TGF-BETA. RX PubMed=16098131; DOI=10.1111/j.0906-6705.2005.00349.x; RA Anderegg U., Breitschwerdt K., Koehler M.J., Sticherling M., RA Haustein U.-F., Simon J.C., Saalbach A.; RT "MEL4B3, a novel mRNA is induced in skin tumors and regulated by TGF- RT beta and pro-inflammatory cytokines."; RL Exp. Dermatol. 14:709-718(2005). RN [3] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2), AND VARIANTS RP GLN-411; GLU-951; GLU-1128; PRO-1209; ARG-1228 AND LYS-1446. RC TISSUE=Brain; RX MEDLINE=21245130; PubMed=11347906; DOI=10.1093/dnares/8.2.85; RA Nagase T., Nakayama M., Nakajima D., Kikuno R., Ohara O.; RT "Prediction of the coding sequences of unidentified human genes. XX. RT The complete sequences of 100 new cDNA clones from brain which code RT for large proteins in vitro."; RL DNA Res. 8:85-95(2001). RN [4] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], AND VARIANTS PRO-1209; RP ARG-1228 AND LYS-1446. RX MEDLINE=22935763; PubMed=14574404; DOI=10.1038/nature02055; RA Mungall A.J., Palmer S.A., Sims S.K., Edwards C.A., Ashurst J.L., RA Wilming L., Jones M.C., Horton R., Hunt S.E., Scott C.E., RA Gilbert J.G.R., Clamp M.E., Bethel G., Milne S., Ainscough R., RA Almeida J.P., Ambrose K.D., Andrews T.D., Ashwell R.I.S., RA Babbage A.K., Bagguley C.L., Bailey J., Banerjee R., Barker D.J., RA Barlow K.F., Bates K., Beare D.M., Beasley H., Beasley O., Bird C.P., RA Blakey S.E., Bray-Allen S., Brook J., Brown A.J., Brown J.Y., RA Burford D.C., Burrill W., Burton J., Carder C., Carter N.P., RA Chapman J.C., Clark S.Y., Clark G., Clee C.M., Clegg S., Cobley V., RA Collier R.E., Collins J.E., Colman L.K., Corby N.R., Coville G.J., RA Culley K.M., Dhami P., Davies J., Dunn M., Earthrowl M.E., RA Ellington A.E., Evans K.A., Faulkner L., Francis M.D., Frankish A., RA Frankland J., French L., Garner P., Garnett J., Ghori M.J., RA Gilby L.M., Gillson C.J., Glithero R.J., Grafham D.V., Grant M., RA Gribble S., Griffiths C., Griffiths M.N.D., Hall R., Halls K.S., RA Hammond S., Harley J.L., Hart E.A., Heath P.D., Heathcott R., RA Holmes S.J., Howden P.J., Howe K.L., Howell G.R., Huckle E., RA Humphray S.J., Humphries M.D., Hunt A.R., Johnson C.M., Joy A.A., RA Kay M., Keenan S.J., Kimberley A.M., King A., Laird G.K., Langford C., RA Lawlor S., Leongamornlert D.A., Leversha M., Lloyd C.R., Lloyd D.M., RA Loveland J.E., Lovell J., Martin S., Mashreghi-Mohammadi M., RA Maslen G.L., Matthews L., McCann O.T., McLaren S.J., McLay K., RA McMurray A., Moore M.J.F., Mullikin J.C., Niblett D., Nickerson T., RA Novik K.L., Oliver K., Overton-Larty E.K., Parker A., Patel R., RA Pearce A.V., Peck A.I., Phillimore B.J.C.T., Phillips S., Plumb R.W., RA Porter K.M., Ramsey Y., Ranby S.A., Rice C.M., Ross M.T., Searle S.M., RA Sehra H.K., Sheridan E., Skuce C.D., Smith S., Smith M., Spraggon L., RA Squares S.L., Steward C.A., Sycamore N., Tamlyn-Hall G., Tester J., RA Theaker A.J., Thomas D.W., Thorpe A., Tracey A., Tromans A., Tubby B., RA Wall M., Wallis J.M., West A.P., White S.S., Whitehead S.L., RA Whittaker H., Wild A., Willey D.J., Wilmer T.E., Wood J.M., Wray P.W., RA Wyatt J.C., Young L., Younger R.M., Bentley D.R., Coulson A., RA Durbin R.M., Hubbard T., Sulston J.E., Dunham I., Rogers J., Beck S.; RT "The DNA sequence and analysis of human chromosome 6."; RL Nature 425:805-811(2003). RN [5] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1144-1836, AND VARIANTS RP PRO-1209; ARG-1228 AND LYS-1446. RC TISSUE=Lymph node; RG The German cDNA consortium; RL Submitted (JUN-2005) to the EMBL/GenBank/DDBJ databases. CC -!- FUNCTION: May be an activator of G protein signaling (By CC similarity). CC -!- ALTERNATIVE PRODUCTS: CC Event=Alternative splicing; Named isoforms=4; CC Name=1; CC IsoId=Q4ZHG4-1; Sequence=Displayed; CC Name=2; CC IsoId=Q4ZHG4-2; Sequence=VSP_024663; CC Note=No experimental confirmation available; CC Name=3; CC IsoId=Q4ZHG4-3; Sequence=VSP_024662; CC Note=Gene prediction based on EST data; CC Name=4; CC IsoId=Q4ZHG4-4; Sequence=VSP_024664; CC Note=Expressed in the stroma close to skin tumors, in the tumor CC cells themselves and in the epidermis of psoriasis; CC -!- TISSUE SPECIFICITY: Isoform 4 is almost absent from healthy skin; CC especially in epidermal keratinocytes, skin fibroblasts or CC endothelial cells and is barely detectable in benign melanocytic CC naevi. CC -!- INDUCTION: By TGF-beta present in the melanoma cell conditioned CC medium (MCCM). CC -!- SIMILARITY: Contains 5 fibronectin type-III domains. CC -!- SEQUENCE CAUTION: CC Sequence=CAH70504.1; Type=Erroneous gene model prediction; CC Sequence=CAH71650.1; Type=Erroneous gene model prediction; CC Sequence=CAH71651.1; Type=Erroneous gene model prediction; CC Sequence=CAI20153.1; Type=Erroneous gene model prediction; CC Sequence=CAI20154.1; Type=Erroneous gene model prediction; CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; DQ009660; AAY26234.1; -; mRNA. DR EMBL; AJ586132; CAE51894.1; -; mRNA. DR EMBL; AB058769; BAB47495.2; ALT_INIT; mRNA. DR EMBL; AL356417; CAH70504.1; ALT_SEQ; Genomic_DNA. DR EMBL; AL355492; CAH70504.1; JOINED; Genomic_DNA. DR EMBL; AL590551; CAH71650.1; ALT_SEQ; Genomic_DNA. DR EMBL; AL590551; CAH71651.1; ALT_SEQ; Genomic_DNA. DR EMBL; AL355492; CAH71651.1; JOINED; Genomic_DNA. DR EMBL; AL355492; CAI20153.1; ALT_SEQ; Genomic_DNA. DR EMBL; AL590551; CAI20153.1; JOINED; Genomic_DNA. DR EMBL; AL355492; CAI20154.1; ALT_SEQ; Genomic_DNA. DR EMBL; AL356417; CAI20154.1; JOINED; Genomic_DNA. DR EMBL; AL832410; CAI46178.2; -; mRNA. DR UniGene; Hs.520525; -. DR Ensembl; ENSG00000164694; Homo sapiens. DR HGNC; HGNC:21184; FNDC1. DR MIM; 609991; gene. DR PharmGKB; PA134906656; -. DR ArrayExpress; Q4ZHG4; -. DR RZPD-ProtExp; Y0210; -. DR InterPro; IPR003961; FN_III. DR InterPro; IPR008957; FN_III-like. DR Gene3D; G3DSA:2.60.40.30; FN_III-like; 5. DR Pfam; PF00041; fn3; 5. DR SMART; SM00060; FN3; 4. DR PROSITE; PS50853; FN3; 5. KW Alternative splicing; Polymorphism; Repeat. FT CHAIN 1 1836 Fibronectin type III domain-containing FT protein 1. FT /FTId=PRO_0000284831. FT DOMAIN 2 77 Fibronectin type-III 1. FT DOMAIN 106 198 Fibronectin type-III 2. FT DOMAIN 207 302 Fibronectin type-III 3. FT DOMAIN 307 401 Fibronectin type-III 4. FT DOMAIN 1597 1691 Fibronectin type-III 5. FT COMPBIAS 619 721 Ser-rich. FT COMPBIAS 954 957 Poly-Pro. FT COMPBIAS 1191 1196 Poly-Pro. FT COMPBIAS 1391 1458 Thr-rich. FT VAR_SEQ 1 1469 Missing (in isoform 4). FT /FTId=VSP_024664. FT VAR_SEQ 1 101 MGLKVTWDPPKDATSRPVEHYNIAYGKSLKSLKYIKVNAET FT YSFLIEDVEPGVVYFVLLTAENHSGVSRPVYRAESPPGGEW FT IEIDGFPIKGPGPFNETVT -> MAPEAGATLRAPRRLSWA FT ALLLLAALLPVASSAAAS (in isoform 3). FT /FTId=VSP_024662. FT VAR_SEQ 342 405 EYILSYAPALKPFGAKSLTYPGDTTSALVDGLQPGERYLFK FT IRATNRRGLGPHSKAFIVAMPTT -> A (in isoform FT 2). FT /FTId=VSP_024663. FT VARIANT 386 386 T -> A (in dbSNP:rs509648). FT /FTId=VAR_031826. FT VARIANT 411 411 E -> Q (in dbSNP:rs420137). FT /FTId=VAR_031827. FT VARIANT 951 951 Q -> E (in dbSNP:rs370434). FT /FTId=VAR_031828. FT VARIANT 1128 1128 D -> E (in dbSNP:rs420054). FT /FTId=VAR_031829. FT VARIANT 1209 1209 L -> P (in dbSNP:rs3003174). FT /FTId=VAR_031830. FT VARIANT 1228 1228 Q -> R (in dbSNP:rs2501176). FT /FTId=VAR_031831. FT VARIANT 1446 1446 T -> K (in dbSNP:rs386360). FT /FTId=VAR_031832. FT VARIANT 1516 1516 T -> A (in dbSNP:rs7763726). FT /FTId=VAR_031833. FT CONFLICT 1426 1426 T -> TTRRTTT (in Ref. 4; CAH71650). FT CONFLICT 1627 1627 D -> N (in Ref. 2; CAE51894). FT CONFLICT 1836 1836 W -> G (in Ref. 2; CAE51894). SQ SEQUENCE 1836 AA; 199421 MW; 0859A2CA012397C7 CRC64; MGLKVTWDPP KDATSRPVEH YNIAYGKSLK SLKYIKVNAE TYSFLIEDVE PGVVYFVLLT AENHSGVSRP VYRAESPPGG EWIEIDGFPI KGPGPFNETV TEKEVPNKPL RVRVRSSDDR LSVAWKAPRL SGAKSPRRSR GFLLGYGESG RKMNYVPLTR DERTHEIKKL ASESVYVVSL QSMNSQGRSQ PVYRAALTKR KISEEDELDV PDDISVRVMS SQSVLVSWVD PVLEKQKKVV ASRQYTVRYR EKGELARWDY KQIANRRVLI ENLIPDTVYE FAVRISQGER DGKWSTSVFQ RTPESAPTTA PENLNVWPVN GKPTVVAASW DALPETEGKV KEYILSYAPA LKPFGAKSLT YPGDTTSALV DGLQPGERYL FKIRATNRRG LGPHSKAFIV AMPTTSKADV EQNTEDNGKP EKPEPSSPSP RAPASSQHPS VPASPQGRNA KDLLLDLKNK ILANGGAPRK PQLRAKKAEE LDLQSTEITG EEELGSREDS PMSPSDTQDQ KRTLRPPSRH GHSVVAPGRT AVRARMPALP RREGVDKPGF SLATQPRPGA PPSASASPAH HASTQGTSHR PSLPASLNDN DLVDSDEDER AVGSLHPKGA FAQPRPALSP SRQSPSSVLR DRSSVHPGAK PASPARRTPH SGAAEEDSSA SAPPSRLSPP HGGSSRLLPT QPHLSSPLSK GGKDGEDAPA TNSNAPSRST MSSSVSSHLS SRTQVSEGAE ASDGESHGDG DREDGGRQAE ATAQTLRARP ASGHFHLLRH KPFAANGRSP SRFSIGRGPR LQPSSSPQST VPSRAHPRVP SHSDSHPKLS SGIHGDEEDE KPLPATVVND HVPSSSRQPI SRGWEDLRRS PQRGASLHRK EPIPENPKST GADTHPQGKY SSLASKAQDV QQSTDADTEG HSPKAQPGST DRHASPARPP AARSQQHPSV PRRMTPGRAP QQQPPPPVAT SQHHPGPQSR DAGRSPSQPR LSLTQAGRPR PTSQGRSHSS SDPYTASSRG MLPTALQNQD EDAQGSYDDD STEVEAQDVR APAHAARAKE AAASLPKHQQ VESPTGAGAG GDHRSQRGHA ASPARPSRPG GPQSRARVPS RAAPGKSEPP SKRPLSSKSQ QSVSAEDDEE EDAGFFKGGK EDLLSSSVPK WPSSSTPRGG KDADGSLAKE EREPAIALAP RGGSLAPVKR PLPPPPGSSP RASHVPSRLP PRSAATVSPV AGTHPWPQYT TRAPPGHFST TPMLSLRQRM MHARFRNPLS RQPARPSYRQ GYNGRPNVEG KVLPGSNGKP NGQRIINGPQ GTKWVVDLDR GLVLNAEGRY LQDSHGNPLR IKLGGDGRTI VDLEGTPVVS PDGLPLFGQG RHGTPLANAQ DKPILSLGGK PLVGLEVIKK TTHPPTTTMQ PTTTTTPLPT TTTPRPTTAT TRRTTTRRPT TTVRTTTRTT TTTTPTPTTP IPTCPPGTLE RHDDDGNLIM SSNGIPECYA EEDEFSGLET DTAVPTEEAY VIYDEDYEFE TSRPPTTTEP STTATTPRVI PEEGAISSFP EEEFDLAGRK RFVAPYVTYL NKDPSAPCSL TDALDHFQVD SLDEIIPNDL KKSDLPPQHA PRNITVVAVE GCHSFVIVDW DKATPGDVVT GYLVYSASYE DFIRNKWSTQ ASSVTHLPIE NLKPNTRYYF KVQAQNPHGY GPISPSVSFV TESDNPLLVV RPPGGEPIWI PFAFKHDPSY TDCHGRQYVK RTWYRKFVGV VLCNSLRYKI YLSDNLKDTF YSIGDSWGRG EDHCQFVDSH LDGRTGPQSY VEALPTIQGY YRQYRQEPVR FGNIGFGTPY YYVGWYECGV SIPGKW //