ID A6H5V4_MOUSE Unreviewed; 370 AA. AC A6H5V4; DT 24-JUL-2007, integrated into UniProtKB/TrEMBL. DT 24-JUL-2007, sequence version 1. DT 07-OCT-2020, entry version 96. DE SubName: Full=Mesoderm posterior 2 {ECO:0000313|EMBL:AAI45653.1}; DE SubName: Full=Mesoderm posterior protein 2 {ECO:0000313|Ensembl:ENSMUSP00000103017}; GN Name=Mesp2 {ECO:0000313|EMBL:AAI45653.1, GN ECO:0000313|Ensembl:ENSMUSP00000103017, ECO:0000313|MGI:MGI:1096325}; OS Mus musculus (Mouse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; OC Murinae; Mus; Mus. OX NCBI_TaxID=10090 {ECO:0000313|EMBL:AAI45653.1}; RN [1] {ECO:0000313|EMBL:AAI45653.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RC TISSUE=Brain {ECO:0000313|EMBL:AAI45653.1}; RX PubMed=15489334; DOI=10.1101/gr.2596504; RG The MGC Project Team; RA Gerhard D.S., Wagner L., Feingold E.A., Shenmen C.M., Grouse L.H., RA Schuler G., Klein S.L., Old S., Rasooly R., Good P., Guyer M., Peck A.M., RA Derge J.G., Lipman D., Collins F.S., Jang W., Sherry S., Feolo M., RA Misquitta L., Lee E., Rotmistrovsky K., Greenhut S.F., Schaefer C.F., RA Buetow K., Bonner T.I., Haussler D., Kent J., Kiekhaus M., Furey T., RA Brent M., Prange C., Schreiber K., Shapiro N., Bhat N.K., Hopkins R.F., RA Hsie F., Driscoll T., Soares M.B., Casavant T.L., Scheetz T.E., RA Brown-stein M.J., Usdin T.B., Toshiyuki S., Carninci P., Piao Y., RA Dudekula D.B., Ko M.S., Kawakami K., Suzuki Y., Sugano S., Gruber C.E., RA Smith M.R., Simmons B., Moore T., Waterman R., Johnson S.L., Ruan Y., RA Wei C.L., Mathavan S., Gunaratne P.H., Wu J., Garcia A.M., Hulyk S.W., RA Fuh E., Yuan Y., Sneed A., Kowis C., Hodgson A., Muzny D.M., McPherson J., RA Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., RA Sanchez A., Whiting M., Madari A., Young A.C., Wetherby K.D., Granite S.J., RA Kwong P.N., Brinkley C.P., Pearson R.L., Bouffard G.G., Blakesly R.W., RA Green E.D., Dickson M.C., Rodriguez A.C., Grimwood J., Schmutz J., RA Myers R.M., Butterfield Y.S., Griffith M., Griffith O.L., Krzywinski M.I., RA Liao N., Morin R., Morrin R., Palmquist D., Petrescu A.S., Skalska U., RA Smailus D.E., Stott J.M., Schnerch A., Schein J.E., Jones S.J., Holt R.A., RA Baross A., Marra M.A., Clifton S., Makowski K.A., Bosak S., Malek J.; RT "The status, quality, and expansion of the NIH full-length cDNA project: RT the Mammalian Gene Collection (MGC)."; RL Genome Res. 14:2121-2127(2004). RN [2] {ECO:0000313|Ensembl:ENSMUSP00000103017, ECO:0000313|Proteomes:UP000000589} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=C57BL/6J {ECO:0000313|Ensembl:ENSMUSP00000103017, RC ECO:0000313|Proteomes:UP000000589}; RX PubMed=19468303; DOI=10.1371/journal.pbio.1000112; RA Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X., RA Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y., RA Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S., RA Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R., RA Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K., RA Eichler E.E., Ponting C.P.; RT "Lineage-specific biology revealed by a finished genome assembly of the RT mouse."; RL PLoS Biol. 7:E1000112-E1000112(2009). RN [3] {ECO:0000313|Ensembl:ENSMUSP00000103017} RP IDENTIFICATION. RC STRAIN=C57BL/6J {ECO:0000313|Ensembl:ENSMUSP00000103017}; RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; AC109221; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; BC145652; AAI45653.1; -; mRNA. DR EMBL; BC145654; AAI45655.1; -; mRNA. DR RefSeq; NP_032615.2; NM_008589.2. DR SMR; A6H5V4; -. DR PRIDE; A6H5V4; -. DR Antibodypedia; 28666; 101 antibodies. DR Ensembl; ENSMUST00000107394; ENSMUSP00000103017; ENSMUSG00000030543. DR GeneID; 17293; -. DR KEGG; mmu:17293; -. DR UCSC; uc009hzc.1; mouse. DR CTD; 145873; -. DR MGI; MGI:1096325; Mesp2. DR GeneTree; ENSGT00530000063712; -. DR HOGENOM; CLU_064749_0_0_1; -. DR KO; K09076; -. DR OMA; LPRPSCQ; -. DR OrthoDB; 1072866at2759; -. DR TreeFam; TF325707; -. DR BioGRID-ORCS; 17293; 1 hit in 18 CRISPR screens. DR Proteomes; UP000000589; Chromosome 7. DR Bgee; ENSMUSG00000030543; Expressed in secondary oocyte and 93 other tissues. DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro. DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro. DR GO; GO:1990837; F:sequence-specific double-stranded DNA binding; IEA:Ensembl. DR CDD; cd00083; HLH; 1. DR Gene3D; 4.10.280.10; -; 1. DR InterPro; IPR011598; bHLH_dom. DR InterPro; IPR036638; HLH_DNA-bd_sf. DR InterPro; IPR040259; Mesogenin/MesP. DR PANTHER; PTHR20937; PTHR20937; 1. DR Pfam; PF00010; HLH; 1. DR SMART; SM00353; HLH; 1. DR SUPFAM; SSF47459; SSF47459; 1. DR PROSITE; PS50888; BHLH; 1. PE 2: Evidence at transcript level; KW Reference proteome {ECO:0000313|Proteomes:UP000000589}. FT DOMAIN 79..133 FT /note="BHLH" FT /evidence="ECO:0000259|PROSITE:PS50888" FT REGION 24..90 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 231..265 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 325..350 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 24..47 FT /note="Polar" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 54..70 FT /note="Polar" FT /evidence="ECO:0000256|SAM:MobiDB-lite" SQ SEQUENCE 370 AA; 39836 MW; BF48437113680033 CRC64; MAQSPPPQSL QGLDHWVFSQ GWGWAQQSDS TSPASSSDSS GSCPCYATRR PSQPAGPARS TRTTQATAPR RTRPAPAGGQ RQSASEREKL RMRTLARALQ ELRRFLPPSV APAGQSLTKI ETLRLAIRYI GHLSALLGLS EDSLRRRRRR SADAAFSHRC PQCPDGGSPS QAQMLGPSLG SAMSSGVSWG CPPACPGPLI SPENLGNRIS NVDPWVTPPY CPQIQSPLHQ SLERAADSSP WAPPQACPGM QMSPEPRNKT GHWTQSTEPA ELTKVYQSLS VSPEPCLSLG SPLLLPRPSC QRLQPQPQPQ PQWGCWGHDA EVLSTSEDQG SSPALQLPVA SPTPSSGLQL SGCPELWQED LEGPPLNIFY //