ID D3YYM4_MOUSE Unreviewed; 1114 AA. AC D3YYM4; DT 20-APR-2010, integrated into UniProtKB/TrEMBL. DT 20-APR-2010, sequence version 1. DT 11-JUL-2012, entry version 21. DE SubName: Full=Protein Wdr72; GN Name=Wdr72; OS Mus musculus (Mouse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Mus; Mus. OX NCBI_TaxID=10090; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=C57BL/6J; RX PubMed=19468303; DOI=10.1371/journal.pbio.1000112; RA Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., RA She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., RA Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., RA Zhou S., Teague B., Potamousis K., Churas C., Place M., Herschleb J., RA Runnheim R., Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., RA Lindblad-Toh K., Eichler E.E., Ponting C.P.; RT "Lineage-specific biology revealed by a finished genome assembly of RT the mouse."; RL PLoS Biol. 7:E1000112-E1000112(2009). RN [2] RP IDENTIFICATION. RC STRAIN=C57BL/6J; RG Ensembl; RL Submitted (MAY-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; AC108944; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AC111087; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR IPI; IPI00762703; -. DR RefSeq; NP_001028672.2; NM_001033500.3. DR UniGene; Mm.335289; -. DR ProteinModelPortal; D3YYM4; -. DR SMR; D3YYM4; 14-190, 414-609. DR PRIDE; D3YYM4; -. DR Ensembl; ENSMUST00000055879; ENSMUSP00000057320; ENSMUSG00000044976. DR GeneID; 546144; -. DR KEGG; mmu:546144; -. DR UCSC; uc009qre.1; mouse. DR CTD; 256764; -. DR MGI; MGI:3583957; Wdr72. DR GeneTree; ENSGT00520000055590; -. DR HOGENOM; HOG000168573; -. DR OMA; CLLPWGV; -. DR OrthoDB; EOG41ZF93; -. DR NextBio; 413378; -. DR Bgee; D3YYM4; -. DR Gene3D; G3DSA:2.130.10.10; WD40/YVTN_repeat-like; 4. DR InterPro; IPR011044; Quino_amine_DH_bsu. DR InterPro; IPR011047; Quinonprotein_ADH-like. DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom. DR InterPro; IPR001680; WD40_repeat. DR InterPro; IPR019775; WD40_repeat_CS. DR InterPro; IPR017986; WD40_repeat_dom. DR Pfam; PF00400; WD40; 2. DR SMART; SM00320; WD40; 7. DR SUPFAM; SSF50969; Amine_DH_B_like; 1. DR SUPFAM; SSF50998; Quin_alc_DH_like; 1. DR PROSITE; PS00678; WD_REPEATS_1; 1. DR PROSITE; PS50082; WD_REPEATS_2; 2. DR PROSITE; PS50294; WD_REPEATS_REGION; 2. PE 4: Predicted; KW Complete proteome; Reference proteome; Repeat; WD repeat. SQ SEQUENCE 1114 AA; 124410 MW; 209D3BF22B75E72C CRC64; MRGALQAVAL WGRKAPPHSI TAIMITDDQQ TIVTGSQEGQ LCLWSLSPEL KISAKELLFG HSASVTCLAR ARDFSKQPYV VSAAENGEMC MWNVSSGQCV EKTSLPYRHT AICYYHCSFR MTGEGWLLCC GEYQDVLVLD AGTLAVLHTF TSLQSPDWMK CMCIVHSVRI QEDSLLVVSI TGELKVWDLS SSINSIQEKQ DVHEKESKFL DSFNCQTIRF CPYTERLLLV VFSKCWKIYD YCDFSLLWTE VSRDGQFFAG GEVLAAHRIL VWTEDGHSYI YQLLNRWAQM GATLRTFSGL SKCVCPADGG VLKGTVYPHL LCSTSVEENK SLHFVMGYMN ERKEPFYKVL FSGEVSGRIT LWHIPDVPIS KFDGSPREIP ITTTWTLQDN FDKHQMVSQS ITDHFSGSRD EVGMTATITS SEYIPNLDKL ICGCEDGTIF ITKALNAAKA GLLEGDSLLK DSPCHTLLRG HHQSVTSLLY PHNLASKLDQ SWMVSGDRGS YVILWDIFTE EILHTFFLEA GPVTRLLMSP ENLKRSDGQI LCCVCGDHSV ALLHLEGRRC LLRARKHLFP VRMIRWHPVE NFLIVGCTDD SVYIWEIETG TLERHETGER ARIILNCGDD AQLIRSEPTL SVASETHKHK SIEQKSSNSH QPGPVPCPSV QLESSCKVAD ASSVPRPFNV LPVKTKWSHI GFHVLLFDLE NLVELLLPTP LSDVDPSGSF YGGDILRRAK STVEKKTLTI RRNKASCSSL QTEAQAKPSG DSLVLGDSTS KFSEENNGIK RQKKMKSSKK AHPKPPRKVD ASLTIDMAKL FLSCILPWGV DKDLDSLCTR HLSILKLQGP VSLGLASNED LFSLMLPGWD ACSTEMKEYS GVNLCSRKVL DLSSKYTATL LHQTGIPRGL ESHCDSVQQS DAIVYLLSRL FLVNKLVNMP LDLACEIDRP FKMETVHSKA RFPGSDILNI SSFYGHPKNG GNECRAPEAD LSLLKLISCW RDQSVQVTEA IQAVLLAEVQ QHMKSLRNTP VSSQPDPVAE HSICERMQIS AKMEWTEELE LQYVGKSSPL KTSVSPVKHG NDLNSANFQD TEDILDRCVL EESESAGQPR HRPWIAKVCS CRMC //