ID A9UNJ8_MONBE Unreviewed; 1058 AA. AC A9UNJ8; DT 05-FEB-2008, integrated into UniProtKB/TrEMBL. DT 05-FEB-2008, sequence version 1. DT 02-DEC-2020, entry version 62. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EDQ92707.1}; GN ORFNames=35744 {ECO:0000313|EMBL:EDQ92707.1}; OS Monosiga brevicollis (Choanoflagellate). OC Eukaryota; Choanoflagellata; Craspedida; Salpingoecidae; Monosiga. OX NCBI_TaxID=81824 {ECO:0000313|Proteomes:UP000001357}; RN [1] {ECO:0000313|EMBL:EDQ92707.1, ECO:0000313|Proteomes:UP000001357} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=MX1 / ATCC 50154 {ECO:0000313|Proteomes:UP000001357}; RX PubMed=18273011; DOI=10.1038/nature06617; RG JGI Sequencing; RA King N., Westbrook M.J., Young S.L., Kuo A., Abedin M., Chapman J., RA Fairclough S., Hellsten U., Isogai Y., Letunic I., Marr M., Pincus D., RA Putnam N., Rokas A., Wright K.J., Zuzow R., Dirks W., Good M., RA Goodstein D., Lemons D., Li W., Lyons J.B., Morris A., Nichols S., RA Richter D.J., Salamov A., Bork P., Lim W.A., Manning G., Miller W.T., RA McGinnis W., Shapiro H., Tjian R., Grigoriev I.V., Rokhsar D.; RT "The genome of the choanoflagellate Monosiga brevicollis and the origin of RT metazoans."; RL Nature 451:783-788(2008). CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; CH991543; EDQ92707.1; -; Genomic_DNA. DR RefSeq; XP_001742469.1; XM_001742417.1. DR STRING; 81824.XP_001742469.1; -. DR EnsemblProtists; EDQ92707; EDQ92707; MONBRDRAFT_35744. DR GeneID; 5887469; -. DR KEGG; mbr:MONBRDRAFT_35744; -. DR eggNOG; KOG1548; Eukaryota. DR InParanoid; A9UNJ8; -. DR Proteomes; UP000001357; Unassembled WGS sequence. DR GO; GO:0005686; C:U2 snRNP; IBA:GO_Central. DR GO; GO:0005684; C:U2-type spliceosomal complex; IBA:GO_Central. DR GO; GO:0003723; F:RNA binding; IBA:GO_Central. DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:InterPro. DR CDD; cd12281; RRM1_TatSF1_like; 1. DR CDD; cd00201; WW; 2. DR Gene3D; 2.60.120.10; -; 1. DR Gene3D; 3.30.70.330; -; 2. DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf. DR InterPro; IPR035979; RBD_domain_sf. DR InterPro; IPR014710; RmlC-like_jellyroll. DR InterPro; IPR011051; RmlC_Cupin_sf. DR InterPro; IPR034393; TatSF1-like. DR InterPro; IPR034392; TatSF1-like_RRM1. DR InterPro; IPR001202; WW_dom. DR InterPro; IPR036020; WW_dom_sf. DR PANTHER; PTHR15608; PTHR15608; 1. DR Pfam; PF00397; WW; 2. DR SMART; SM00456; WW; 2. DR SUPFAM; SSF51045; SSF51045; 2. DR SUPFAM; SSF51182; SSF51182; 1. DR SUPFAM; SSF54928; SSF54928; 1. DR PROSITE; PS50020; WW_DOMAIN_2; 2. PE 4: Predicted; KW Reference proteome {ECO:0000313|Proteomes:UP000001357}; KW Repeat {ECO:0000256|ARBA:ARBA00022737}; KW RNA-binding {ECO:0000256|ARBA:ARBA00022884}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1..31 FT /evidence="ECO:0000256|SAM:SignalP" FT CHAIN 32..1058 FT /evidence="ECO:0000256|SAM:SignalP" FT /id="PRO_5002744579" FT DOMAIN 589..616 FT /note="WW" FT /evidence="ECO:0000259|PROSITE:PS50020" FT DOMAIN 646..679 FT /note="WW" FT /evidence="ECO:0000259|PROSITE:PS50020" SQ SEQUENCE 1058 AA; 116735 MW; DCD07687E993D5C7 CRC64; MALSWIGSRL SCCVLALVVL VLASWAPATA SGDMPPLPGA SFREHYPRPD CQSRKLCLLA AEPVSNIALT LRGQGLVVAT IAADALARAS GYTFFLRTDN SETVYTIDIA AAETGTITIR DASDAVLNTT SLIELHERHP WIPTDLVNFP MTQLQLWISI DKKNGYVRVG FGYPLLVNEM ITIGPIPKCL ESSDANACAI GTVTEVDCSL AAVAYSVTRF PVVAPIPPLL VDRHRVTLEV LEDRSALVPA VLPLELQSLY DQVAGEQILL SESDAAAIDL CIKTEGCVLY EKLQEKINAG EMSDDPHMAY IRVTIGPDLG NSPGSPFVME IWPPKMYSPI HDHANAVAVI KCLSGTITSR WYNPLAEQHN GEPVPFAEGR VAKGQITYLT PAFFQTHKLA NEEDVTCVTI QSYYYLHDDY VHNETFHFTL PTPDDNLHVF KPGSDFEYSE LIQAVRQFAS TAGANQVPPV DTLVGVCATM ANVEDGSGSA APGGHEDLFP LPEGKSREYV DTDGVTMEWD GDKGAYFPKV DAAVVSNYQA QYPTTNAGTV ACTGPVPRAC VRNSAHGQPG PAMLTLPRLF LADRPASKWQ VHAMPDGRPY YFNTETQTSV WVMPPELVEA AQVQWYWWSA TTPADEDGSR RLDPGAQKPE AFTAMKDERG NAYYFNKITR QSQWERPAGF QEATAAAVAQ QEEKRTAVVE RRKQRKQSKE EAARAKVQVD GNQVKLKDTV NCHVYVTGLP LVGAPSTVPL GVRACSHVCC HARVKDITLE EFTAFMRKAG IINEDAHGEP KIKLYTDEHG EPKGDGKCTY LRVESVELAL QLLDETEIRP GFKVKIQRAV FQLREGMTLG KKNDEDEVEE GSLEPAKKKA KKKSLGQKYA RVAAVHVSSL LGDTDSTDLG EQDICASFLT LYAVTAPVLR KLHWHETDTK RKRAVGVLVL KHMFTLEEMK EDASYIFELK DTVVLMRRCG MQNNPDGVVM VRYFTDEPLG PAIATLNGRF FAGQKVVAEE WDGKTKYKVE ESEEEKEARI KQWDEYLRSQ ELEEAAERKA RAAADAST //