ID INT8_HUMAN Reviewed; 995 AA. AC Q75QN2; Q5RKZ3; Q6P1R5; Q7Z314; Q9NVS6; Q9NWY7; DT 31-OCT-2006, integrated into UniProtKB/Swiss-Prot. DT 05-JUL-2004, sequence version 1. DT 12-DEC-2006, entry version 19. DE Integrator complex subunit 8 (Int8) (KAONASHI protein 1) (KAONASHI1). GN Name=INTS8; Synonyms=C8orf52; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606; RN [1] RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1). RC TISSUE=Brain; RA Atsushi S., Asakawa S., Shimizu N.; RT "Novel gene with no significant domain."; RL Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases. RN [2] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 180-995 (ISOFORM 2). RC TISSUE=Rectum tumor; RG The German cDNA consortium; RL Submitted (JUN-2003) to the EMBL/GenBank/DDBJ databases. RN [3] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 307-995 (ISOFORM 1). RC TISSUE=Carcinoma, and Teratocarcinoma; RX PubMed=14702039; DOI=10.1038/ng1285; RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S., RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., RA Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., RA Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., RA Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., RA Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., RA Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., RA Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., RA Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., RA Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., RA Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., RA Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., RA Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., RA Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., RA Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., RA Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., RA Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., RA Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., RA Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., RA Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., RA Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.; RT "Complete sequencing and characterization of 21,243 full-length human RT cDNAs."; RL Nat. Genet. 36:40-45(2004). RN [4] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 344-995 (ISOFORM 1). RC TISSUE=Testis, and Uterus; RX PubMed=15489334; DOI=10.1101/gr.2596504; RG The MGC Project Team; RT "The status, quality, and expansion of the NIH full-length cDNA RT project: the Mammalian Gene Collection (MGC)."; RL Genome Res. 14:2121-2127(2004). RN [5] RP IDENTIFICATION BY MASS SPECTROMETRY, AND IDENTIFICATION IN THE RP INTEGRATOR COMPLEX. RX PubMed=16239144; DOI=10.1016/j.cell.2005.08.019; RA Baillat D., Hakimi M.-A., Naeaer A.M., Shilatifard A., Cooch N., RA Shiekhattar R.; RT "Integrator, a multiprotein mediator of small nuclear RNA processing, RT associates with the C-terminal repeat of RNA polymerase II."; RL Cell 123:265-276(2005). CC -!- FUNCTION: Component of the Integrator complex, a complex involved CC in the small nuclear RNAs (snRNA) U1 and U2 transcription and in CC their 3' box-dependent processing. The Integrator complex is CC associated with the C-terminal domain (CTD) of RNA polymerase II CC largest subunit (POLR2A) and is recruited to the U1 and U2 snRNAs CC genes. CC -!- SUBUNIT: Belongs to the multiprotein complex Integrator, at least CC composed of INTS1, INTS2, INTS3, INTS4, INTS5, INTS6, INTS7, CC INTS8, INTS9/RC74, INTS10, CPSF3L/INTS11 and INTS12. CC -!- SUBCELLULAR LOCATION: Nucleus (Probable). CC -!- ALTERNATIVE PRODUCTS: CC Event=Alternative splicing; Named isoforms=2; CC Name=1; CC IsoId=Q75QN2-1; Sequence=Displayed; CC Name=2; CC IsoId=Q75QN2-2; Sequence=VSP_021469; CC Note=No experimental confirmation available; CC -!- SIMILARITY: Contains 4 TPR repeats. CC -!- CAUTION: Ref.3 (AAH50536) sequence differs from that shown due to CC a frameshift in position 985. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; AB161944; BAD10863.1; -; mRNA. DR EMBL; BX538203; CAD98067.1; -; mRNA. DR EMBL; AK000537; BAA91238.1; ALT_INIT; mRNA. DR EMBL; AK001403; BAA91671.1; ALT_INIT; mRNA. DR EMBL; BC050536; AAH50536.1; ALT_FRAME; mRNA. DR EMBL; BC064915; AAH64915.1; -; mRNA. DR EMBL; BK005731; DAA05731.1; -; mRNA. DR UniGene; Hs.521693; -. DR UniGene; Hs.567387; -. DR GermOnline; ENSG00000164941; Homo sapiens. DR Ensembl; ENSG00000164941; Homo sapiens. DR HGNC; HGNC:26048; INTS8. DR ArrayExpress; Q75QN2; -. DR RZPD-ProtExp; W1545; -. DR GO; GO:0032039; C:integrator complex; IDA:HGNC. DR GO; GO:0016180; P:snRNA processing; IDA:HGNC. DR InterPro; IPR011990; TPR-like_helical. DR Gene3D; G3DSA:1.25.40.10; TPR-like_helical; 1. DR PROSITE; PS50005; TPR; FALSE_NEG. DR PROSITE; PS50293; TPR_REGION; FALSE_NEG. KW Alternative splicing; Nuclear protein; Repeat; TPR repeat. FT CHAIN 1 995 Integrator complex subunit 8. FT /FTId=PRO_0000259553. FT REPEAT 250 288 TPR 1. FT REPEAT 320 356 TPR 2. FT REPEAT 570 603 TPR 3. FT REPEAT 833 866 TPR 4. FT VAR_SEQ 879 895 Missing (in isoform 2). FT /FTId=VSP_021469. FT CONFLICT 778 778 V -> A (in Ref. 3; BAA91238). FT CONFLICT 832 832 N -> S (in Ref. 2; CAD98067). SQ SEQUENCE 995 AA; 113088 MW; 24D06E11821BD4D5 CRC64; MSAEAADREA ATSSRPCTPP QTCWFEFLLE ESLLEKHLRK PCPDPAPVQL IVQFLEQASK PSVNEQNQVQ PPPDNKRNRI LKLLALKVAA HLKWDLDILE KSLSVPVLNM LLNELLCISK VPPGTKHVDM DLATLPPTTA MAVLLYNRWA IRTIVQSSFP VKQAKPGPPQ LSVMNQMQQE KELTENILKV LKEQAADSIL VLEAALKLNK DLYVHTMRTL DLLAMEPGMV NGETESSTAG LKVKTEEMQC QVCYDLGAAY FQQGSTNSAV YENAREKFFR TKELIAEIGS LSLHCTIDEK RLAGYCQACD VLVPSSDSTS QQLTPYSQVH ICLRSGNYQE VIQIFIEDNL TLSLPVQFRQ SVLRELFKKA QQGNEALDEI CFKVCACNTV RDILEGRTIS VQFNQLFLRP NKEKIDFLLE VCSRSVNLEK ASESLKGNMA AFLKNVCLGL EDLQYVFMIS SHELFITLLK DEERKLLVDQ MRKRSPRVNL CIKPVTSFYD IPASASVNIG QLEHQLILSV DPWRIRQILI ELHGMTSERQ FWTVSNKWEV PSVYSGVILG IKDNLTRDLV YILMAKGLHC STVKDFSHAK QLFAACLELV TEFSPKLRQV MLNEMLLLDI HTHEAGTGQA GERPPSDLIS RVRGYLEMRL PDIPLRQVIA EECVAFMLNW RENEYLTLQV PAFLLQSNPY VKLGQLLAAT CKELPGPKES RRTAKDLWEV VVQICSVSSQ HKRGNDGRVS LIKQRESTLG IMYRSELLSF IKKLREPLVL TIILSLFVKL HNVREDIVND ITAEHISIWP SSIPNLQSVD FEAVAITVKE LVRYTLSINP NNHSWLIIQA DIYFATNQYS AALHYYLQAG AVCSDFFNKA VPPDVYTDQV IKRMIKCCSL LNCHTQVAIL CQFLREIDYK TAFKSLQEQN SHDAMDSYYD YIWDVTILEY LTYLHHKRGE TDKRQIAIKA IGQTELNASN PEEVLQLAAQ RRKKKFLQAM AKLYF //