ID H2QUR4_PANTR Unreviewed; 959 AA. AC H2QUR4; A0A2J8Q874; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 26-FEB-2020, entry version 51. DE SubName: Full=GTF2I repeat domain containing 1 {ECO:0000313|EMBL:JAA08002.1, ECO:0000313|Ensembl:ENSPTRP00000032994}; DE SubName: Full=GTF2IRD1 isoform 4 {ECO:0000313|EMBL:PNI92471.1}; GN Name=GTF2IRD1 {ECO:0000313|EMBL:JAA08002.1, GN ECO:0000313|Ensembl:ENSPTRP00000032994, ECO:0000313|VGNC:VGNC:8726}; GN ORFNames=CK820_G0040825 {ECO:0000313|EMBL:PNI92471.1}; OS Pan troglodytes (Chimpanzee). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; OC Pan. OX NCBI_TaxID=9598 {ECO:0000313|Ensembl:ENSPTRP00000032994, ECO:0000313|Proteomes:UP000002277}; RN [1] {ECO:0000313|Ensembl:ENSPTRP00000032994, ECO:0000313|Proteomes:UP000002277} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=16136131; DOI=10.1038/nature04072; RG Chimpanzee sequencing and analysis consortium; RT "Initial sequence of the chimpanzee genome and comparison with the human RT genome."; RL Nature 437:69-87(2005). RN [2] {ECO:0000313|Ensembl:ENSPTRP00000032994} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. RN [3] {ECO:0000313|EMBL:JAA08002.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Adipose stromal {ECO:0000313|EMBL:JAA08002.1}, Skeletal muscle RC {ECO:0000313|EMBL:JAA44080.1}, Skin {ECO:0000313|EMBL:JAA24036.1}, and RC Smooth vascular {ECO:0000313|EMBL:JAA19101.1}; RA Maudhoo M.D., Meehan D.T., Norgren R.B.Jr.; RT "De novo assembly of the reference chimpanzee transcriptome from NextGen RT mRNA sequences."; RL Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases. RN [4] {ECO:0000313|EMBL:PNI92471.1, ECO:0000313|Proteomes:UP000236370} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Yerkes chimp pedigree #C0471 {ECO:0000313|EMBL:PNI92471.1}; RC TISSUE=Blood {ECO:0000313|EMBL:PNI92471.1}; RA Pollen A., Hastie A., Hormozdiari F., Dougherty M., Liu R., Chaisson M., RA Hoppe E., Hill C., Pang A., Hillier L., Baker C., Armstrong J., RA Shendure J., Paten B., Wilson R., Chao H., Schneider V., Ventura M., RA Kronenberg Z., Murali S., Gordon D., Cantsilieris S., Munson K., Nelson B., RA Raja A., Underwood J., Diekhans M., Fiddes I., Haussler D., Eichler E.; RT "High-resolution comparative analysis of great ape genomes."; RL Submitted (DEC-2017) to the EMBL/GenBank/DDBJ databases. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; AC188564; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AC200908; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; GABC01003336; JAA08002.1; -; mRNA. DR EMBL; GABF01003044; JAA19101.1; -; mRNA. DR EMBL; GABD01009064; JAA24036.1; -; mRNA. DR EMBL; GABE01000659; JAA44080.1; -; mRNA. DR EMBL; NBAG03000068; PNI92471.1; -; Genomic_DNA. DR RefSeq; XP_001150464.2; XM_001150464.5. DR STRING; 9598.ENSPTRP00000032994; -. DR Ensembl; ENSPTRT00000035691; ENSPTRP00000032994; ENSPTRG00000019292. DR GeneID; 463471; -. DR KEGG; ptr:463471; -. DR CTD; 9569; -. DR VGNC; VGNC:8726; GTF2IRD1. DR eggNOG; ENOG410IEPZ; Eukaryota. DR eggNOG; ENOG41100H8; LUCA. DR GeneTree; ENSGT00940000159414; -. DR HOGENOM; CLU_014412_0_0_1; -. DR KO; K03121; -. DR OrthoDB; 115381at2759; -. DR TreeFam; TF352524; -. DR Proteomes; UP000002277; Chromosome 7. DR Proteomes; UP000236370; Unassembled WGS sequence. DR Bgee; ENSPTRG00000019292; Expressed in adult mammalian kidney and 5 other tissues. DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro. DR GO; GO:0006366; P:transcription by RNA polymerase II; IEA:InterPro. DR Gene3D; 3.90.1460.10; -; 5. DR InterPro; IPR004212; GTF2I. DR InterPro; IPR036647; GTF2I-like_rpt_sf. DR InterPro; IPR016659; TF_II-I. DR Pfam; PF02946; GTF2I; 5. DR PIRSF; PIRSF016441; TF_II-I; 1. DR SUPFAM; SSF117773; SSF117773; 5. DR PROSITE; PS51139; GTF2I; 5. PE 2: Evidence at transcript level; KW Reference proteome {ECO:0000313|Proteomes:UP000002277}. FT REGION 96..117 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 230..250 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 468..492 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 655..677 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 892..927 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 908..927 FT /note="Polar" FT /evidence="ECO:0000256|SAM:MobiDB-lite" SQ SEQUENCE 959 AA; 106025 MW; 05F306237D20111F CRC64; MALLGKRCDV PTNGCGPDRW NSAFTRKDEI ITSLVSALDS MCSALSKLNA EVACVAVHDE SAFVVGTEKG RMFLNARKEL QSDFLRFCRG PPWKDPEAEH PKKVQRGEGG GRSLPRSSLE HGSDVYLLRK MVEEVFDVLY SEALGRASVV PLPYERLLRE PGLLAVQGLP EGLAFRRPAE YDPKALMAIL EHSHRIRFKL KRPLEDGGRD SKALVELNGV SLIPKGSRDC GLHGQAPKVP PQDLPPTATS SSMASFLYST ALPNHAIREL KQEAPSCPLA PSDLGLSRPM PEPKATGAQD FSDCCGQKPT GPGGPLIQNV HASKRILFSI VHDKSEKWDA FIKETEDINT LRECVQILFN SRYAEALGLD HMVPVPYRKI ACDPEAVEIV GIPDKIPFKR PCTYGVPKLK RILEERHSIH FIIKRMFDER IFTGNKFTKD TTKLEPASPP EDTSAEVSRA TVLDLAGNAR SDKGSMSEDC GPGTSGELGG LRPIKIEPED LDIIQVTVPD PSPTSEEMTD SMPGHLPSED SGYGMEMLTD KGLSEDARPE ERPVEDSHGD VIRPLRKQVE LLFNTRYAKA IGISEPVKVP YSKFLMHPEE LFVVGLPEGI SLRRPNCFGI AKLRKILEAS NSIQFVIKRP ELLTEGVKEP IVDSQGTASS LGFSPPALPP ERDSGDPLVD ESLKRQGFQE NYDARLSRID IANTLREQVQ DLFNKKYGEA LGIKYPVQVP YKRIKSNPGS VIIEGLPPGI PFRKPCTFGS QNLERILAVA DKIKFTVTRP FQGLIPKPDE DDANRLGEKV ILREQVKELF NEKYGEALGL NRPVLVPYKL IRDSPDAVEV TGLPDDIPFR NPNTYDIHRL EKILKAREHV RMVIINQLQP FAEICNDAKV PAKDSSIPKR KRKRVSEGNS VSSSSSSSSS SSSNPDSVAS ANQISLVQWP MYMVDYAGLN VQLPGPLNY //