ID   H2QUR4_PANTR Unreviewed; 959 AA.
AC   H2QUR4; A0A2J8Q874;
DT   21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT   21-MAR-2012, sequence version 1.
DT   08-MAY-2019, entry version 46.
DE   SubName: Full=GTF2I repeat domain containing 1 {ECO:0000313|EMBL:JAA08002.1, ECO:0000313|Ensembl:ENSPTRP00000032994};
DE   SubName: Full=GTF2IRD1 isoform 4 {ECO:0000313|EMBL:PNI92471.1};
GN   Name=GTF2IRD1 {ECO:0000313|EMBL:JAA08002.1,
GN   ECO:0000313|Ensembl:ENSPTRP00000032994, ECO:0000313|VGNC:VGNC:8726};
GN   ORFNames=CK820_G0040825 {ECO:0000313|EMBL:PNI92471.1};
OS   Pan troglodytes (Chimpanzee).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
OC   Catarrhini; Hominidae; Pan.
OX   NCBI_TaxID=9598 {ECO:0000313|Ensembl:ENSPTRP00000032994, ECO:0000313|Proteomes:UP000002277};
RN   [1] {ECO:0000313|Ensembl:ENSPTRP00000032994, ECO:0000313|Proteomes:UP000002277}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=16136131; DOI=10.1038/nature04072;
RG   Chimpanzee sequencing and analysis consortium;
RT   "Initial sequence of the chimpanzee genome and comparison with the
RT   human genome.";
RL   Nature 437:69-87(2005).
RN   [2] {ECO:0000313|Ensembl:ENSPTRP00000032994}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (FEB-2012) to UniProtKB.
RN   [3] {ECO:0000313|EMBL:JAA08002.1}
RP   NUCLEOTIDE SEQUENCE.
RC   TISSUE=Adipose stromal {ECO:0000313|EMBL:JAA08002.1}, Skeletal muscle
RC   {ECO:0000313|EMBL:JAA44080.1}, Skin {ECO:0000313|EMBL:JAA24036.1}, and
RC   Smooth vascular {ECO:0000313|EMBL:JAA19101.1};
RA   Maudhoo M.D., Meehan D.T., Norgren R.B.Jr.;
RT   "De novo assembly of the reference chimpanzee transcriptome from
RT   NextGen mRNA sequences.";
RL   Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases.
RN   [4] {ECO:0000313|EMBL:PNI92471.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Yerkes chimp pedigree #C0471 {ECO:0000313|EMBL:PNI92471.1};
RC   TISSUE=Blood {ECO:0000313|EMBL:PNI92471.1};
RA   Pollen A., Hastie A., Hormozdiari F., Dougherty M., Liu R.,
RA   Chaisson M., Hoppe E., Hill C., Pang A., Hillier L., Baker C.,
RA   Armstrong J., Shendure J., Paten B., Wilson R., Chao H., Schneider V.,
RA   Ventura M., Kronenberg Z., Murali S., Gordon D., Cantsilieris S.,
RA   Munson K., Nelson B., Raja A., Underwood J., Diekhans M., Fiddes I.,
RA   Haussler D., Eichler E.;
RT   "High-resolution comparative analysis of great ape genomes.";
RL   Submitted (DEC-2017) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AC188564; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AC200908; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; GABC01003336; JAA08002.1; -; mRNA.
DR   EMBL; GABF01003044; JAA19101.1; -; mRNA.
DR   EMBL; GABD01009064; JAA24036.1; -; mRNA.
DR   EMBL; GABE01000659; JAA44080.1; -; mRNA.
DR   EMBL; NBAG03000068; PNI92471.1; -; Genomic_DNA.
DR   RefSeq; XP_001150464.2; XM_001150464.5.
DR   STRING; 9598.ENSPTRP00000032994; -.
DR   Ensembl; ENSPTRT00000035691; ENSPTRP00000032994; ENSPTRG00000019292.
DR   GeneID; 463471; -.
DR   KEGG; ptr:463471; -.
DR   CTD; 9569; -.
DR   VGNC; VGNC:8726; GTF2IRD1.
DR   eggNOG; ENOG410IEPZ; Eukaryota.
DR   eggNOG; ENOG41100H8; LUCA.
DR   GeneTree; ENSGT00940000159414; -.
DR   KO; K03121; -.
DR   OMA; KTDKWDS; -.
DR   OrthoDB; 115381at2759; -.
DR   TreeFam; TF352524; -.
DR   Proteomes; UP000002277; Chromosome 7.
DR   Bgee; ENSPTRG00000019292; Expressed in 6 organ(s), highest expression level in adult mammalian kidney.
DR   GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR   GO; GO:0006366; P:transcription by RNA polymerase II; IEA:InterPro.
DR   Gene3D; 3.90.1460.10; -; 5.
DR   InterPro; IPR004212; GTF2I.
DR   InterPro; IPR036647; GTF2I-like_rpt_sf.
DR   InterPro; IPR016659; TF_II-I.
DR   Pfam; PF02946; GTF2I; 5.
DR   PIRSF; PIRSF016441; TF_II-I; 1.
DR   SUPFAM; SSF117773; SSF117773; 5.
DR   PROSITE; PS51139; GTF2I; 5.
PE   2: Evidence at transcript level;
KW   Complete proteome {ECO:0000313|Proteomes:UP000002277};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002277}.
SQ   SEQUENCE 959 AA; 106025 MW; 05F306237D20111F CRC64;
     MALLGKRCDV PTNGCGPDRW NSAFTRKDEI ITSLVSALDS MCSALSKLNA EVACVAVHDE
     SAFVVGTEKG RMFLNARKEL QSDFLRFCRG PPWKDPEAEH PKKVQRGEGG GRSLPRSSLE
     HGSDVYLLRK MVEEVFDVLY SEALGRASVV PLPYERLLRE PGLLAVQGLP EGLAFRRPAE
     YDPKALMAIL EHSHRIRFKL KRPLEDGGRD SKALVELNGV SLIPKGSRDC GLHGQAPKVP
     PQDLPPTATS SSMASFLYST ALPNHAIREL KQEAPSCPLA PSDLGLSRPM PEPKATGAQD
     FSDCCGQKPT GPGGPLIQNV HASKRILFSI VHDKSEKWDA FIKETEDINT LRECVQILFN
     SRYAEALGLD HMVPVPYRKI ACDPEAVEIV GIPDKIPFKR PCTYGVPKLK RILEERHSIH
     FIIKRMFDER IFTGNKFTKD TTKLEPASPP EDTSAEVSRA TVLDLAGNAR SDKGSMSEDC
     GPGTSGELGG LRPIKIEPED LDIIQVTVPD PSPTSEEMTD SMPGHLPSED SGYGMEMLTD
     KGLSEDARPE ERPVEDSHGD VIRPLRKQVE LLFNTRYAKA IGISEPVKVP YSKFLMHPEE
     LFVVGLPEGI SLRRPNCFGI AKLRKILEAS NSIQFVIKRP ELLTEGVKEP IVDSQGTASS
     LGFSPPALPP ERDSGDPLVD ESLKRQGFQE NYDARLSRID IANTLREQVQ DLFNKKYGEA
     LGIKYPVQVP YKRIKSNPGS VIIEGLPPGI PFRKPCTFGS QNLERILAVA DKIKFTVTRP
     FQGLIPKPDE DDANRLGEKV ILREQVKELF NEKYGEALGL NRPVLVPYKL IRDSPDAVEV
     TGLPDDIPFR NPNTYDIHRL EKILKAREHV RMVIINQLQP FAEICNDAKV PAKDSSIPKR
     KRKRVSEGNS VSSSSSSSSS SSSNPDSVAS ANQISLVQWP MYMVDYAGLN VQLPGPLNY
//