ID A0A061G916_THECC Unreviewed; 308 AA. AC A0A061G916; DT 03-SEP-2014, integrated into UniProtKB/TrEMBL. DT 03-SEP-2014, sequence version 1. DT 02-DEC-2020, entry version 33. DE SubName: Full=ZIM-like 1 {ECO:0000313|EMBL:EOY26355.1}; GN ORFNames=TCM_027858 {ECO:0000313|EMBL:EOY26355.1}; OS Theobroma cacao (Cacao) (Cocoa). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma. OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY26355.1, ECO:0000313|Proteomes:UP000026915}; RN [1] {ECO:0000313|EMBL:EOY26355.1, ECO:0000313|Proteomes:UP000026915} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915}; RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53; RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L., RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C., RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A., RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H., RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L., RA Kuhn D.N.; RT "The genome sequence of the most widely cultivated cacao type and its use RT to identify candidate genes regulating pod color."; RL Genome Biol. 14:r53-r53(2013). CC -!- FUNCTION: Transcriptional activator that specifically binds 5'-GATA-3' CC or 5'-GAT-3' motifs within gene promoters. CC {ECO:0000256|ARBA:ARBA00002206}. CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00357}. CC -!- SIMILARITY: Belongs to the type IV zinc-finger family. Class C CC subfamily. {ECO:0000256|ARBA:ARBA00007722}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; CM001884; EOY26355.1; -; Genomic_DNA. DR STRING; 3641.EOY26355; -. DR EnsemblPlants; EOY26355; EOY26355; TCM_027858. DR Gramene; EOY26355; EOY26355; TCM_027858. DR eggNOG; KOG1601; Eukaryota. DR HOGENOM; CLU_057264_0_1_1; -. DR OMA; HHHIMNG; -. DR OrthoDB; 1089061at2759; -. DR Proteomes; UP000026915; Chromosome 6. DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell. DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro. DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro. DR GO; GO:0006355; P:regulation of transcription, DNA-templated; IEA:InterPro. DR CDD; cd00202; ZnF_GATA; 1. DR Gene3D; 3.30.50.10; -; 1. DR InterPro; IPR010402; CCT_domain. DR InterPro; IPR010399; Tify_dom. DR InterPro; IPR000679; Znf_GATA. DR InterPro; IPR013088; Znf_NHR/GATA. DR Pfam; PF06203; CCT; 1. DR Pfam; PF00320; GATA; 1. DR Pfam; PF06200; tify; 1. DR SMART; SM00979; TIFY; 1. DR SMART; SM00401; ZnF_GATA; 1. DR PROSITE; PS51017; CCT; 1. DR PROSITE; PS00344; GATA_ZN_FINGER_1; 1. DR PROSITE; PS50114; GATA_ZN_FINGER_2; 1. DR PROSITE; PS51320; TIFY; 1. PE 3: Inferred from homology; KW Activator {ECO:0000256|ARBA:ARBA00023159}; KW Metal-binding {ECO:0000256|ARBA:ARBA00022723}; KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE- KW ProRule:PRU00357}; Reference proteome {ECO:0000313|Proteomes:UP000026915}; KW Zinc {ECO:0000256|ARBA:ARBA00022833}; KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE- KW ProRule:PRU00094}. FT DOMAIN 87..122 FT /note="Tify" FT /evidence="ECO:0000259|PROSITE:PS51320" FT DOMAIN 154..196 FT /note="CCT" FT /evidence="ECO:0000259|PROSITE:PS51017" FT DOMAIN 224..272 FT /note="GATA-type" FT /evidence="ECO:0000259|PROSITE:PS50114" FT REGION 34..94 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 187..224 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 34..62 FT /note="Polyampholyte" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 188..224 FT /note="Polar" FT /evidence="ECO:0000256|SAM:MobiDB-lite" SQ SEQUENCE 308 AA; 33850 MW; 29EE51239CCBB307 CRC64; MDGIHGKNGR MHMGNDVQQP MHHHVHYEHH HHIMNGNGMV DDDDVHHAHH HHHHHDVDDN VGCGEAEGVE AGDLPSDHPG VLSDNQGPDN GDQLTLSFQG QVYVYDSVPP EKVQAVLLLL GGREVPPTMP AIPITTQNNR GLPGTPQRFS VPQRLASLLR FREKRKERNF DKKIRYTVRK EVALRMQRNK GQFTSSKPNT DDSVSAASSL GSNQSWGADG NGSQNQEIVC RHCGISEKST PMMRRGPEGP RTLCNACGLM WANKGTLRDL SKAAPQTGNS SSLSKNGENV NFEADQVVRI TENVSGSS //