ID A0A0B4KHP5_DROME Unreviewed; 1430 AA. AC A0A0B4KHP5; DT 01-APR-2015, integrated into UniProtKB/TrEMBL. DT 01-APR-2015, sequence version 1. DT 18-SEP-2019, entry version 41. DE SubName: Full=Cap-n-collar, isoform O {ECO:0000313|EMBL:AGB96244.1}; GN Name=cnc {ECO:0000313|EMBL:AGB96244.1, GN ECO:0000313|FlyBase:FBgn0262975}; GN Synonyms=5134 {ECO:0000313|EMBL:AGB96244.1}, anon-WO0153538.6 GN {ECO:0000313|EMBL:AGB96244.1}, anon-WO0153538.7 GN {ECO:0000313|EMBL:AGB96244.1}, BcDNA:RE05559 GN {ECO:0000313|EMBL:AGB96244.1}, CG13826 {ECO:0000313|EMBL:AGB96244.1}, GN CG17894 {ECO:0000313|EMBL:AGB96244.1}, CG4566 GN {ECO:0000313|EMBL:AGB96244.1}, CG4578 {ECO:0000313|EMBL:AGB96244.1}, GN CNC {ECO:0000313|EMBL:AGB96244.1}, CnC {ECO:0000313|EMBL:AGB96244.1}, GN Cnc {ECO:0000313|EMBL:AGB96244.1}, Cnc-C GN {ECO:0000313|EMBL:AGB96244.1}, cnc-C {ECO:0000313|EMBL:AGB96244.1}, GN CNC_DROME {ECO:0000313|EMBL:AGB96244.1}, CncC GN {ECO:0000313|EMBL:AGB96244.1}, cncC {ECO:0000313|EMBL:AGB96244.1}, GN DM12 {ECO:0000313|EMBL:AGB96244.1}, Dmel\CG43286 GN {ECO:0000313|EMBL:AGB96244.1}, dNrf2 {ECO:0000313|EMBL:AGB96244.1}, GN l(3)03921 {ECO:0000313|EMBL:AGB96244.1}, l(3)j5E7 GN {ECO:0000313|EMBL:AGB96244.1}, NRF2 {ECO:0000313|EMBL:AGB96244.1}, GN Nrf2 {ECO:0000313|EMBL:AGB96244.1}; GN ORFNames=CG43286 {ECO:0000313|EMBL:AGB96244.1, GN ECO:0000313|FlyBase:FBgn0262975}, Dmel_CG43286 GN {ECO:0000313|EMBL:AGB96244.1}; OS Drosophila melanogaster (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7227 {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803}; RN [1] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=10731132; DOI=10.1126/science.287.5461.2185; RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D., RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F., RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N., RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., RA Brandon R.C., Rogers Y.H., Blazej R.G., Champe M., Pfeiffer B.D., RA Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Gabor G.L., RA Abril J.F., Agbayani A., An H.J., Andrews-Pfannkoch C., Baldwin D., RA Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P., RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., RA Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P., RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., RA Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P., RA Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W., RA Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K., RA Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M., RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., RA Hostin D., Houston K.A., Howland T.J., Wei M.H., Ibegwam C., RA Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., RA Kimmel B.E., Kodira C.D., Kraft C., Kravitz S., Kulp D., Lai Z., RA Lasko P., Lei Y., Levitsky A.A., Li J., Li Z., Liang Y., Lin X., RA Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D., RA Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A., RA Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L., RA Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M., RA Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G., RA Reinert K., Remington K., Saunders R.D., Scheeler F., Shen H., RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T., RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., RA Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X., RA Wang Z.Y., Wassarman D.A., Weinstock G.M., Weissenbach J., RA Williams S.M., WoodageT, Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., RA Yeh R.F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L., RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., Zhu X., Smith H.O., RA Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.; RT "The genome sequence of Drosophila melanogaster."; RL Science 287:2185-2195(2000). RN [2] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537568; RA Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A., RA Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A., RA George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R., RA Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J., RA Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C., RA Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.; RT "Finishing a whole-genome shotgun: release 3 of the Drosophila RT melanogaster euchromatic genome sequence."; RL Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002). RN [3] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP GENOME REANNOTATION. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083; RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S., RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E., RA Smith C.D., Tupy J.L., Whitfied E.J., Bayraktaroglu L., Berman B.P., RA Bettencourt B.R., Celniker S.E., de Grey A.D., Drysdale R.A., RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., RA Stapleton M., Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., RA Lewis S.E.; RT "Annotation of the Drosophila melanogaster euchromatic genome: a RT systematic review."; RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002). RN [4] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537573; RA Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R., RA Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., RA Ashburner M., Celniker S.E.; RT "The transposable elements of the Drosophila melanogaster euchromatin: RT a genomics perspective."; RL Genome Biol. 3:RESEARCH0084-RESEARCH0084(2002). RN [5] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537574; RA Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A., RA Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G., RA Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M., RA Karpen G.H.; RT "Heterochromatic sequences in a Drosophila whole-genome shotgun RT assembly."; RL Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002). RN [6] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=16110336; DOI=10.1371/journal.pcbi.0010022; RA Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D., RA Ashburner M., Anxolabehere D.; RT "Combined evidence annotation of transposable elements in genome RT sequences."; RL PLoS Comput. Biol. 1:166-175(2005). RN [7] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=17569856; DOI=10.1126/science.1139815; RA Smith C.D., Shu S., Mungall C.J., Karpen G.H.; RT "The Release 5.1 annotation of Drosophila melanogaster RT heterochromatin."; RL Science 316:1586-1591(2007). RN [8] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=17569867; DOI=10.1126/science.1139816; RA Hoskins R.A., Carlson J.W., Kennedy C., Acevedo D., Evans-Holm M., RA Frise E., Wan K.H., Park S., Mendez-Lago M., Rossi F., Villasante A., RA Dimitri P., Karpen G.H., Celniker S.E.; RT "Sequence finishing and mapping of Drosophila melanogaster RT heterochromatin."; RL Science 316:1625-1628(2007). CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|SAAS:SAAS01109866}. CC -!- SIMILARITY: Belongs to the bZIP family. CC {ECO:0000256|SAAS:SAAS00810676}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; AE014297; AGB96244.1; -; Genomic_DNA. DR RefSeq; NP_001262864.1; NM_001275935.1. DR GeneID; 42743; -. DR CTD; 42743; -. DR FlyBase; FBgn0262975; cnc. DR eggNOG; KOG3863; Eukaryota. DR eggNOG; ENOG410ZGMS; LUCA. DR OMA; VPMIETQ; -. DR GenomeRNAi; 42743; -. DR Proteomes; UP000000803; Chromosome 3R. DR ExpressionAtlas; A0A0B4KHP5; baseline and differential. DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell. DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW. DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro. DR InterPro; IPR004827; bZIP. DR InterPro; IPR004826; bZIP_Maf. DR InterPro; IPR008917; TF_DNA-bd_sf. DR Pfam; PF03131; bZIP_Maf; 1. DR SMART; SM00338; BRLZ; 1. DR SUPFAM; SSF47454; SSF47454; 1. DR PROSITE; PS50217; BZIP; 1. DR PROSITE; PS00036; BZIP_BASIC; 1. PE 3: Inferred from homology; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000803}; KW DNA-binding {ECO:0000256|SAAS:SAAS00812580}; KW Nucleus {ECO:0000256|SAAS:SAAS01109505}; KW Reference proteome {ECO:0000313|Proteomes:UP000000803}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transcription {ECO:0000256|SAAS:SAAS01002396}; KW Transcription regulation {ECO:0000256|SAAS:SAAS01001750}. FT SIGNAL 1 18 {ECO:0000256|SAM:SignalP}. FT CHAIN 19 1430 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002094263. FT DOMAIN 1195 1258 BZIP. {ECO:0000259|PROSITE:PS50217}. FT REGION 255 286 Disordered. {ECO:0000256|SAM:MobiDB- FT lite}. FT REGION 377 421 Disordered. {ECO:0000256|SAM:MobiDB- FT lite}. FT REGION 508 626 Disordered. {ECO:0000256|SAM:MobiDB- FT lite}. FT REGION 654 673 Disordered. {ECO:0000256|SAM:MobiDB- FT lite}. FT REGION 710 770 Disordered. {ECO:0000256|SAM:MobiDB- FT lite}. FT REGION 853 1026 Disordered. {ECO:0000256|SAM:MobiDB- FT lite}. FT REGION 1043 1156 Disordered. {ECO:0000256|SAM:MobiDB- FT lite}. FT REGION 1290 1385 Disordered. {ECO:0000256|SAM:MobiDB- FT lite}. FT COILED 425 445 {ECO:0000256|SAM:Coils}. FT COILED 1220 1254 {ECO:0000256|SAM:Coils}. FT COMPBIAS 256 284 Polar. {ECO:0000256|SAM:MobiDB-lite}. FT COMPBIAS 395 413 Polar. {ECO:0000256|SAM:MobiDB-lite}. FT COMPBIAS 508 529 Polar. {ECO:0000256|SAM:MobiDB-lite}. FT COMPBIAS 530 550 Polyampholyte. {ECO:0000256|SAM:MobiDB- FT lite}. FT COMPBIAS 589 613 Basic. {ECO:0000256|SAM:MobiDB-lite}. FT COMPBIAS 716 741 Polar. {ECO:0000256|SAM:MobiDB-lite}. FT COMPBIAS 742 756 Pro-rich. {ECO:0000256|SAM:MobiDB-lite}. FT COMPBIAS 853 899 Polar. {ECO:0000256|SAM:MobiDB-lite}. FT COMPBIAS 975 994 Polar. {ECO:0000256|SAM:MobiDB-lite}. FT COMPBIAS 1081 1110 Polar. {ECO:0000256|SAM:MobiDB-lite}. FT COMPBIAS 1129 1147 Polar. {ECO:0000256|SAM:MobiDB-lite}. FT COMPBIAS 1295 1381 Polar. {ECO:0000256|SAM:MobiDB-lite}. SQ SEQUENCE 1430 AA; 153324 MW; 1537AED82C02DBDA CRC64; MISNKKSYAM KMLQLALALS LLHYNPDYLL HRWDSQLELG THGDGWELEM LRTVHRLDMD HNPYGNRKGL SPRIEDLLNF DDPSLGGMAN GIGGCKLPPR FNGSTFVMNL HNTTGNSSVQ TAALQDVQST SAAATGGTMV VGTGGAPTSG GQTSGSALGE IHIDTASLDP GNANHSPLHP TSELDTFLTP HALQDQRSIW EQNLADLYDY NDLSLQTSPY ANLPLKDGQP QPSNSSHLDL SLAALLHGFT GGSGAPLSTA ALNDSTPHPR NLGSVTNNSA GRSDDGEESL YLGRLFGEDE DEDYEGELIG GVANACEVEG LTTDEPFGSN CFANEVEIGD DEEESEIAEV LYKQDVDLGF SLDQEAIINA SYASGNSAAT NVKSKPEDET KSSDPSISES SGFKDTDVNA ENEASAASVD DIEKLKALEE LQQDKDKNNE NQLEDITNEW NGIPFTIDNE TGEYIRLPLD ELLNDVLKLS EFPLQDDLSN DPVASTSQAA AAFNENQAQR IVSETGEDLL SGEGISSKQN RNEAKNKDND PEKADGDSFS VSDFEELQNS VGSPLFDLDE DAKKELDEML QSAVPSYHHP HPHHGHPHAH PHSHHHASMH HAHAHHAAAA AAAHQRAVQQ ANYGGGVGVG VGVGVGVGSG TGSAFQRQPA AGGFHHGHHQ GRMPRLNRSV SMERLQDFAT YFSPIPSMVG GVSDMSPYPH HYPGYSYQAS PSNGAPGTPG QHGQYGSGAN ATLQPPPPPP PPHHAAMLHH PNAALGDICP TGQPHYGHNL GSAVTSSMHL TNSSHEADGA AAAAAAYKVE HDLMYYGNTS SDINQTDGFI NSIFTDEDLH LMDMNESFCR MVDNSTSNNS SVLGLPSSGH VSNGSGSSAQ LGAGNPHGNQ ANGASGGVGS MSGSAVGAGA TGMTADLLAS GGAGAQGGAD RLDASSDSAV SSMGSERVPS LSDGEWGEGS DSAQDYHQGK YGGPYDFSYN NNSRLSTATR QPPVAQKKHQ LYGKRDPHKQ TPSALPPTAP PAAATAVQSQ SIKYEYDAGY ASSGMASGGI SEPGAMGPAL SKDYHHHQPY GMGASGSAFS GDYTVRPSPR TSQDLVQLNH TYSLPQGSGS LPRPQARDKK PLVATKTASK GASAGNSSSV GGNSSNLEEE HLTRDEKRAR SLNIPISVPD IINLPMDEFN ERLSKYDLSE NQLSLIRDIR RRGKNKVAAQ NCRKRKLDQI LTLEDEVNAV VKRKTQLNQD RDHLESERKR ISNKFAMLHR HVFQYLRDPE GNPCSPADYS LQQAADGSVY LLPREKSEGN NTATAASNAV SSASGGSLNG HVPTQAPMHS HQSHGMQAQH VVGGMSQQQQ QQSRLPPHLQ QQHHLQSQQQ QPGGQQQQQH RKEXKYLLRI AATKSWFTSF QRGQQHRSVQ QQQQQFDYRY DMMNNSYLYY //