ID A0A0B4KHP5_DROME Unreviewed; 1430 AA. AC A0A0B4KHP5; DT 01-APR-2015, integrated into UniProtKB/TrEMBL. DT 01-APR-2015, sequence version 1. DT 03-MAY-2023, entry version 58. DE SubName: Full=Cap-n-collar, isoform O {ECO:0000313|EMBL:AGB96244.1}; GN Name=cnc {ECO:0000313|EMBL:AGB96244.1, GN ECO:0000313|FlyBase:FBgn0262975}; GN Synonyms=5134 {ECO:0000313|EMBL:AGB96244.1}, anon-WO0153538.6 GN {ECO:0000313|EMBL:AGB96244.1}, anon-WO0153538.7 GN {ECO:0000313|EMBL:AGB96244.1}, BcDNA:RE05559 GN {ECO:0000313|EMBL:AGB96244.1}, CG13826 {ECO:0000313|EMBL:AGB96244.1}, GN CG17894 {ECO:0000313|EMBL:AGB96244.1}, CG4566 GN {ECO:0000313|EMBL:AGB96244.1}, CG4578 {ECO:0000313|EMBL:AGB96244.1}, GN CNC {ECO:0000313|EMBL:AGB96244.1}, CnC {ECO:0000313|EMBL:AGB96244.1}, GN Cnc {ECO:0000313|EMBL:AGB96244.1}, Cnc-C GN {ECO:0000313|EMBL:AGB96244.1}, cnc-C {ECO:0000313|EMBL:AGB96244.1}, GN CNC_DROME {ECO:0000313|EMBL:AGB96244.1}, CncC GN {ECO:0000313|EMBL:AGB96244.1}, cncC {ECO:0000313|EMBL:AGB96244.1}, GN cncC/Nrf2 {ECO:0000313|EMBL:AGB96244.1}, DM12 GN {ECO:0000313|EMBL:AGB96244.1}, Dmel\CG43286 GN {ECO:0000313|EMBL:AGB96244.1}, dNrf2 {ECO:0000313|EMBL:AGB96244.1}, GN l(3)03921 {ECO:0000313|EMBL:AGB96244.1}, l(3)j5E7 GN {ECO:0000313|EMBL:AGB96244.1}, NRF2 {ECO:0000313|EMBL:AGB96244.1}, GN Nrf2 {ECO:0000313|EMBL:AGB96244.1}, nrf2 GN {ECO:0000313|EMBL:AGB96244.1}; GN ORFNames=CG43286 {ECO:0000313|EMBL:AGB96244.1, GN ECO:0000313|FlyBase:FBgn0262975}, Dmel_CG43286 GN {ECO:0000313|EMBL:AGB96244.1}; OS Drosophila melanogaster (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea; OC Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7227 {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803}; RN [1] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=10731132; DOI=10.1126/science.287.5461.2185; RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D., RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F., RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N., RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C., RA Rogers Y.H., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C., RA Baxter E.G., Helt G., Nelson C.R., Gabor G.L., Abril J.F., Agbayani A., RA An H.J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A., RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V., RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J., RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E., RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B., RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M., RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., Hostin D., RA Houston K.A., Howland T.J., Wei M.H., Ibegwam C., Jalali M., Kalush F., RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D., RA Kraft C., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A., RA Li J., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C., RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C., RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L., RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., RA Reese M.G., Reinert K., Remington K., Saunders R.D., Scheeler F., Shen H., RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T., Spier E., RA Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C., RA Turner R., Venter E., Wang A.H., Wang X., Wang Z.Y., Wassarman D.A., RA Weinstock G.M., Weissenbach J., Williams S.M., WoodageT, Worley K.C., RA Wu D., Yang S., Yao Q.A., Ye J., Yeh R.F., Zaveri J.S., Zhan M., Zhang G., RA Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., RA Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.; RT "The genome sequence of Drosophila melanogaster."; RL Science 287:2185-2195(2000). RN [2] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537568; RA Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A., RA Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A., RA George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R., RA Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J., RA Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C., RA Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.; RT "Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster RT euchromatic genome sequence."; RL Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002). RN [3] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP GENOME REANNOTATION. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083; RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S., RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E., RA Smith C.D., Tupy J.L., Whitfied E.J., Bayraktaroglu L., Berman B.P., RA Bettencourt B.R., Celniker S.E., de Grey A.D., Drysdale R.A., Harris N.L., RA Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M., Yamada C., RA Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.; RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic RT review."; RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002). RN [4] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537573; RA Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R., RA Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., Ashburner M., RA Celniker S.E.; RT "The transposable elements of the Drosophila melanogaster euchromatin: a RT genomics perspective."; RL Genome Biol. 3:RESEARCH0084.1-RESEARCH0084.20(2002). RN [5] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537574; RA Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A., RA Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G., RA Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M., RA Karpen G.H.; RT "Heterochromatic sequences in a Drosophila whole-genome shotgun assembly."; RL Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002). RN [6] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=16110336; DOI=10.1371/journal.pcbi.0010022; RA Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D., RA Ashburner M., Anxolabehere D.; RT "Combined evidence annotation of transposable elements in genome RT sequences."; RL PLoS Comput. Biol. 1:166-175(2005). RN [7] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=17569856; DOI=10.1126/science.1139815; RA Smith C.D., Shu S., Mungall C.J., Karpen G.H.; RT "The Release 5.1 annotation of Drosophila melanogaster heterochromatin."; RL Science 316:1586-1591(2007). RN [8] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=17569867; DOI=10.1126/science.1139816; RA Hoskins R.A., Carlson J.W., Kennedy C., Acevedo D., Evans-Holm M., RA Frise E., Wan K.H., Park S., Mendez-Lago M., Rossi F., Villasante A., RA Dimitri P., Karpen G.H., Celniker S.E.; RT "Sequence finishing and mapping of Drosophila melanogaster RT heterochromatin."; RL Science 316:1625-1628(2007). CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; AE014297; AGB96244.1; -; Genomic_DNA. DR RefSeq; NP_001262864.1; NM_001275935.1. DR DNASU; 42743; -. DR EnsemblMetazoa; FBtr0330372; FBpp0303398; FBgn0262975. DR GeneID; 42743; -. DR AGR; FB:FBgn0262975; -. DR CTD; 42743; -. DR FlyBase; FBgn0262975; cnc. DR VEuPathDB; VectorBase:FBgn0262975; -. DR OMA; TESFCRM; -. DR OrthoDB; 382726at2759; -. DR BioGRID-ORCS; 42743; 0 hits in 3 CRISPR screens. DR ChiTaRS; cnc; fly. DR GenomeRNAi; 42743; -. DR Proteomes; UP000000803; Chromosome 3R. DR Bgee; FBgn0262975; Expressed in crop (Drosophila) and 59 other tissues. DR ExpressionAtlas; A0A0B4KHP5; baseline and differential. DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell. DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro. DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IEA:InterPro. DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IEA:InterPro. DR CDD; cd14698; bZIP_CNC; 1. DR Gene3D; 1.10.880.10; Transcription factor, Skn-1-like, DNA-binding domain; 1. DR InterPro; IPR004827; bZIP. DR InterPro; IPR004826; bZIP_Maf. DR InterPro; IPR046347; bZIP_sf. DR InterPro; IPR047167; NFE2-like. DR InterPro; IPR008917; TF_DNA-bd_sf. DR PANTHER; PTHR24411; NUCLEAR FACTOR ERYTHROID 2-RELATED FACTOR; 1. DR PANTHER; PTHR24411:SF55; SEGMENTATION PROTEIN CAP'N'COLLAR; 1. DR Pfam; PF03131; bZIP_Maf; 1. DR SMART; SM00338; BRLZ; 1. DR SUPFAM; SSF47454; A DNA-binding domain in eukaryotic transcription factors; 1. DR SUPFAM; SSF57959; Leucine zipper domain; 1. DR PROSITE; PS50217; BZIP; 1. DR PROSITE; PS00036; BZIP_BASIC; 1. PE 4: Predicted; KW Activator {ECO:0000256|ARBA:ARBA00023159}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW DNA-binding {ECO:0000256|ARBA:ARBA00023125}; KW Reference proteome {ECO:0000313|Proteomes:UP000000803}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transcription {ECO:0000256|ARBA:ARBA00023163}; KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}. FT SIGNAL 1..18 FT /evidence="ECO:0000256|SAM:SignalP" FT CHAIN 19..1430 FT /evidence="ECO:0000256|SAM:SignalP" FT /id="PRO_5002094263" FT DOMAIN 1195..1258 FT /note="BZIP" FT /evidence="ECO:0000259|PROSITE:PS50217" FT REGION 255..286 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 377..421 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 508..626 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 654..673 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 710..770 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 853..1026 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 1043..1156 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 1290..1385 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COILED 1220..1254 FT /evidence="ECO:0000256|SAM:Coils" FT COMPBIAS 256..284 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 395..413 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 508..529 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 530..550 FT /note="Basic and acidic residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 589..613 FT /note="Basic residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 716..741 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 742..756 FT /note="Pro residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 853..899 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 975..994 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 1081..1110 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 1129..1147 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 1295..1381 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" SQ SEQUENCE 1430 AA; 153324 MW; 1537AED82C02DBDA CRC64; MISNKKSYAM KMLQLALALS LLHYNPDYLL HRWDSQLELG THGDGWELEM LRTVHRLDMD HNPYGNRKGL SPRIEDLLNF DDPSLGGMAN GIGGCKLPPR FNGSTFVMNL HNTTGNSSVQ TAALQDVQST SAAATGGTMV VGTGGAPTSG GQTSGSALGE IHIDTASLDP GNANHSPLHP TSELDTFLTP HALQDQRSIW EQNLADLYDY NDLSLQTSPY ANLPLKDGQP QPSNSSHLDL SLAALLHGFT GGSGAPLSTA ALNDSTPHPR NLGSVTNNSA GRSDDGEESL YLGRLFGEDE DEDYEGELIG GVANACEVEG LTTDEPFGSN CFANEVEIGD DEEESEIAEV LYKQDVDLGF SLDQEAIINA SYASGNSAAT NVKSKPEDET KSSDPSISES SGFKDTDVNA ENEASAASVD DIEKLKALEE LQQDKDKNNE NQLEDITNEW NGIPFTIDNE TGEYIRLPLD ELLNDVLKLS EFPLQDDLSN DPVASTSQAA AAFNENQAQR IVSETGEDLL SGEGISSKQN RNEAKNKDND PEKADGDSFS VSDFEELQNS VGSPLFDLDE DAKKELDEML QSAVPSYHHP HPHHGHPHAH PHSHHHASMH HAHAHHAAAA AAAHQRAVQQ ANYGGGVGVG VGVGVGVGSG TGSAFQRQPA AGGFHHGHHQ GRMPRLNRSV SMERLQDFAT YFSPIPSMVG GVSDMSPYPH HYPGYSYQAS PSNGAPGTPG QHGQYGSGAN ATLQPPPPPP PPHHAAMLHH PNAALGDICP TGQPHYGHNL GSAVTSSMHL TNSSHEADGA AAAAAAYKVE HDLMYYGNTS SDINQTDGFI NSIFTDEDLH LMDMNESFCR MVDNSTSNNS SVLGLPSSGH VSNGSGSSAQ LGAGNPHGNQ ANGASGGVGS MSGSAVGAGA TGMTADLLAS GGAGAQGGAD RLDASSDSAV SSMGSERVPS LSDGEWGEGS DSAQDYHQGK YGGPYDFSYN NNSRLSTATR QPPVAQKKHQ LYGKRDPHKQ TPSALPPTAP PAAATAVQSQ SIKYEYDAGY ASSGMASGGI SEPGAMGPAL SKDYHHHQPY GMGASGSAFS GDYTVRPSPR TSQDLVQLNH TYSLPQGSGS LPRPQARDKK PLVATKTASK GASAGNSSSV GGNSSNLEEE HLTRDEKRAR SLNIPISVPD IINLPMDEFN ERLSKYDLSE NQLSLIRDIR RRGKNKVAAQ NCRKRKLDQI LTLEDEVNAV VKRKTQLNQD RDHLESERKR ISNKFAMLHR HVFQYLRDPE GNPCSPADYS LQQAADGSVY LLPREKSEGN NTATAASNAV SSASGGSLNG HVPTQAPMHS HQSHGMQAQH VVGGMSQQQQ QQSRLPPHLQ QQHHLQSQQQ QPGGQQQQQH RKEXKYLLRI AATKSWFTSF QRGQQHRSVQ QQQQQFDYRY DMMNNSYLYY //