ID A0A0B4KHP5_DROME Unreviewed; 1430 AA. AC A0A0B4KHP5; DT 01-APR-2015, integrated into UniProtKB/TrEMBL. DT 01-APR-2015, sequence version 1. DT 12-AUG-2020, entry version 45. DE SubName: Full=Cap-n-collar, isoform O {ECO:0000313|EMBL:AGB96244.1}; GN Name=cnc {ECO:0000313|EMBL:AGB96244.1, GN ECO:0000313|FlyBase:FBgn0262975}; GN Synonyms=5134 {ECO:0000313|EMBL:AGB96244.1}, anon-WO0153538.6 GN {ECO:0000313|EMBL:AGB96244.1}, anon-WO0153538.7 GN {ECO:0000313|EMBL:AGB96244.1}, BcDNA:RE05559 GN {ECO:0000313|EMBL:AGB96244.1}, CG13826 {ECO:0000313|EMBL:AGB96244.1}, GN CG17894 {ECO:0000313|EMBL:AGB96244.1}, CG4566 GN {ECO:0000313|EMBL:AGB96244.1}, CG4578 {ECO:0000313|EMBL:AGB96244.1}, GN CNC {ECO:0000313|EMBL:AGB96244.1}, CnC {ECO:0000313|EMBL:AGB96244.1}, GN Cnc {ECO:0000313|EMBL:AGB96244.1}, Cnc-C GN {ECO:0000313|EMBL:AGB96244.1}, cnc-C {ECO:0000313|EMBL:AGB96244.1}, GN CNC_DROME {ECO:0000313|EMBL:AGB96244.1}, CncC GN {ECO:0000313|EMBL:AGB96244.1}, cncC {ECO:0000313|EMBL:AGB96244.1}, GN DM12 {ECO:0000313|EMBL:AGB96244.1}, Dmel\CG43286 GN {ECO:0000313|EMBL:AGB96244.1}, dNrf2 {ECO:0000313|EMBL:AGB96244.1}, GN l(3)03921 {ECO:0000313|EMBL:AGB96244.1}, l(3)j5E7 GN {ECO:0000313|EMBL:AGB96244.1}, NRF2 {ECO:0000313|EMBL:AGB96244.1}, GN Nrf2 {ECO:0000313|EMBL:AGB96244.1}, nrf2 GN {ECO:0000313|EMBL:AGB96244.1}; GN ORFNames=CG43286 {ECO:0000313|EMBL:AGB96244.1, GN ECO:0000313|FlyBase:FBgn0262975}, Dmel_CG43286 GN {ECO:0000313|EMBL:AGB96244.1}; OS Drosophila melanogaster (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Holometabola; Diptera; Brachycera; Muscomorpha; Ephydroidea; OC Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7227 {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803}; RN [1] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=10731132; DOI=10.1126/science.287.5461.2185; RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D., RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F., RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N., RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C., RA Rogers Y.H., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C., RA Baxter E.G., Helt G., Nelson C.R., Gabor G.L., Abril J.F., Agbayani A., RA An H.J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A., RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V., RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J., RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E., RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B., RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M., RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., Hostin D., RA Houston K.A., Howland T.J., Wei M.H., Ibegwam C., Jalali M., Kalush F., RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D., RA Kraft C., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A., RA Li J., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C., RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C., RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L., RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., RA Reese M.G., Reinert K., Remington K., Saunders R.D., Scheeler F., Shen H., RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T., Spier E., RA Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C., RA Turner R., Venter E., Wang A.H., Wang X., Wang Z.Y., Wassarman D.A., RA Weinstock G.M., Weissenbach J., Williams S.M., WoodageT, Worley K.C., RA Wu D., Yang S., Yao Q.A., Ye J., Yeh R.F., Zaveri J.S., Zhan M., Zhang G., RA Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., RA Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.; RT "The genome sequence of Drosophila melanogaster."; RL Science 287:2185-2195(2000). RN [2] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537568; RA Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A., RA Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A., RA George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R., RA Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J., RA Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C., RA Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.; RT "Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster RT euchromatic genome sequence."; RL Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002). RN [3] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP GENOME REANNOTATION. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083; RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S., RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E., RA Smith C.D., Tupy J.L., Whitfied E.J., Bayraktaroglu L., Berman B.P., RA Bettencourt B.R., Celniker S.E., de Grey A.D., Drysdale R.A., Harris N.L., RA Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M., Yamada C., RA Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.; RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic RT review."; RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002). RN [4] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537573; RA Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R., RA Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., Ashburner M., RA Celniker S.E.; RT "The transposable elements of the Drosophila melanogaster euchromatin: a RT genomics perspective."; RL Genome Biol. 3:RESEARCH0084.1-RESEARCH0084.20(2002). RN [5] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537574; RA Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A., RA Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G., RA Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M., RA Karpen G.H.; RT "Heterochromatic sequences in a Drosophila whole-genome shotgun assembly."; RL Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002). RN [6] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=16110336; DOI=10.1371/journal.pcbi.0010022; RA Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D., RA Ashburner M., Anxolabehere D.; RT "Combined evidence annotation of transposable elements in genome RT sequences."; RL PLoS Comput. Biol. 1:166-175(2005). RN [7] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=17569856; DOI=10.1126/science.1139815; RA Smith C.D., Shu S., Mungall C.J., Karpen G.H.; RT "The Release 5.1 annotation of Drosophila melanogaster heterochromatin."; RL Science 316:1586-1591(2007). RN [8] {ECO:0000313|EMBL:AGB96244.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=17569867; DOI=10.1126/science.1139816; RA Hoskins R.A., Carlson J.W., Kennedy C., Acevedo D., Evans-Holm M., RA Frise E., Wan K.H., Park S., Mendez-Lago M., Rossi F., Villasante A., RA Dimitri P., Karpen G.H., Celniker S.E.; RT "Sequence finishing and mapping of Drosophila melanogaster RT heterochromatin."; RL Science 316:1625-1628(2007). CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; AE014297; AGB96244.1; -; Genomic_DNA. DR RefSeq; NP_001262864.1; NM_001275935.1. DR EnsemblMetazoa; FBtr0330372; FBpp0303398; FBgn0262975. DR GeneID; 42743; -. DR CTD; 42743; -. DR FlyBase; FBgn0262975; cnc. DR OMA; VPMIETQ; -. DR BioGRID-ORCS; 42743; 0 hits in 5 CRISPR screens. DR ChiTaRS; cnc; fly. DR GenomeRNAi; 42743; -. DR Proteomes; UP000000803; Chromosome 3R. DR ExpressionAtlas; A0A0B4KHP5; baseline and differential. DR GO; GO:0042025; C:host cell nucleus; IEA:InterPro. DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell. DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW. DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro. DR InterPro; IPR004827; bZIP. DR InterPro; IPR004826; bZIP_Maf. DR InterPro; IPR008917; TF_DNA-bd_sf. DR Pfam; PF03131; bZIP_Maf; 1. DR SMART; SM00338; BRLZ; 1. DR SUPFAM; SSF47454; SSF47454; 1. DR PROSITE; PS50217; BZIP; 1. DR PROSITE; PS00036; BZIP_BASIC; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW DNA-binding {ECO:0000256|ARBA:ARBA00023125}; KW Reference proteome {ECO:0000313|Proteomes:UP000000803}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transcription {ECO:0000256|ARBA:ARBA00023163}; KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}. FT SIGNAL 1..18 FT /evidence="ECO:0000256|SAM:SignalP" FT CHAIN 19..1430 FT /evidence="ECO:0000256|SAM:SignalP" FT /id="PRO_5002094263" FT DOMAIN 1195..1258 FT /note="BZIP" FT /evidence="ECO:0000259|PROSITE:PS50217" FT REGION 255..286 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 377..421 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 508..626 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 654..673 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 710..770 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 853..1026 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 1043..1156 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 1290..1385 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COILED 425..445 FT /evidence="ECO:0000256|SAM:Coils" FT COILED 1220..1254 FT /evidence="ECO:0000256|SAM:Coils" FT COMPBIAS 256..284 FT /note="Polar" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 395..413 FT /note="Polar" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 508..529 FT /note="Polar" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 530..550 FT /note="Polyampholyte" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 589..613 FT /note="Basic" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 716..741 FT /note="Polar" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 742..756 FT /note="Pro-rich" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 853..899 FT /note="Polar" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 975..994 FT /note="Polar" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 1081..1110 FT /note="Polar" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 1129..1147 FT /note="Polar" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 1295..1381 FT /note="Polar" FT /evidence="ECO:0000256|SAM:MobiDB-lite" SQ SEQUENCE 1430 AA; 153324 MW; 1537AED82C02DBDA CRC64; MISNKKSYAM KMLQLALALS LLHYNPDYLL HRWDSQLELG THGDGWELEM LRTVHRLDMD HNPYGNRKGL SPRIEDLLNF DDPSLGGMAN GIGGCKLPPR FNGSTFVMNL HNTTGNSSVQ TAALQDVQST SAAATGGTMV VGTGGAPTSG GQTSGSALGE IHIDTASLDP GNANHSPLHP TSELDTFLTP HALQDQRSIW EQNLADLYDY NDLSLQTSPY ANLPLKDGQP QPSNSSHLDL SLAALLHGFT GGSGAPLSTA ALNDSTPHPR NLGSVTNNSA GRSDDGEESL YLGRLFGEDE DEDYEGELIG GVANACEVEG LTTDEPFGSN CFANEVEIGD DEEESEIAEV LYKQDVDLGF SLDQEAIINA SYASGNSAAT NVKSKPEDET KSSDPSISES SGFKDTDVNA ENEASAASVD DIEKLKALEE LQQDKDKNNE NQLEDITNEW NGIPFTIDNE TGEYIRLPLD ELLNDVLKLS EFPLQDDLSN DPVASTSQAA AAFNENQAQR IVSETGEDLL SGEGISSKQN RNEAKNKDND PEKADGDSFS VSDFEELQNS VGSPLFDLDE DAKKELDEML QSAVPSYHHP HPHHGHPHAH PHSHHHASMH HAHAHHAAAA AAAHQRAVQQ ANYGGGVGVG VGVGVGVGSG TGSAFQRQPA AGGFHHGHHQ GRMPRLNRSV SMERLQDFAT YFSPIPSMVG GVSDMSPYPH HYPGYSYQAS PSNGAPGTPG QHGQYGSGAN ATLQPPPPPP PPHHAAMLHH PNAALGDICP TGQPHYGHNL GSAVTSSMHL TNSSHEADGA AAAAAAYKVE HDLMYYGNTS SDINQTDGFI NSIFTDEDLH LMDMNESFCR MVDNSTSNNS SVLGLPSSGH VSNGSGSSAQ LGAGNPHGNQ ANGASGGVGS MSGSAVGAGA TGMTADLLAS GGAGAQGGAD RLDASSDSAV SSMGSERVPS LSDGEWGEGS DSAQDYHQGK YGGPYDFSYN NNSRLSTATR QPPVAQKKHQ LYGKRDPHKQ TPSALPPTAP PAAATAVQSQ SIKYEYDAGY ASSGMASGGI SEPGAMGPAL SKDYHHHQPY GMGASGSAFS GDYTVRPSPR TSQDLVQLNH TYSLPQGSGS LPRPQARDKK PLVATKTASK GASAGNSSSV GGNSSNLEEE HLTRDEKRAR SLNIPISVPD IINLPMDEFN ERLSKYDLSE NQLSLIRDIR RRGKNKVAAQ NCRKRKLDQI LTLEDEVNAV VKRKTQLNQD RDHLESERKR ISNKFAMLHR HVFQYLRDPE GNPCSPADYS LQQAADGSVY LLPREKSEGN NTATAASNAV SSASGGSLNG HVPTQAPMHS HQSHGMQAQH VVGGMSQQQQ QQSRLPPHLQ QQHHLQSQQQ QPGGQQQQQH RKEXKYLLRI AATKSWFTSF QRGQQHRSVQ QQQQQFDYRY DMMNNSYLYY //