ID HM21_CAEEL Reviewed; 495 AA. AC Q22811; Q8WQE1; DT 27-APR-2001, integrated into UniProtKB/Swiss-Prot. DT 06-JUN-2002, sequence version 3. DT 10-FEB-2021, entry version 146. DE RecName: Full=Homeobox protein ceh-21; GN Name=ceh-21; ORFNames=T26C11.6; OS Caenorhabditis elegans. OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae; OC Caenorhabditis. OX NCBI_TaxID=6239; RN [1] RP NUCLEOTIDE SEQUENCE [MRNA]. RC STRAIN=Bristol N2; RX PubMed=11902672; RA Buerglin T.R., Cassata G.; RT "Loss and gain of domains during evolution of cut superclass homeobox RT genes."; RL Int. J. Dev. Biol. 46:115-123(2002). RN [2] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Bristol N2; RX PubMed=9851916; DOI=10.1126/science.282.5396.2012; RG The C. elegans sequencing consortium; RT "Genome sequence of the nematode C. elegans: a platform for investigating RT biology."; RL Science 282:2012-2018(1998). RN [3] RP NUCLEOTIDE SEQUENCE [MRNA] OF 313-495. RC STRAIN=Bristol N2; RX PubMed=9593691; DOI=10.1074/jbc.273.22.13552; RA Lannoy V.J., Buerglin T.R., Rousseau G.G., Lemaigre F.P.; RT "Isoforms of hepatocyte nuclear factor-6 differ in DNA-binding properties, RT contain a bifunctional homeodomain, and define the new ONECUT class of RT homeodomain proteins."; RL J. Biol. Chem. 273:13552-13562(1998). CC -!- FUNCTION: Probable DNA-binding regulatory protein involved in cell-fate CC specification. {ECO:0000250}. CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000255|PROSITE-ProRule:PRU00108, CC ECO:0000255|PROSITE-ProRule:PRU00374}. CC -!- SIMILARITY: Belongs to the CUT homeobox family. {ECO:0000305}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; AJ427855; CAD20808.1; -; mRNA. DR EMBL; FO080541; CCD64528.1; -; Genomic_DNA. DR EMBL; AF023470; AAB86814.1; -; mRNA. DR PIR; T28912; T28912. DR PIR; T42240; T42240. DR RefSeq; NP_508341.2; NM_075940.5. DR SMR; Q22811; -. DR BioGRID; 45451; 2. DR IntAct; Q22811; 2. DR STRING; 6239.T26C11.6; -. DR EPD; Q22811; -. DR PaxDb; Q22811; -. DR EnsemblMetazoa; T26C11.6.1; T26C11.6.1; WBGene00000444. DR UCSC; T26C11.6.2; c. elegans. DR WormBase; T26C11.6; CE29823; WBGene00000444; ceh-21. DR eggNOG; KOG2252; Eukaryota. DR GeneTree; ENSGT00950000183103; -. DR HOGENOM; CLU_551232_0_0_1; -. DR InParanoid; Q22811; -. DR PRO; PR:Q22811; -. DR Proteomes; UP000001940; Chromosome X. DR Bgee; WBGene00000444; Expressed in multi-cellular organism and 5 other tissues. DR GO; GO:0005634; C:nucleus; IBA:GO_Central. DR GO; GO:0003677; F:DNA binding; IBA:GO_Central. DR GO; GO:0030154; P:cell differentiation; IBA:GO_Central. DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IBA:GO_Central. DR CDD; cd00086; homeodomain; 1. DR Gene3D; 1.10.260.40; -; 1. DR InterPro; IPR003350; CUT_dom. DR InterPro; IPR009057; Homeobox-like_sf. DR InterPro; IPR001356; Homeobox_dom. DR InterPro; IPR010982; Lambda_DNA-bd_dom_sf. DR Pfam; PF02376; CUT; 1. DR Pfam; PF00046; Homeodomain; 1. DR SMART; SM01109; CUT; 1. DR SMART; SM00389; HOX; 1. DR SUPFAM; SSF46689; SSF46689; 1. DR SUPFAM; SSF47413; SSF47413; 1. DR PROSITE; PS51042; CUT; 1. DR PROSITE; PS50071; HOMEOBOX_2; 1. PE 2: Evidence at transcript level; KW DNA-binding; Homeobox; Nucleus; Reference proteome; Transcription; KW Transcription regulation. FT CHAIN 1..495 FT /note="Homeobox protein ceh-21" FT /id="PRO_0000202408" FT DNA_BIND 284..370 FT /note="CUT" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00374" FT DNA_BIND 389..449 FT /note="Homeobox" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108" SQ SEQUENCE 495 AA; 54823 MW; C4FFA0984D0DAA08 CRC64; MSQQFQASSG TGSASLREFK TEHEDLREDL PYSTLRTLFG ITLDKDASQA LNIALLLYGH NYPQQVVPPE RNYAELDAQL ESVVLEDHTA ESTMEPGVSA TVTEQLEEKS DKSSDGDGTS KRLTRSLKSV ENETEEDHEE KEDEAPQSSR RESTRLKRKL LESQKTVQTT GNSSRASSKS QEKEVPGTKS QCAPKIRTTP EQSKAATKRQ SSTTVRASST CGSSVSSTST VSSPDYTAKK GRATETPKLE ELAPKKQSSA TPKPGGEVCV WDGVQIGDLS AQMNAQIGDD EELDTVDIAR RILSELKERC IPQTALAEKI LARSQGTLSD LLRMPKPWSV MKNGRATFQR MSNWLGLDPD VRRALCFLPK EDVARITGLD EPTPAKRKKT VKVIRLTFTE TQLKSLQKSF QQNHRPTREM RQKLSATLEL DFSTVGNFFM NSRRRLRIDQ QISRSSRSTG NGADTEDELD EEDVVVENVI ADATDASNQP GPSHL //