ID   HM21_CAEEL              Reviewed;         495 AA.
AC   Q22811; Q8WQE1;
DT   27-APR-2001, integrated into UniProtKB/Swiss-Prot.
DT   06-JUN-2002, sequence version 3.
DT   30-NOV-2010, entry version 83.
DE   RecName: Full=Homeobox protein ceh-21;
GN   Name=ceh-21; ORFNames=T26C11.6;
OS   Caenorhabditis elegans.
OC   Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea;
OC   Rhabditidae; Peloderinae; Caenorhabditis.
OX   NCBI_TaxID=6239;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [MRNA].
RC   STRAIN=Bristol N2;
RX   PubMed=11902672;
RA   Buerglin T.R., Cassata G.;
RT   "Loss and gain of domains during evolution of cut superclass homeobox
RT   genes.";
RL   Int. J. Dev. Biol. 46:115-123(2002).
RN   [2]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Bristol N2;
RX   MEDLINE=99069613; PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG   The C. elegans sequencing consortium;
RT   "Genome sequence of the nematode C. elegans: a platform for
RT   investigating biology.";
RL   Science 282:2012-2018(1998).
RN   [3]
RP   NUCLEOTIDE SEQUENCE [MRNA] OF 313-495.
RC   STRAIN=Bristol N2;
RX   MEDLINE=98256275; PubMed=9593691; DOI=10.1074/jbc.273.22.13552;
RA   Lannoy V.J., Buerglin T.R., Rousseau G.G., Lemaigre F.P.;
RT   "Isoforms of hepatocyte nuclear factor-6 differ in DNA-binding
RT   properties, contain a bifunctional homeodomain, and define the new
RT   ONECUT class of homeodomain proteins.";
RL   J. Biol. Chem. 273:13552-13562(1998).
CC   -!- SUBCELLULAR LOCATION: Nucleus (Potential).
CC   -!- SIMILARITY: Belongs to the CUT homeobox family.
CC   -!- SIMILARITY: Contains 1 CUT DNA-binding domain.
CC   -!- SIMILARITY: Contains 1 homeobox DNA-binding domain.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AJ427855; CAD20808.1; -; mRNA.
DR   EMBL; U41017; AAC48216.2; -; Genomic_DNA.
DR   EMBL; AF023470; AAB86814.1; -; mRNA.
DR   PIR; T28912; T28912.
DR   PIR; T42240; T42240.
DR   RefSeq; NP_508341.2; NM_075940.4.
DR   UniGene; Cel.9167; -.
DR   ProteinModelPortal; Q22811; -.
DR   SMR; Q22811; 290-447.
DR   EnsemblMetazoa; T26C11.6; T26C11.6; T26C11.6.
DR   GeneID; 180504; -.
DR   KEGG; cel:T26C11.6; -.
DR   NMPDR; fig|6239.3.peg.22556; -.
DR   UCSC; T26C11.6.2; c. elegans.
DR   CTD; 180504; -.
DR   WormBase; T26C11.6; CE29823; WBGene00000444; ceh-21.
DR   eggNOG; KOG2252; -.
DR   InParanoid; Q22811; -.
DR   NextBio; 909656; -.
DR   ArrayExpress; Q22811; -.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR   GO; GO:0003700; F:sequence-specific DNA binding transcription...; IEA:InterPro.
DR   GO; GO:0006355; P:regulation of transcription, DNA-dependent; IEA:InterPro.
DR   InterPro; IPR003350; Hmoeo_CUT.
DR   InterPro; IPR001356; Homeobox.
DR   InterPro; IPR009057; Homeodomain-like.
DR   InterPro; IPR012287; Homeodomain-rel.
DR   InterPro; IPR010982; Lambda_DNA-bd.
DR   Gene3D; G3DSA:1.10.10.60; Homeodomain-rel; 1.
DR   Pfam; PF02376; CUT; 1.
DR   Pfam; PF00046; Homeobox; 1.
DR   SMART; SM00389; HOX; 1.
DR   SUPFAM; SSF46689; Homeodomain_like; 1.
DR   SUPFAM; SSF47413; Lambda_like_DNA; 1.
DR   PROSITE; PS51042; CUT; 1.
DR   PROSITE; PS00027; HOMEOBOX_1; FALSE_NEG.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
PE   2: Evidence at transcript level;
KW   Complete proteome; DNA-binding; Homeobox; Nucleus.
FT   CHAIN         1    495       Homeobox protein ceh-21.
FT                                /FTId=PRO_0000202408.
FT   DNA_BIND    284    370       CUT.
FT   DNA_BIND    389    449       Homeobox.
SQ   SEQUENCE   495 AA;  54823 MW;  C4FFA0984D0DAA08 CRC64;
     MSQQFQASSG TGSASLREFK TEHEDLREDL PYSTLRTLFG ITLDKDASQA LNIALLLYGH
     NYPQQVVPPE RNYAELDAQL ESVVLEDHTA ESTMEPGVSA TVTEQLEEKS DKSSDGDGTS
     KRLTRSLKSV ENETEEDHEE KEDEAPQSSR RESTRLKRKL LESQKTVQTT GNSSRASSKS
     QEKEVPGTKS QCAPKIRTTP EQSKAATKRQ SSTTVRASST CGSSVSSTST VSSPDYTAKK
     GRATETPKLE ELAPKKQSSA TPKPGGEVCV WDGVQIGDLS AQMNAQIGDD EELDTVDIAR
     RILSELKERC IPQTALAEKI LARSQGTLSD LLRMPKPWSV MKNGRATFQR MSNWLGLDPD
     VRRALCFLPK EDVARITGLD EPTPAKRKKT VKVIRLTFTE TQLKSLQKSF QQNHRPTREM
     RQKLSATLEL DFSTVGNFFM NSRRRLRIDQ QISRSSRSTG NGADTEDELD EEDVVVENVI
     ADATDASNQP GPSHL
//