ID A0A4R6R5H7_9RHIZ Unreviewed; 420 AA. AC A0A4R6R5H7; DT 31-JUL-2019, integrated into UniProtKB/TrEMBL. DT 31-JUL-2019, sequence version 1. DT 31-JUL-2019, entry version 1. DE SubName: Full=Cytosine deaminase {ECO:0000313|EMBL:TDP81191.1}; GN ORFNames=EDD54_4524 {ECO:0000313|EMBL:TDP81191.1}; OS Oharaeibacter diazotrophicus. OC Bacteria; Proteobacteria; Alphaproteobacteria; Rhizobiales; OC Methylocystaceae; Oharaeibacter. OX NCBI_TaxID=1920512 {ECO:0000313|EMBL:TDP81191.1, ECO:0000313|Proteomes:UP000294547}; RN [1] {ECO:0000313|EMBL:TDP81191.1, ECO:0000313|Proteomes:UP000294547} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=DSM 102969 {ECO:0000313|EMBL:TDP81191.1, RC ECO:0000313|Proteomes:UP000294547}; RA Goeker M.; RT "Genomic Encyclopedia of Type Strains, Phase IV (KMG-IV): sequencing RT the most valuable type-strain genomes for metagenomic binning, RT comparative biology and taxonomic classification."; RL Submitted (MAR-2019) to the EMBL/GenBank/DDBJ databases. CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:TDP81191.1}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; SNXY01000013; TDP81191.1; -; Genomic_DNA. DR Proteomes; UP000294547; Unassembled WGS sequence. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000294547}. SQ SEQUENCE 420 AA; 44023 MW; 20A1023F95C4713A CRC64; MSRFDHVFRA AALPGRDGLA DVAVAGGRIV DIGTGFTCEA GETDVGGRLL FAGFVETHIH LDKAGIVGRC RICAGTLAEA VAETARAKAA FTVEDVYERA ARVVEKAILA GTTRLRTFVE VDPRVDLRAF EAIRAVKARY AAAIEIEMCA FAQEGLTNEP ATEVLIDRAL ADGADLVGGC PYTDPDPAEH VRRIFALAER HGVAVDFHLD FDLDPTRSDL PTVIAETVAR GYGGRVSVGH VTKLSAVAPE VFAETGRRLA DAGVAVTVLP ATDLYLTGRG ADRLVPRGVT PAHRLAALGV TASIATNNVL NPFTPFGDAS LIRMANLYAT VAQAGTAEEL ATVFRMVSDM AARILSPTRG HGVAVGAPAD LVILDAPSPA AAVAEIATPR AAWKNGRPVF DRPAVARYWE AGAAGPLTTG //