ID R9NCU4_9FIRM Unreviewed; 464 AA. AC R9NCU4; DT 24-JUL-2013, integrated into UniProtKB/TrEMBL. DT 24-JUL-2013, sequence version 1. DT 03-SEP-2014, entry version 6. DE SubName: Full=Uncharacterized protein; GN ORFNames=C817_00796; OS Dorea sp. 5-2. OC Bacteria; Firmicutes; Clostridia; Clostridiales; Lachnospiraceae; OC Dorea. OX NCBI_TaxID=1235798; RN [1] RP NUCLEOTIDE SEQUENCE. RC STRAIN=5-2; RG The Broad Institute Genomics Platform; RG The Broad Institute Genome Sequencing Center for Infectious Disease; RA Earl A., Xavier R., Elson C., Duck W., Walker B., Young S., Zeng Q., RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., RA Alvarado L., Arachchi H.M., Berlin A.M., Chapman S.B., RA Gainer-Dewar J., Goldberg J., Griggs A., Gujja S., Hansen M., RA Howarth C., Imamovic A., Ireland A., Larimer J., McCowan C., RA Murphy C., Pearson M., Poon T.W., Priest M., Roberts A., Saif S., RA Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C., Birren B.; RT "The Genome Sequence of Dorea bacterium 5-2."; RL Submitted (APR-2013) to the EMBL/GenBank/DDBJ databases. CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; ASTD01000018; EOS81139.1; -; Genomic_DNA. DR RefSeq; WP_016217566.1; NZ_KE159726.1. DR EnsemblBacteria; EOS81139; EOS81139; C817_00796. DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro. DR InterPro; IPR025647; DUF4356. DR InterPro; IPR001940; Peptidase_S1C. DR InterPro; IPR009003; Trypsin-like_Pept_dom. DR Pfam; PF14266; DUF4356; 1. DR PRINTS; PR00834; PROTEASES2C. DR SUPFAM; SSF50494; SSF50494; 1. PE 4: Predicted; SQ SEQUENCE 464 AA; 51610 MW; 048BEF80E41E2FFA CRC64; MTVNSLDTFL SVKGGFNGGR RDVYFVRALG CGSGMEDQIL AMDRRMGSKM QKKVFYTRMR LPSPLIPQEE MHFYSSQFEA WKAGRKLSVR NQAQDGRFCR VLGDACTETV EKYRLVKKNM TESIEKNFVI KLLYWTDRVL GCALADWNER KCFKILMEDV QKEQDYLFCY MLTLLGCDVL LLECRKDISA ADALKALSAE FRLGEFKDTL QLPEYMPCAA PSAVHTEHED RTAKEPSGPV RVVIPEKPGR RNSCTQMPMQ PTPQPAVSGK AAGNSEKSFE ELALLASSIV MIAVYDREGE PIATGSGIMI GRDGYILTNN HVTRGGCFFA VRIEDDDNIY KTNEIIKYNT VLDLAVIRID RRLAPLPFYR GKKLVRGQKV VAIGSPLGMF NSVSDGIISG FRNIDSVDMI QFTAPISHGS SGGALLNMQG EIIGISTAGI DEGQNINLAV NYEDIGMFVR GFTG //