ID A0A3Y4Y0G8_ECOLX Unreviewed; 678 AA. AC A0A3Y4Y0G8; DT 17-JUN-2020, integrated into UniProtKB/TrEMBL. DT 17-JUN-2020, sequence version 1. DT 02-OCT-2024, entry version 16. DE SubName: Full=Sulfoquinovosidase {ECO:0000313|EMBL:HAN4354148.1}; DE EC=3.2.1.199 {ECO:0000313|EMBL:HAN4354148.1}; GN Name=yihQ {ECO:0000313|EMBL:HAN4354148.1}; GN ORFNames=G3W53_22465 {ECO:0000313|EMBL:NEN72824.1}, IFC14_002596 GN {ECO:0000313|EMBL:HAN4354148.1}; OS Escherichia coli. OC Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; OC Enterobacteriaceae; Escherichia. OX NCBI_TaxID=562 {ECO:0000313|EMBL:HAN4354148.1, ECO:0000313|Proteomes:UP000859822}; RN [1] {ECO:0000313|EMBL:HAN4354148.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=489-16 {ECO:0000313|EMBL:HAN4354148.1}; RX PubMed=30286803; DOI=10.1186/s13059-018-1540-z; RA Souvorov A., Agarwala R., Lipman D.J.; RT "SKESA: strategic k-mer extension for scrupulous assemblies."; RL Genome Biol. 19:153-153(2018). RN [2] {ECO:0000313|EMBL:NEN72824.1, ECO:0000313|Proteomes:UP000471360} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=8375wB1 {ECO:0000313|EMBL:NEN72824.1, RC ECO:0000313|Proteomes:UP000471360}; RA Subbiah M., Call D.; RL Submitted (FEB-2020) to the EMBL/GenBank/DDBJ databases. RN [3] {ECO:0000313|EMBL:HAN4354148.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=489-16 {ECO:0000313|EMBL:HAN4354148.1}; RG NCBI Pathogen Detection Project; RL Submitted (SEP-2020) to the EMBL/GenBank/DDBJ databases. CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 31 family. CC {ECO:0000256|ARBA:ARBA00007806, ECO:0000256|RuleBase:RU361185}. CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ CC whole genome shotgun (WGS) entry which is preliminary data. CC {ECO:0000313|EMBL:HAN4354148.1}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; DABUHV010000012; HAN4354148.1; -; Genomic_DNA. DR EMBL; JAAGYP010000044; NEN72824.1; -; Genomic_DNA. DR RefSeq; WP_000380821.1; NZ_VJVE01000007.1. DR Proteomes; UP000471360; Unassembled WGS sequence. DR Proteomes; UP000859822; Unassembled WGS sequence. DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro. DR GO; GO:0032450; F:maltose alpha-glucosidase activity; IEA:UniProtKB-EC. DR CDD; cd06594; GH31_glucosidase_YihQ; 1. DR CDD; cd14752; GH31_N; 1. DR Gene3D; 3.20.20.80; Glycosidases; 1. DR Gene3D; 2.60.40.1760; glycosyl hydrolase (family 31); 1. DR Gene3D; 2.60.40.1180; Golgi alpha-mannosidase II; 1. DR InterPro; IPR011013; Gal_mutarotase_sf_dom. DR InterPro; IPR048395; Glyco_hydro_31_C. DR InterPro; IPR025887; Glyco_hydro_31_N_dom. DR InterPro; IPR000322; Glyco_hydro_31_TIM. DR InterPro; IPR013780; Glyco_hydro_b. DR InterPro; IPR017853; Glycoside_hydrolase_SF. DR InterPro; IPR052990; Sulfoquinovosidase_GH31. DR InterPro; IPR044112; YihQ_TIM-like. DR PANTHER; PTHR46959; SULFOQUINOVOSIDASE; 1. DR PANTHER; PTHR46959:SF2; SULFOQUINOVOSIDASE; 1. DR Pfam; PF13802; Gal_mutarotas_2; 1. DR Pfam; PF01055; Glyco_hydro_31_2nd; 1. DR Pfam; PF21365; Glyco_hydro_31_3rd; 1. DR SUPFAM; SSF51445; (Trans)glycosidases; 1. DR SUPFAM; SSF74650; Galactose mutarotase-like; 1. DR SUPFAM; SSF51011; Glycosyl hydrolase domain; 1. PE 3: Inferred from homology; KW Glycosidase {ECO:0000256|RuleBase:RU361185, ECO:0000313|EMBL:HAN4354148.1}; KW Hydrolase {ECO:0000256|RuleBase:RU361185, ECO:0000313|EMBL:HAN4354148.1}. FT DOMAIN 92..162 FT /note="Glycoside hydrolase family 31 N-terminal" FT /evidence="ECO:0000259|Pfam:PF13802" FT DOMAIN 267..557 FT /note="Glycoside hydrolase family 31 TIM barrel" FT /evidence="ECO:0000259|Pfam:PF01055" FT DOMAIN 583..666 FT /note="Glycosyl hydrolase family 31 C-terminal" FT /evidence="ECO:0000259|Pfam:PF21365" SQ SEQUENCE 678 AA; 77296 MW; 6ABFAC81A8AA6789 CRC64; MDTPRPQLID FQFHQNNDSF TLRFQDRLIL IHSKDNPCLW IGSGIADIDM FRGNFSIKDK LQEKIALTDA IVSQSPDGWL IHFSRGSDIS ATLNISADDH GRLLLELQND NLNHNRIWLR LAAQPEDHIY GCGEQFSYFD LRGKPFPLWT SEQGVGRNKQ TYVTWQADCK ENAGGDYYWT FFPQPTFVST QKYYCHVDNS CYMNFDFSAP EYHELALWED KATLRFECAD TYISLLEKLT ALLGRQPELP DWIYDGVTLG IQGGTEVCQK KLDTMRNAGV KVNGIWAQDW SGIRMTSFGK RVMWNWKWNS ENYPQLDSRI KQWNKEGVQF LAYINPYVAS DKDLCEEAAK RGYLAKDAAG GDYLVEFGEF YGGVVDLTNP EAYAWFKEVI KKNMIELGCG GWMADFGEYL PTDTYLHNGV SAEIMHNAWP ALWAKCNYEA LEETGKLGEI LFFMRAGSTG SQKYSTMMWA GDQNVDWSLD DGLASVVPAA LSLAMTGHGL HHSDIGGYTT LFEMKRSKEL LLRWCDFSAF TPMMRTHEGN RPGDNWQFDG DAETIAHFAR MTTVFTTLKP YLKEAVALNA KSGLPVMRPL FLHYEDDAQT YTLKYQYLLG RDILVAPVHE EGRSDWTLYL PEDNWVHAWT GEAFRGGEVT VNAPIGKPPV FYRADSEWAA LFASLKSI //