ID   A0A6A4JSK3_APOLU        Unreviewed;      1000 AA.
AC   A0A6A4JSK3;
DT   17-JUN-2020, integrated into UniProtKB/TrEMBL.
DT   17-JUN-2020, sequence version 1.
DT   12-OCT-2022, entry version 7.
DE   RecName: Full=Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM {ECO:0000256|Google:UnProtein};
GN   ORFNames=GE061_16101 {ECO:0000313|EMBL:KAE9436380.1};
OS   Apolygus lucorum (Plant bug) (Lygocoris lucorum).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Paraneoptera; Hemiptera; Heteroptera; Panheteroptera;
OC   Cimicomorpha; Miridae; Mirini; Apolygus.
OX   NCBI_TaxID=248454 {ECO:0000313|EMBL:KAE9436380.1};
RN   [1] {ECO:0000313|EMBL:KAE9436380.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=12Hb {ECO:0000313|EMBL:KAE9436380.1};
RA   Wang G.;
RT   "Apolygus lucorum genome provides insights into omnivorousness and
RT   mesophyll feeding.";
RL   Submitted (NOV-2019) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KAE9436380.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; WIXP01000005; KAE9436380.1; -; Genomic_DNA.
DR   GO; GO:0071897; P:DNA biosynthetic process; IEA:UniProt.
DR   InterPro; IPR043502; DNA/RNA_pol_sf.
DR   InterPro; IPR000477; RT_dom.
DR   InterPro; IPR013087; Znf_C2H2_type.
DR   Pfam; PF00078; RVT_1; 1.
DR   SUPFAM; SSF56672; SSF56672; 1.
DR   PROSITE; PS50878; RT_POL; 1.
DR   PROSITE; PS00028; ZINC_FINGER_C2H2_1; 1.
DR   PROSITE; PS50157; ZINC_FINGER_C2H2_2; 1.
PE   4: Predicted;
KW   Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW   Transposable element {ECO:0000256|ARBA:ARBA00022464};
KW   Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW   Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT   DOMAIN          40..63
FT                   /note="C2H2-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50157"
FT   DOMAIN          313..589
FT                   /note="Reverse transcriptase"
FT                   /evidence="ECO:0000259|PROSITE:PS50878"
FT   REGION          1..36
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1..17
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1000 AA;  108636 MW;  168F93FC8E1F1ECA CRC64;
     MADKRGMADR GGRESRRGGT DGGGGAQRGG ETDRGASLNH QCAHCERAFT TRSGLGLHIR
     RKHPVAANDA IDIERIQHRK LSWQIAVGPS PASPIADVEQ QPAIQYPSPG TSGLLADLKT
     EVLRLGGQVG RIEGYESTYL QDLVRRAAEG LDNLEDEVAA YLGRIFPRVG DATVTQHQGG
     PPPRVVLRGG RRAAKQARRR EYHRVQSLWR KHPARVAAEV LEEPGENQVQ PPLQEFVDFW
     EPLLSAPSHD PGERVVPPVS TRDSLRSILC SPVLLDEAHK CRVKNSTSAG PDGVSARRWN
     KAPKTIKLLI LNLLLLAGRP PKALTRTRTV FIPKKGGSGP GGYRPISVSS VVFRHFQKVL
     ASRLQVAGVI GDVQRAFRPA DGTAENLTVL QTVISLARIK RRQLHLAALD VAKAFDTVSH
     LALTDCLRGV GAPSRLIEYV ACLYQEGVTV LEAGGEVSGD IRIGRGVRQG DPLSPLLFNL
     VVDTALGKLP TEVGFDLAPG VRVGALAFAD DVILLAETRE GLQIALNAFA KQLAGCGLVV
     NPDKCGSVSL VPSGRQHKVK VVDGGFVVGQ SPIPSRSVLE VWRYLGVDFM GVATITTSRV
     DLGRALARVT KAPLKPQQKL RLLRTYLLPK LTYGLVFGRL TAGRLQQLDR EVRSAVRSWL
     KFPPGVPSAY FHAPVKSGGL GIVSLSACIP SLRRRRLLAL QGSSWEVARA AADLDFVRQQ
     LAWCDRATPS APKPSSSAEF AAALHESVDG KELRQCSESL VSSQWVDWAS EGIGARDYRH
     FHAIRVASLP TAVRCSRGSR GTTLPLCRAC RSGNEHLYHV VQQCPRTHGG RILRHDAVAK
     QIAGALSSSG WDVERERLYH MPSDQGKKPD IVAVKPDQSC IILDVQVVNG SLDMEQTWRA
     KIRKYDRNDL RQAVSELKGV APNNIRVMAA TLSWRGVWCG KSATELRELG ISVGVLRGIT
     TRVLLGSYLN WWSFYRGTSL HPSLPPPPSS SSLIPRAGVG
//