ID   Q0JFA5_ORYSJ Unreviewed; 2422 AA.
AC   Q0JFA5;
DT   03-OCT-2006, integrated into UniProtKB/TrEMBL.
DT   03-OCT-2006, sequence version 1.
DT   08-APR-2008, entry version 17.
DE   Os04g0120000 protein.
GN   Name=Os04g0120000;
OS   Oryza sativa subsp. japonica (Rice).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BEP clade;
OC   Ehrhartoideae; Oryzeae; Oryza.
OX   NCBI_TaxID=39947;
RN   [1]
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=16100779; DOI=10.1038/nature03895;
RG   International rice genome sequencing project (IRGSP);
RT   "The map-based sequence of the rice genome.";
RL   Nature 436:793-800(2005).
RN   [2]
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=17210932; DOI=10.1101/gr.5509507;
RG   The rice annotation project (RAP);
RT   "Curated genome annotation of Oryza sativa ssp. japonica and
RT   comparative genome analysis with Arabidopsis thaliana.";
RL   Genome Res. 17:175-183(2007).
RN   [3]
RP   NUCLEOTIDE SEQUENCE.
RA   Ohyanagi H., Tanaka T., Sakai H., Shigemoto Y., Yamaguchi K.,
RA   Habara T., Fujii Y., Antonio B.A., Nagamura Y., Imanishi T., Ikeo K.,
RA   Itoh T., Gojobori T., Sasaki T.;
RT   "The Rice Annotation Project Database (RAP-DB): hub for Oryza sativa
RT   ssp. japonica genome information.";
RL   (er) Nucleic Acids Research 34, D741-D744 (2006).
RN   [4]
RP   NUCLEOTIDE SEQUENCE.
RG   IRGSP(International Rice Genome Sequencing Project);
RT   "Oryza sativa nipponbare(GA3) genomic DNA, chromosome 4.";
RL   Submitted (FEB-2005) to the EMBL/GenBank/DDBJ databases.
RN   [5]
RP   NUCLEOTIDE SEQUENCE.
RG   The Rice Annotation Project (RAP);
RT   "The First Rice Annotation Project Meeting (RAP1).";
RL   Submitted (FEB-2005) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AP008210; BAF13982.1; -; Genomic_DNA.
DR   RefSeq; NP_001052068.1; -.
DR   UniGene; Os.80136; -.
DR   GeneID; 4334972; -.
DR   KEGG; osa:4334972; -.
DR   GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR   GO; GO:0004289; F:subtilase activity; IEA:InterPro.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR000209; Pept_S8_S53.
DR   InterPro; IPR013103; RVT_2.
DR   Gene3D; G3DSA:3.40.50.200; Pept_S8_S53; 1.
DR   Pfam; PF00665; rve; 1.
DR   Pfam; PF07727; RVT_2; 1.
DR   PROSITE; PS50994; INTEGRASE; 1.
PE   4: Predicted;
KW   Aspartyl protease; Hydrolase; Metal-binding; Protease; Zinc;
KW   Zinc-finger.
SQ SEQUENCE 2422 AA; 270242 MW; EB120A2909FC01F9 CRC64; MRPAYVDRME RYFSMRSLKP PACDDLRLYI GSAGSQNFLS PGSRETLAAR LPKFPKSPSP SSPPPPAPPQ SRPAPRRRRR RRQDLSPLAV AVAAARTSPP SPSPSLPPGP LPPRRRRRRR QVAVATARNS PPSPLPSPPP GRPRRQVVPA TAGSSPKSPR PSPSPTPLPG PLPPRRRRRR RTDLSPLAVA IAAARSSPPP GRPRRRSMVV RRRSKRKAGR NRTEQEVEMD TATSAPERTE TSTSAGGSGK KRRGERSKNK LPKETYNVIA LDQDGKPIEP PIVRSKFSNA CGTLVRTRCP INVKLWETVD DNIKTLLWNE LQKYFVFPPG SEVRGRDYAL KKMGDRWRQW KSDLNRDYVQ KNLPPFTDYG HISQADWDTF VADRTTAEAL ALRKKMSELA KKNKYPHRLG SSGYAGHVDQ WREIEQRFAA AGKPLLVDPM VERSKNWVWA RSTGQVSDEG DILFETPDIE EVTTNLQQIV EKERSGQFVP RRERDQLTAA LGTAEHSGRV RGLSSKTSWK VGFPQDAPSY KKRDKYKEQL SDKIYAQVKE HFYSLAAENP TAFPRLFPDS QQPTQSAQQT TNVPSSVGSV QTSTFPVDSI TGPTPCSLVV PIGRAGKTKE VATGLAIPGR QFHNTAIPED YARVQVAKVH SDHVSLELDI PAPEGIELLG DAVNQFILWH RRDIILSAAV LAAGSSTPSS SQAMTAAAPA PPSPPEPPSP RHPPSPPPLR SPPRQPTPPP SPSQQPPLPT PQPVQASPTS PTKQHAPPAP PSVQTSPPTP QSALVEEVHI PDGTTSEPKS NTLEPRRIIP KLISTYDPKE IDKDKEKFMF SAFRNSEKRK ELAHVLSDSQ KSVLAAQDEV QSWLSADVPE TYEYGKPFLP TYLMNKLPWE MRVMHEWYMK ASRKGLGFIS VAVPEGAFMS GPNGIFFISF QDLYALYKLD KMDVNLVAAF CLMQFHEADR TGAKVGYVDP TRICKTQHTI ELRQDCEQLV GKTPEEKEEY VKTLHKRKKL EVATYLAIAM LAHADKDVLM VPYQFTAHKY YRKRGGPVHI PSQKKLSVRT GWPCYKQPPG TNLCGYYVCE MLRVNGRYKT TSNRIPEIPY IAQRFNHTTI LNVAADLCRF IRRDVCNARG VTPESASFAD TGYDPPPKKW KGICQVGPSF EAISCNRKFI GARWYIDDEI LSSISDNEVL SPRDVEGHGT HTASTAGGNI IHNVSFLGLA AGTVRGGAPR ARLAIYKACW SGYGCSGATV LKAMDDAVYD GVDVLSLSIG GTKEDVGTLH VVANGISVVY AGGNDGPIAQ TVENQSPWLV TVAATTIDRS LPVVITLGNG EKLVAQSFVL LETASQFSEI QKYTDEDWDH CGAQVHSSKM VTCGSQIRNH DNRHCDFTMA DFADALRPDK FTGVHFKRWQ IRVTLWLTAM KCFWVARGST VLMGNGSHAS VHGVGTVDLK FTSGKIVQLK NVQHVPSIDR NLVSGSRLTR DGFKLVFESN KVVVSKHGYF IGKGYECGGL FRFSLSDFCN KSVNHICGSV DDEANVWHSR LCHINFGLMS RLSSMCLIPK FSIVKGSKCH SCVQSKQPRK PHKAAEERNL APLELLHSDL CEMNGVLTKG GKRYFMTFID DATRFCYVYL LKKKDEALDY FKIYKAEVEN QLDRKIKRLR SDRGGEFFSN EFDLFCEEHG IIHERTPPYS PESNGIAERK NCTLTDLVNA MLDTAGLPKA WWGEVLLTSN HVLNRVPNRN KDKTPYEIWI GRKPSLSYLR 
TWGCLAKVNV PITKKRKLGP KTVDCVFLGY AHHSIAYRFL IVKSEVLDMH VGIIMESRDA TFFESFFPMK DTHSGSNQPS EIIPSSIIPP EQTEHTHELV SEEDVSEAPR RSKRQRTAKS FGDDFTVYLV DDTPKSISEA YASPDADYWK EAVRSEMDSI IANGTWEVTE QPYGCKPVGC KWVFKKKLRP DGTIEKYKAR LVAKGYTQKE GEDFFDTYSP VARLTTIRVL LSLAASHGLL VHQMDVKTAF LNGELDEEIY MDQPDGFVVE GQEGKVCKLL KSLYGLKQAP KQWHEKFDKT LTSAGFGVNE ADKCVYYRHG GGEGVILCLY VDDILIFGTN LEVINEVKSF LSQNFDMKDL GVADVILNIK LIRGENGITL LQSHYVEKIL NRFGYIDSKP SPTPYDPSLL LRKNKRIARN QLEYSQIIGS LMYLASATRP DISFAVSKLS RFTSNPGDDH WRALERVMRY LKGTVELGLH YTGYPAVLEG YSDSNWISDV DEIKATSGYV FTLGGGAVSW RSCKQTILTR STMEAELTAL DTATVEAEWL RDLLMDLPVV EKPVPAILMN CDNQTVIVKV NSSKDNMKSS RHVKRRLKSV RKLRNSGVIT LDYIQTARNL ADPFTKGLSR NVIDNASKEM GLRPIAFLKE LTFVSLTVGR SL //