ID A0A151M620_ALLMI Unreviewed; 539 AA. AC A0A151M620; DT 08-JUN-2016, integrated into UniProtKB/TrEMBL. DT 08-JUN-2016, sequence version 1. DT 16-JAN-2019, entry version 13. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:KYO19931.1}; GN ORFNames=Y1Q_0023991 {ECO:0000313|EMBL:KYO19931.1}; OS Alligator mississippiensis (American alligator). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae; OC Alligator. OX NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO19931.1}; RN [1] {ECO:0000313|EMBL:KYO19931.1, ECO:0000313|Proteomes:UP000050525} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO19931.1}; RX PubMed=22293439; DOI=10.1186/gb-2012-13-1-415; RA St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., RA Gongora J., Dalzell P., Moran C., Bed'hom B., Abzhanov A., RA Burgess S.C., Cooksey A.M., Castoe T.A., Crawford N.G., Densmore L.D., RA Drew J.C., Edwards S.V., Faircloth B.C., Fujita M.K., Greenwold M.J., RA Hoffmann F.G., Howard J.M., Iguchi T., Janes D.E., Khan S.Y., RA Kohno S., de Koning A.J., Lance S.L., McCarthy F.M., McCormack J.E., RA Merchant M.E., Peterson D.G., Pollock D.D., Pourmand N., Raney B.J., RA Roessler K.A., Sanford J.R., Sawyer R.H., Schmidt C.J., Triplett E.W., RA Tuberville T.D., Venegas-Anaya M., Howard J.T., Jarvis E.D., RA Guillette L.J.Jr., Glenn T.C., Green R.E., Ray D.A.; RT "Sequencing three crocodilian genomes to illuminate the evolution of RT archosaurs and amniotes."; RL Genome Biol. 13:415-415(2012). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:KYO19931.1}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; AKHW03006499; KYO19931.1; -; Genomic_DNA. DR RefSeq; XP_019334293.1; XM_019478748.1. DR GeneID; 102559537; -. DR KEGG; amj:102559537; -. DR KO; K09228; -. DR OrthoDB; 1318335at2759; -. DR Proteomes; UP000050525; Unassembled WGS sequence. DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro. DR InterPro; IPR036236; Znf_C2H2_sf. DR InterPro; IPR013087; Znf_C2H2_type. DR Pfam; PF00096; zf-C2H2; 15. DR SMART; SM00355; ZnF_C2H2; 18. DR SUPFAM; SSF57667; SSF57667; 10. DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 15. DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 18. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000050525}; KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042}; KW Reference proteome {ECO:0000313|Proteomes:UP000050525}; KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042}; KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}. FT DOMAIN 37 64 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 65 92 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 93 120 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 121 148 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 149 176 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 177 204 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 205 232 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 233 260 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 261 288 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 289 316 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 316 343 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 344 371 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 372 399 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 400 427 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 428 455 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 456 483 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 484 511 C2H2-type. {ECO:0000259|PROSITE:PS50157}. FT DOMAIN 512 539 C2H2-type. {ECO:0000259|PROSITE:PS50157}. SQ SEQUENCE 539 AA; 61413 MW; C5C07B21ABF9DE5E CRC64; MNQDRCIPRG RKTHHCTECR KDFICLQDLS QHECVRHHCT KCGKRFRQLF NLASHQCMHT REKPLQCSEC GKSFTYFSQL ARHQRIHTGE KPHQCLECGK SFANSSGLAQ HHRIHTGEKP YHCSECGKSF TYSSSLAQHQ RIHTGEKPHQ CSKCGKSFTL SSSLAQHQRI HTGEKPHQCL QCGKTFSQSS SLAQHQRIHT GEKPHHCSEC GKSFTYYFNL AQHQRIHTGE KPHQCSKCGK SFTQSSHLAQ HQRIHTGEKP HQCSKCGKSF TQSCSLARHQ RIHTGEKPHQ CSERGKSFAN SRSLDQHQRI HTQKLHQCSE CGKNFTQSYS LARHQRIHTG QKPHQCSECG KSFSQSSSLA QHQRIHTGEK PHQCSECGKS FTQSSTLAQH QLVHTGEKPH QCSVCAKSFT YSFNLAQHQR IHTGEKPYQC SECGKSFTHS SSRTHHQLIH MRQKPYLCSK FGKSLTVSSA LVHHQQIHTG EKPHQCSECG KSFTQSSHLS RHQLIHTGKK PHLCSECGKC FLHSFQLVQH QCIPIQGRP //