ID   Q6WNT3_TAKRU            Unreviewed;       408 AA.
AC   Q6WNT3;
DT   05-JUL-2004, integrated into UniProtKB/TrEMBL.
DT   05-JUL-2004, sequence version 1.
DT   16-OCT-2019, entry version 76.
DE   RecName: Full=Transcription factor SOX {ECO:0000256|PIRNR:PIRNR038098};
GN   Name=Sox12 {ECO:0000313|EMBL:AAQ18503.1};
GN   Synonyms=sox4 {ECO:0000313|Ensembl:ENSTRUP00000053530};
OS   Takifugu rubripes (Japanese pufferfish) (Fugu rubripes).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae;
OC   Takifugu.
OX   NCBI_TaxID=31033 {ECO:0000313|EMBL:AAQ18503.1};
RN   [1] {ECO:0000313|EMBL:AAQ18503.1}
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=15019997; DOI=10.1016/j.gene.2003.12.008;
RA   Koopman P., Schepers G., Brenner S., Venkatesh B.;
RT   "Origin and diversity of the SOX transcription factor gene family:
RT   genome-wide analysis in Fugu rubripes.";
RL   Gene 328:177-186(2004).
RN   [2] {ECO:0000313|Ensembl:ENSTRUP00000053530, ECO:0000313|Proteomes:UP000005226}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=21551351;
RA   Kai W., Kikuchi K., Tohari S., Chew A.K., Tay A., Fujiwara A.,
RA   Hosoya S., Suetake H., Naruse K., Brenner S., Suzuki Y., Venkatesh B.;
RT   "Integration of the genetic map and genome assembly of fugu
RT   facilitates insights into distinct features of genome evolution in
RT   teleosts and mammals.";
RL   Genome Biol. Evol. 3:424-442(2011).
RN   [3] {ECO:0000313|Ensembl:ENSTRUP00000053530}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (OCT-2018) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PIRNR:PIRNR038098}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AY277960; AAQ18503.1; -; Genomic_DNA.
DR   RefSeq; XP_011603774.1; XM_011605472.1.
DR   Ensembl; ENSTRUT00000020811; ENSTRUP00000053530; ENSTRUG00000008283.
DR   GeneID; 101072693; -.
DR   KEGG; tru:101072693; -.
DR   CTD; 6659; -.
DR   GeneTree; ENSGT00940000161470; -.
DR   KO; K23581; -.
DR   OrthoDB; 1186233at2759; -.
DR   Proteomes; UP000005226; Chromosome 7.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0061386; P:closure of optic fissure; IEA:Ensembl.
DR   GO; GO:0031018; P:endocrine pancreas development; IEA:Ensembl.
DR   GO; GO:0045879; P:negative regulation of smoothened signaling pathway; IEA:Ensembl.
DR   Gene3D; 1.10.30.10; -; 1.
DR   InterPro; IPR009071; HMG_box_dom.
DR   InterPro; IPR036910; HMG_box_dom_sf.
DR   InterPro; IPR017386; SOX-11/4.
DR   Pfam; PF00505; HMG_box; 1.
DR   PIRSF; PIRSF038098; SOX-12/11/4a; 1.
DR   SMART; SM00398; HMG; 1.
DR   SUPFAM; SSF47095; SSF47095; 1.
DR   PROSITE; PS50118; HMG_BOX_2; 1.
PE   4: Predicted;
KW   Complete proteome {ECO:0000313|Proteomes:UP000005226};
KW   DNA-binding {ECO:0000256|PIRNR:PIRNR038098, ECO:0000256|PROSITE-
KW   ProRule:PRU00267, ECO:0000256|SAAS:SAAS00879239};
KW   Nucleus {ECO:0000256|PIRNR:PIRNR038098, ECO:0000256|PROSITE-
KW   ProRule:PRU00267};
KW   Reference proteome {ECO:0000313|Proteomes:UP000005226};
KW   Transcription {ECO:0000256|PIRNR:PIRNR038098};
KW   Transcription regulation {ECO:0000256|PIRNR:PIRNR038098}.
FT   DOMAIN       67    135       HMG box. {ECO:0000259|PROSITE:PS50118}.
FT   DNA_BIND     67    135       HMG box. {ECO:0000256|PROSITE-ProRule:
FT                                PRU00267}.
FT   REGION       29     56       Disordered. {ECO:0000256|SAM:MobiDB-
FT                                lite}.
FT   REGION      136    354       Disordered. {ECO:0000256|SAM:MobiDB-
FT                                lite}.
FT   COMPBIAS    145    225       Polar. {ECO:0000256|SAM:MobiDB-lite}.
FT   COMPBIAS    243    272       Polar. {ECO:0000256|SAM:MobiDB-lite}.
FT   COMPBIAS    288    332       Polar. {ECO:0000256|SAM:MobiDB-lite}.
SQ   SEQUENCE   408 AA;  43521 MW;  3CFF40AB77294E64 CRC64;
     MVQKTSHTES TAEALSFFAV DSSSDSGTCM DLDPAASPLS PGSTASSTAG DKLAEDPAWC
     KTPSGHIKRP MNAFMVWSQI ERRKIMEQSP DMHNAEISKR LGKRWKLLRD SDKIPFIKEA
     ERLRLKHMAD YPDYKYRPRK KVKSSASKPG SSGEKGEKLH SSSINTSTKT SSSSRKNGPK
     SPSSSKPHKS LFGSSSSTKA SPFASEHQSE HHNSLYKSKS VSSAAKQIPD GKKPKRMYVY
     GSSAANLSVS PASSVVVPAS PTLSSSADSS DPLSLYEDAG SGREDGAESP GSSGSQGGSS
     VRQGGHTYSS RRASSPTPSG SHSSASSHSS SSSSEDEEFE DINPSPSFDS MSLGSFGSSV
     LDRDLDLNFE SGSGGSHFEF PDYCTPEVSE MISGDWLEST ISNLVFTY
//