ID Q6WNT3_TAKRU Unreviewed; 408 AA. AC Q6WNT3; DT 05-JUL-2004, integrated into UniProtKB/TrEMBL. DT 05-JUL-2004, sequence version 1. DT 11-DEC-2019, entry version 77. DE RecName: Full=Transcription factor SOX {ECO:0000256|PIRNR:PIRNR038098}; GN Name=Sox12 {ECO:0000313|EMBL:AAQ18503.1}; GN Synonyms=sox4 {ECO:0000313|Ensembl:ENSTRUP00000053530}; OS Takifugu rubripes (Japanese pufferfish) (Fugu rubripes). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; Takifugu. OX NCBI_TaxID=31033 {ECO:0000313|EMBL:AAQ18503.1}; RN [1] {ECO:0000313|EMBL:AAQ18503.1} RP NUCLEOTIDE SEQUENCE. RX PubMed=15019997; DOI=10.1016/j.gene.2003.12.008; RA Koopman P., Schepers G., Brenner S., Venkatesh B.; RT "Origin and diversity of the SOX transcription factor gene family: genome- RT wide analysis in Fugu rubripes."; RL Gene 328:177-186(2004). RN [2] {ECO:0000313|Ensembl:ENSTRUP00000053530, ECO:0000313|Proteomes:UP000005226} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21551351; RA Kai W., Kikuchi K., Tohari S., Chew A.K., Tay A., Fujiwara A., Hosoya S., RA Suetake H., Naruse K., Brenner S., Suzuki Y., Venkatesh B.; RT "Integration of the genetic map and genome assembly of fugu facilitates RT insights into distinct features of genome evolution in teleosts and RT mammals."; RL Genome Biol. Evol. 3:424-442(2011). RN [3] {ECO:0000313|Ensembl:ENSTRUP00000053530} RP IDENTIFICATION. RG Ensembl; RL Submitted (OCT-2018) to UniProtKB. CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PIRNR:PIRNR038098}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; AY277960; AAQ18503.1; -; Genomic_DNA. DR RefSeq; XP_011603774.1; XM_011605472.1. DR Ensembl; ENSTRUT00000020811; ENSTRUP00000053530; ENSTRUG00000008283. DR GeneID; 101072693; -. DR KEGG; tru:101072693; -. DR CTD; 6659; -. DR GeneTree; ENSGT00940000161470; -. DR KO; K23581; -. DR OrthoDB; 1186233at2759; -. DR Proteomes; UP000005226; Chromosome 7. DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell. DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule. DR GO; GO:0061386; P:closure of optic fissure; IEA:Ensembl. DR GO; GO:0031018; P:endocrine pancreas development; IEA:Ensembl. DR GO; GO:0045879; P:negative regulation of smoothened signaling pathway; IEA:Ensembl. DR Gene3D; 1.10.30.10; -; 1. DR InterPro; IPR009071; HMG_box_dom. DR InterPro; IPR036910; HMG_box_dom_sf. DR InterPro; IPR017386; SOX-11/4. DR Pfam; PF00505; HMG_box; 1. DR PIRSF; PIRSF038098; SOX-12/11/4a; 1. DR SMART; SM00398; HMG; 1. DR SUPFAM; SSF47095; SSF47095; 1. DR PROSITE; PS50118; HMG_BOX_2; 1. PE 4: Predicted; KW DNA-binding {ECO:0000256|PIRNR:PIRNR038098, ECO:0000256|PROSITE- KW ProRule:PRU00267, ECO:0000256|SAAS:SAAS00879239}; KW Nucleus {ECO:0000256|PIRNR:PIRNR038098, ECO:0000256|PROSITE- KW ProRule:PRU00267}; Reference proteome {ECO:0000313|Proteomes:UP000005226}; KW Transcription {ECO:0000256|PIRNR:PIRNR038098}; KW Transcription regulation {ECO:0000256|PIRNR:PIRNR038098}. FT DOMAIN 67..135 FT /note="HMG box" FT /evidence="ECO:0000259|PROSITE:PS50118" FT DNA_BIND 67..135 FT /note="HMG box" FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267" FT REGION 29..56 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 136..354 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 145..225 FT /note="Polar" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 243..272 FT /note="Polar" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 288..332 FT /note="Polar" FT /evidence="ECO:0000256|SAM:MobiDB-lite" SQ SEQUENCE 408 AA; 43521 MW; 3CFF40AB77294E64 CRC64; MVQKTSHTES TAEALSFFAV DSSSDSGTCM DLDPAASPLS PGSTASSTAG DKLAEDPAWC KTPSGHIKRP MNAFMVWSQI ERRKIMEQSP DMHNAEISKR LGKRWKLLRD SDKIPFIKEA ERLRLKHMAD YPDYKYRPRK KVKSSASKPG SSGEKGEKLH SSSINTSTKT SSSSRKNGPK SPSSSKPHKS LFGSSSSTKA SPFASEHQSE HHNSLYKSKS VSSAAKQIPD GKKPKRMYVY GSSAANLSVS PASSVVVPAS PTLSSSADSS DPLSLYEDAG SGREDGAESP GSSGSQGGSS VRQGGHTYSS RRASSPTPSG SHSSASSHSS SSSSEDEEFE DINPSPSFDS MSLGSFGSSV LDRDLDLNFE SGSGGSHFEF PDYCTPEVSE MISGDWLEST ISNLVFTY //