ID A0A183VTS2_TRIRE Unreviewed; 1011 AA. AC A0A183VTS2; DT 07-SEP-2016, integrated into UniProtKB/TrEMBL. DT 07-SEP-2016, sequence version 1. DT 05-JUN-2019, entry version 27. DE RecName: Full=POU domain protein {ECO:0000256|RuleBase:RU361194}; GN ORFNames=TRE_LOCUS3878 {ECO:0000313|EMBL:VDP99757.1}; OS Trichobilharzia regenti (Nasal bird schistosome). OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; Strigeidida; OC Schistosomatoidea; Schistosomatidae; Trichobilharzia. OX NCBI_TaxID=157069 {ECO:0000313|Proteomes:UP000050795, ECO:0000313|WBParaSite:TRE_0000388101-mRNA-1}; RN [1] {ECO:0000313|Proteomes:UP000050795, ECO:0000313|WBParaSite:TRE_0000388101-mRNA-1} RP NUCLEOTIDE SEQUENCE. RG Helminth Genomes Consortium; RL Submitted (MAR-2015) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|WBParaSite:TRE_0000388101-mRNA-1} RP IDENTIFICATION. RG WormBaseParasite; RL Submitted (JUN-2016) to UniProtKB. RN [3] {ECO:0000313|EMBL:VDP99757.1} RP NUCLEOTIDE SEQUENCE. RG Pathogen Informatics; RL Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases. CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE- CC ProRule:PRU00108, ECO:0000256|RuleBase:RU000682, CC ECO:0000256|SAAS:SAAS00861387}. CC -!- SIMILARITY: Belongs to the POU transcription factor family. CC {ECO:0000256|RuleBase:RU361194, ECO:0000256|SAAS:SAAS00867718}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; UZAO01051570; VDP99757.1; -; Genomic_DNA. DR WBParaSite; TRE_0000388101-mRNA-1; TRE_0000388101-mRNA-1; TRE_0000388101. DR Proteomes; UP000050795; Genome assembly. DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell. DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule. DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro. DR CDD; cd00086; homeodomain; 1. DR InterPro; IPR009057; Homeobox-like_sf. DR InterPro; IPR001356; Homeobox_dom. DR InterPro; IPR010982; Lambda_DNA-bd_dom_sf. DR InterPro; IPR013847; POU. DR InterPro; IPR000327; POU_dom. DR Pfam; PF00046; Homeodomain; 1. DR Pfam; PF00157; Pou; 1. DR PRINTS; PR00028; POUDOMAIN. DR SMART; SM00389; HOX; 1. DR SMART; SM00352; POU; 1. DR SUPFAM; SSF46689; SSF46689; 1. DR SUPFAM; SSF47413; SSF47413; 1. DR PROSITE; PS50071; HOMEOBOX_2; 1. DR PROSITE; PS00035; POU_1; 1. DR PROSITE; PS00465; POU_2; 1. DR PROSITE; PS51179; POU_3; 1. PE 3: Inferred from homology; KW Complete proteome {ECO:0000313|Proteomes:UP000050795}; KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108, KW ECO:0000256|RuleBase:RU000682, ECO:0000256|SAAS:SAAS00861401}; KW Homeobox {ECO:0000256|PROSITE-ProRule:PRU00108, KW ECO:0000256|RuleBase:RU000682, ECO:0000256|SAAS:SAAS00861414}; KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108, KW ECO:0000256|RuleBase:RU000682, ECO:0000256|SAAS:SAAS00861421}; KW Reference proteome {ECO:0000313|Proteomes:UP000050795}; KW Transcription {ECO:0000256|RuleBase:RU361194}. FT DOMAIN 871 945 POU-specific. {ECO:0000259|PROSITE: FT PS51179}. FT DOMAIN 961 1011 Homeobox. {ECO:0000259|PROSITE:PS50071}. FT REGION 207 226 Disordered. {ECO:0000256|MobiDB-lite: FT A0A183VTS2}. FT REGION 833 870 Disordered. {ECO:0000256|MobiDB-lite: FT A0A183VTS2}. SQ SEQUENCE 1011 AA; 113249 MW; 1D0ED20320DDDEFF CRC64; MNKSTKRGTG TRNRLRHSIK DQVTVTQRNE KLNILPYHNN SINNSNENIH QWSSDLNNDA DSIFHPNRLL LPISRNSSEI ITSLQQNIFS HDFLMNHGVD INSLASCSSS SSSTFKKNMD NCTNSDIVNQ NPQWSSNESS FRKDFTSKPF NEQHQLLNFS GFPSCSTSSR SNPHDSGIIP LYVDLQNNES GVRSHPVEKQ IDELTNIHNN NNNSGANNNT NNPTEYFSQD NESYFDTSKS ILPTADCNQD LLSFQSSTTS LHNNMHPSID LFSCNTVDEA CVNRSKQQSI VNLQEVNEIN DFTDVNNNNN NNSNNQQSIY YDRINNTDKE VEQWKNILYR PTKSPYHTSH DLMKCTPDTN RIQTFVQSGE NLSNYSNFTD KTGFMSKLCN NTTDSDPTRI ISDQFHPDCN NSFPLKSTKL PIELSQNSST LMNSSPLVTS INQNEYLTSG VALSDNISNC LSPLRSNDWR SCKVSKQLNA ENSPMSLPLG STESLKTFDI SMNLVKENSQ YLSNSSQRLL SEDSGFNGTD YTESTQGYPS YLSSHINPIQ ESFCNQPCQY SQNQETCHYN QQANLLNLTN SSSELSNESA FQKYEWKSPN MMIMNHVSEL MNIYNSSNNY TSSNTPTTNI TTTATITTGT NSVINTTTTT TSVMTNSLEM GTNSNLHINN KNLSEKLNSH MNNDQQFNTL LHNNSNNPNL TESQSYLFPS LICNQNTTTN LIDSINSQST LNEHISLDPL QSLSTMPTTS SSLFNGMLHS LVLQPPTSMQ QLITSSTCTT NELIHECKLF NSSSKYQSEM ADELMNMTRT CSPSQVPGFI DSLNMNGVDY PQVGRNTENT NTTAMSNSSN NNGSGNGNNN NNNNSIIPGT GEYPSADDLE IFAKMFKQRR IKLGYTQADV GLALGTLYGN VFSQTTICRF EALQLSFKNM CKLKPLLQKW LHEADCSTGT TNNLDKITTQ GRKRKKRTSI EIGVKGILEN HFVKQPKPLA QDIIQLADVL GLEKEVVRVW F //