ID A0A183VTS2_TRIRE Unreviewed; 1011 AA. AC A0A183VTS2; DT 07-SEP-2016, integrated into UniProtKB/TrEMBL. DT 07-SEP-2016, sequence version 1. DT 05-JUL-2017, entry version 10. DE RecName: Full=POU domain protein {ECO:0000256|RuleBase:RU361194}; OS Trichobilharzia regenti (Nasal bird schistosome). OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; Strigeidida; OC Schistosomatoidea; Schistosomatidae; Trichobilharzia. OX NCBI_TaxID=157069 {ECO:0000313|Proteomes:UP000050795, ECO:0000313|WBParaSite:TRE_0000388101-mRNA-1}; RN [1] {ECO:0000313|Proteomes:UP000050795, ECO:0000313|WBParaSite:TRE_0000388101-mRNA-1} RP NUCLEOTIDE SEQUENCE. RG Helminth Genomes Consortium; RL Submitted (MAR-2015) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|WBParaSite:TRE_0000388101-mRNA-1} RP IDENTIFICATION. RG WormBaseParasite; RL Submitted (JUN-2016) to UniProtKB. CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE- CC ProRule:PRU00530, ECO:0000256|RuleBase:RU000682, CC ECO:0000256|SAAS:SAAS00553137}. CC -!- SIMILARITY: Belongs to the POU transcription factor family. CC {ECO:0000256|RuleBase:RU361194, ECO:0000256|SAAS:SAAS00595129}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR WBParaSite; TRE_0000388101-mRNA-1; TRE_0000388101-mRNA-1; TRE_0000388101. DR Proteomes; UP000050795; Genome assembly. DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell. DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule. DR GO; GO:0003700; F:transcription factor activity, sequence-specific DNA binding; IEA:UniProtKB-UniRule. DR GO; GO:0006351; P:transcription, DNA-templated; IEA:UniProtKB-UniRule. DR InterPro; IPR009057; Homeobox-like. DR InterPro; IPR001356; Homeobox_dom. DR InterPro; IPR010982; Lambda_DNA-bd_dom. DR InterPro; IPR013847; POU. DR InterPro; IPR000327; POU_dom. DR Pfam; PF00046; Homeobox; 1. DR Pfam; PF00157; Pou; 1. DR PRINTS; PR00028; POUDOMAIN. DR SMART; SM00389; HOX; 1. DR SMART; SM00352; POU; 1. DR SUPFAM; SSF46689; SSF46689; 1. DR SUPFAM; SSF47413; SSF47413; 1. DR PROSITE; PS50071; HOMEOBOX_2; 1. DR PROSITE; PS00035; POU_1; 1. DR PROSITE; PS00465; POU_2; 1. DR PROSITE; PS51179; POU_3; 1. PE 3: Inferred from homology; KW Complete proteome {ECO:0000313|Proteomes:UP000050795}; KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00530, KW ECO:0000256|RuleBase:RU000682, ECO:0000256|SAAS:SAAS00045280}; KW Homeobox {ECO:0000256|RuleBase:RU000682, KW ECO:0000256|SAAS:SAAS00045306}; KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00530, KW ECO:0000256|RuleBase:RU000682, ECO:0000256|SAAS:SAAS00045291}; KW Reference proteome {ECO:0000313|Proteomes:UP000050795}; KW Transcription {ECO:0000256|PROSITE-ProRule:PRU00530, KW ECO:0000256|RuleBase:RU361194}; KW Transcription regulation {ECO:0000256|PROSITE-ProRule:PRU00530}. FT DOMAIN 871 945 POU-specific. {ECO:0000259|PROSITE: FT PS51179}. FT DOMAIN 961 1011 Homeobox DNA-binding. FT {ECO:0000259|PROSITE:PS50071}. SQ SEQUENCE 1011 AA; 113249 MW; 1D0ED20320DDDEFF CRC64; MNKSTKRGTG TRNRLRHSIK DQVTVTQRNE KLNILPYHNN SINNSNENIH QWSSDLNNDA DSIFHPNRLL LPISRNSSEI ITSLQQNIFS HDFLMNHGVD INSLASCSSS SSSTFKKNMD NCTNSDIVNQ NPQWSSNESS FRKDFTSKPF NEQHQLLNFS GFPSCSTSSR SNPHDSGIIP LYVDLQNNES GVRSHPVEKQ IDELTNIHNN NNNSGANNNT NNPTEYFSQD NESYFDTSKS ILPTADCNQD LLSFQSSTTS LHNNMHPSID LFSCNTVDEA CVNRSKQQSI VNLQEVNEIN DFTDVNNNNN NNSNNQQSIY YDRINNTDKE VEQWKNILYR PTKSPYHTSH DLMKCTPDTN RIQTFVQSGE NLSNYSNFTD KTGFMSKLCN NTTDSDPTRI ISDQFHPDCN NSFPLKSTKL PIELSQNSST LMNSSPLVTS INQNEYLTSG VALSDNISNC LSPLRSNDWR SCKVSKQLNA ENSPMSLPLG STESLKTFDI SMNLVKENSQ YLSNSSQRLL SEDSGFNGTD YTESTQGYPS YLSSHINPIQ ESFCNQPCQY SQNQETCHYN QQANLLNLTN SSSELSNESA FQKYEWKSPN MMIMNHVSEL MNIYNSSNNY TSSNTPTTNI TTTATITTGT NSVINTTTTT TSVMTNSLEM GTNSNLHINN KNLSEKLNSH MNNDQQFNTL LHNNSNNPNL TESQSYLFPS LICNQNTTTN LIDSINSQST LNEHISLDPL QSLSTMPTTS SSLFNGMLHS LVLQPPTSMQ QLITSSTCTT NELIHECKLF NSSSKYQSEM ADELMNMTRT CSPSQVPGFI DSLNMNGVDY PQVGRNTENT NTTAMSNSSN NNGSGNGNNN NNNNSIIPGT GEYPSADDLE IFAKMFKQRR IKLGYTQADV GLALGTLYGN VFSQTTICRF EALQLSFKNM CKLKPLLQKW LHEADCSTGT TNNLDKITTQ GRKRKKRTSI EIGVKGILEN HFVKQPKPLA QDIIQLADVL GLEKEVVRVW F //