ID A0A183VTS2_TRIRE Unreviewed; 1011 AA. AC A0A183VTS2; DT 07-SEP-2016, integrated into UniProtKB/TrEMBL. DT 07-SEP-2016, sequence version 1. DT 02-JUN-2021, entry version 37. DE RecName: Full=POU domain protein {ECO:0000256|RuleBase:RU361194}; GN ORFNames=TRE_LOCUS3878 {ECO:0000313|EMBL:VDP99757.1}; OS Trichobilharzia regenti (Nasal bird schistosome). OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda; OC Digenea; Strigeidida; Schistosomatoidea; Schistosomatidae; Trichobilharzia. OX NCBI_TaxID=157069 {ECO:0000313|Proteomes:UP000050795, ECO:0000313|WBParaSite:TRE_0000388101-mRNA-1}; RN [1] {ECO:0000313|WBParaSite:TRE_0000388101-mRNA-1} RP IDENTIFICATION. RG WormBaseParasite; RL Submitted (JUN-2016) to UniProtKB. RN [2] {ECO:0000313|EMBL:VDP99757.1, ECO:0000313|Proteomes:UP000280995} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG Pathogen Informatics; RL Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases. CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123, CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}. CC -!- SIMILARITY: Belongs to the POU transcription factor family. CC {ECO:0000256|RuleBase:RU361194}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; UZAO01051570; VDP99757.1; -; Genomic_DNA. DR WBParaSite; TRE_0000388101-mRNA-1; TRE_0000388101-mRNA-1; TRE_0000388101. DR Proteomes; UP000050795; Unplaced. DR Proteomes; UP000280995; Unassembled WGS sequence. DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell. DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW. DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro. DR CDD; cd00086; homeodomain; 1. DR Gene3D; 1.10.260.40; -; 1. DR InterPro; IPR009057; Homeobox-like_sf. DR InterPro; IPR001356; Homeobox_dom. DR InterPro; IPR010982; Lambda_DNA-bd_dom_sf. DR InterPro; IPR013847; POU. DR InterPro; IPR000327; POU_dom. DR Pfam; PF00046; Homeodomain; 1. DR Pfam; PF00157; Pou; 1. DR PRINTS; PR00028; POUDOMAIN. DR SMART; SM00389; HOX; 1. DR SMART; SM00352; POU; 1. DR SUPFAM; SSF46689; SSF46689; 1. DR SUPFAM; SSF47413; SSF47413; 1. DR PROSITE; PS50071; HOMEOBOX_2; 1. DR PROSITE; PS00035; POU_1; 1. DR PROSITE; PS00465; POU_2; 1. DR PROSITE; PS51179; POU_3; 1. PE 3: Inferred from homology; KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE- KW ProRule:PRU00108}; KW Homeobox {ECO:0000256|PROSITE-ProRule:PRU00108, KW ECO:0000256|RuleBase:RU000682}; KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108, KW ECO:0000256|RuleBase:RU000682}; KW Reference proteome {ECO:0000313|Proteomes:UP000280995}; KW Transcription {ECO:0000256|RuleBase:RU361194}. FT DOMAIN 871..945 FT /note="POU-specific" FT /evidence="ECO:0000259|PROSITE:PS51179" FT DOMAIN 961..1011 FT /note="Homeobox" FT /evidence="ECO:0000259|PROSITE:PS50071" FT REGION 207..226 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 833..870 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" SQ SEQUENCE 1011 AA; 113249 MW; 1D0ED20320DDDEFF CRC64; MNKSTKRGTG TRNRLRHSIK DQVTVTQRNE KLNILPYHNN SINNSNENIH QWSSDLNNDA DSIFHPNRLL LPISRNSSEI ITSLQQNIFS HDFLMNHGVD INSLASCSSS SSSTFKKNMD NCTNSDIVNQ NPQWSSNESS FRKDFTSKPF NEQHQLLNFS GFPSCSTSSR SNPHDSGIIP LYVDLQNNES GVRSHPVEKQ IDELTNIHNN NNNSGANNNT NNPTEYFSQD NESYFDTSKS ILPTADCNQD LLSFQSSTTS LHNNMHPSID LFSCNTVDEA CVNRSKQQSI VNLQEVNEIN DFTDVNNNNN NNSNNQQSIY YDRINNTDKE VEQWKNILYR PTKSPYHTSH DLMKCTPDTN RIQTFVQSGE NLSNYSNFTD KTGFMSKLCN NTTDSDPTRI ISDQFHPDCN NSFPLKSTKL PIELSQNSST LMNSSPLVTS INQNEYLTSG VALSDNISNC LSPLRSNDWR SCKVSKQLNA ENSPMSLPLG STESLKTFDI SMNLVKENSQ YLSNSSQRLL SEDSGFNGTD YTESTQGYPS YLSSHINPIQ ESFCNQPCQY SQNQETCHYN QQANLLNLTN SSSELSNESA FQKYEWKSPN MMIMNHVSEL MNIYNSSNNY TSSNTPTTNI TTTATITTGT NSVINTTTTT TSVMTNSLEM GTNSNLHINN KNLSEKLNSH MNNDQQFNTL LHNNSNNPNL TESQSYLFPS LICNQNTTTN LIDSINSQST LNEHISLDPL QSLSTMPTTS SSLFNGMLHS LVLQPPTSMQ QLITSSTCTT NELIHECKLF NSSSKYQSEM ADELMNMTRT CSPSQVPGFI DSLNMNGVDY PQVGRNTENT NTTAMSNSSN NNGSGNGNNN NNNNSIIPGT GEYPSADDLE IFAKMFKQRR IKLGYTQADV GLALGTLYGN VFSQTTICRF EALQLSFKNM CKLKPLLQKW LHEADCSTGT TNNLDKITTQ GRKRKKRTSI EIGVKGILEN HFVKQPKPLA QDIIQLADVL GLEKEVVRVW F //