ID ZFHX2_HUMAN Reviewed; 2572 AA. AC Q9C0A1; Q9UPU6; DT 12-FEB-2003, integrated into UniProtKB/Swiss-Prot. DT 13-JUL-2010, sequence version 3. DT 02-OCT-2024, entry version 182. DE RecName: Full=Zinc finger homeobox protein 2; DE AltName: Full=Zinc finger homeodomain protein 2; DE Short=ZFH-2; GN Name=ZFHX2; Synonyms=KIAA1056, KIAA1762, ZNF409; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; OC Homo. OX NCBI_TaxID=9606; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2), AND NUCLEOTIDE SEQUENCE RP [LARGE SCALE MRNA] OF 1113-2572 (ISOFORM 1). RC TISSUE=Brain; RX PubMed=11214970; DOI=10.1093/dnares/7.6.347; RA Nagase T., Kikuno R., Hattori A., Kondo Y., Okumura K., Ohara O.; RT "Prediction of the coding sequences of unidentified human genes. XIX. The RT complete sequences of 100 new cDNA clones from brain which code for large RT proteins in vitro."; RL DNA Res. 7:347-355(2000). RN [2] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=12508121; DOI=10.1038/nature01348; RA Heilig R., Eckenberg R., Petit J.-L., Fonknechten N., Da Silva C., RA Cattolico L., Levy M., Barbe V., De Berardinis V., Ureta-Vidal A., RA Pelletier E., Vico V., Anthouard V., Rowen L., Madan A., Qin S., Sun H., RA Du H., Pepin K., Artiguenave F., Robert C., Cruaud C., Bruels T., RA Jaillon O., Friedlander L., Samson G., Brottier P., Cure S., Segurens B., RA Aniere F., Samain S., Crespeau H., Abbasi N., Aiach N., Boscus D., RA Dickhoff R., Dors M., Dubois I., Friedman C., Gouyvenoux M., James R., RA Madan A., Mairey-Estrada B., Mangenot S., Martins N., Menard M., Oztas S., RA Ratcliffe A., Shaffer T., Trask B., Vacherie B., Bellemere C., Belser C., RA Besnard-Gonnet M., Bartol-Mavel D., Boutard M., Briez-Silla S., RA Combette S., Dufosse-Laurent V., Ferron C., Lechaplais C., Louesse C., RA Muselet D., Magdelenat G., Pateau E., Petit E., Sirvain-Trukniewicz P., RA Trybou A., Vega-Czarny N., Bataille E., Bluet E., Bordelais I., Dubois M., RA Dumont C., Guerin T., Haffray S., Hammadi R., Muanga J., Pellouin V., RA Robert D., Wunderle E., Gauguet G., Roy A., Sainte-Marthe L., Verdier J., RA Verdier-Discala C., Hillier L.W., Fulton L., McPherson J., Matsuda F., RA Wilson R., Scarpelli C., Gyapay G., Wincker P., Saurin W., Quetier F., RA Waterston R., Hood L., Weissenbach J.; RT "The DNA sequence and analysis of human chromosome 14."; RL Nature 421:601-607(2003). RN [3] RP FUNCTION, SUBCELLULAR LOCATION, INVOLVEMENT IN MARSIS, AND VARIANT MARSIS RP LYS-1913. RX PubMed=29253101; DOI=10.1093/brain/awx326; RA Habib A.M., Matsuyama A., Okorokov A.L., Santana-Varela S., Bras J.T., RA Aloisi A.M., Emery E.C., Bogdanov Y.D., Follenfant M., Gossage S.J., RA Gras M., Humphrey J., Kolesnikov A., Le Cann K., Li S., Minett M.S., RA Pereira V., Ponsolles C., Sikandar S., Torres J.M., Yamaoka K., Zhao J., RA Komine Y., Yamamori T., Maniatis N., Panov K.I., Houlden H., Ramirez J.D., RA Bennett D.L.H., Marsili L., Bachiocco V., Wood J.N., Cox J.J.; RT "A novel human pain insensitivity disorder caused by a point mutation in RT ZFHX2."; RL Brain 141:365-376(2018). CC -!- FUNCTION: Transcriptional regulator that is critical for the regulation CC of pain perception and processing of noxious stimuli. CC {ECO:0000269|PubMed:29253101}. CC -!- INTERACTION: CC Q9C0A1; O15354: GPR37; NbExp=3; IntAct=EBI-25850811, EBI-15639515; CC Q9C0A1; P28799-2: GRN; NbExp=3; IntAct=EBI-25850811, EBI-25860013; CC Q9C0A1; P10636: MAPT; NbExp=3; IntAct=EBI-25850811, EBI-366182; CC Q9C0A1; P10636-6: MAPT; NbExp=3; IntAct=EBI-25850811, EBI-7796455; CC Q9C0A1; A0A6Q8PF08: PMP22; NbExp=3; IntAct=EBI-25850811, EBI-50433196; CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000269|PubMed:29253101}. CC -!- ALTERNATIVE PRODUCTS: CC Event=Alternative splicing; Named isoforms=2; CC Name=1; CC IsoId=Q9C0A1-1; Sequence=Displayed; CC Name=2; CC IsoId=Q9C0A1-2; Sequence=VSP_039496, VSP_039497; CC -!- DISEASE: Marsili syndrome (MARSIS) [MIM:147430]: An autosomal dominant CC disorder characterized by congenital pain insensitivity. Painless CC cutaneous thermal burns and bone fractures are present in affected CC individuals. Corneal reflex is absent, sweating is decreased or absent. CC Patients have normal cognitive abilities, and display no evidence of CC distal weakness. {ECO:0000269|PubMed:29253101}. Note=The disease may be CC caused by variants affecting the gene represented in this entry. CC -!- SEQUENCE CAUTION: CC Sequence=BAA83008.2; Type=Erroneous initiation; Note=Extended N-terminus.; Evidence={ECO:0000305}; CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; AB028979; BAA83008.2; ALT_INIT; mRNA. DR EMBL; AB051549; BAB21853.1; -; mRNA. DR EMBL; AL132855; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AL135999; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR CCDS; CCDS55907.1; -. [Q9C0A1-1] DR RefSeq; NP_207646.2; NM_033400.2. [Q9C0A1-1] DR RefSeq; XP_011535547.1; XM_011537245.2. [Q9C0A1-1] DR RefSeq; XP_011535548.1; XM_011537246.2. DR RefSeq; XP_011535549.1; XM_011537247.2. DR RefSeq; XP_016877203.1; XM_017021714.1. DR AlphaFoldDB; Q9C0A1; -. DR BioGRID; 124533; 8. DR IntAct; Q9C0A1; 5. DR STRING; 9606.ENSP00000413418; -. DR GlyGen; Q9C0A1; 1 site, 1 O-linked glycan (1 site). DR iPTMnet; Q9C0A1; -. DR PhosphoSitePlus; Q9C0A1; -. DR BioMuta; ZFHX2; -. DR DMDM; 300669698; -. DR jPOST; Q9C0A1; -. DR MassIVE; Q9C0A1; -. DR PaxDb; 9606-ENSP00000413418; -. DR PeptideAtlas; Q9C0A1; -. DR ProteomicsDB; 79976; -. [Q9C0A1-1] DR ProteomicsDB; 79977; -. [Q9C0A1-2] DR Antibodypedia; 51729; 39 antibodies from 13 providers. DR DNASU; 85446; -. DR Ensembl; ENST00000419474.5; ENSP00000413418.2; ENSG00000136367.15. [Q9C0A1-1] DR GeneID; 85446; -. DR KEGG; hsa:85446; -. DR MANE-Select; ENST00000419474.5; ENSP00000413418.2; NM_033400.3; NP_207646.2. DR UCSC; uc010tno.2; human. [Q9C0A1-1] DR AGR; HGNC:20152; -. DR CTD; 85446; -. DR DisGeNET; 85446; -. DR GeneCards; ZFHX2; -. DR HGNC; HGNC:20152; ZFHX2. DR HPA; ENSG00000136367; Tissue enhanced (testis). DR MalaCards; ZFHX2; -. DR MIM; 147430; phenotype. DR MIM; 617828; gene. DR neXtProt; NX_Q9C0A1; -. DR OpenTargets; ENSG00000136367; -. DR Orphanet; 653728; Congenital insensitivity to pain syndrome, Marsili type. DR VEuPathDB; HostDB:ENSG00000136367; -. DR eggNOG; KOG1146; Eukaryota. DR GeneTree; ENSGT00940000160537; -. DR HOGENOM; CLU_001401_1_0_1; -. DR InParanoid; Q9C0A1; -. DR OMA; PDTHLPQ; -. DR OrthoDB; 4180951at2759; -. DR PhylomeDB; Q9C0A1; -. DR TreeFam; TF323288; -. DR PathwayCommons; Q9C0A1; -. DR SignaLink; Q9C0A1; -. DR BioGRID-ORCS; 85446; 16 hits in 1150 CRISPR screens. DR ChiTaRS; ZFHX2; human. DR GenomeRNAi; 85446; -. DR Pharos; Q9C0A1; Tdark. DR PRO; PR:Q9C0A1; -. DR Proteomes; UP000005640; Chromosome 14. DR RNAct; Q9C0A1; protein. DR Bgee; ENSG00000136367; Expressed in buccal mucosa cell and 128 other cell types or tissues. DR ExpressionAtlas; Q9C0A1; baseline and differential. DR GO; GO:0000785; C:chromatin; ISA:NTNU_SB. DR GO; GO:0005634; C:nucleus; IDA:UniProtKB. DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; ISA:NTNU_SB. DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central. DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro. DR GO; GO:0030534; P:adult behavior; IEA:Ensembl. DR GO; GO:0045664; P:regulation of neuron differentiation; IBA:GO_Central. DR GO; GO:0051930; P:regulation of sensory perception of pain; IMP:UniProtKB. DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central. DR CDD; cd00086; homeodomain; 3. DR Gene3D; 3.30.160.60; Classic Zinc Finger; 3. DR Gene3D; 1.10.10.60; Homeodomain-like; 3. DR InterPro; IPR001356; HD. DR InterPro; IPR017970; Homeobox_CS. DR InterPro; IPR009057; Homeodomain-like_sf. DR InterPro; IPR003604; Matrin/U1-like-C_Znf_C2H2. DR InterPro; IPR036236; Znf_C2H2_sf. DR InterPro; IPR013087; Znf_C2H2_type. DR InterPro; IPR051968; ZnFinger_Homeobox_TR. DR PANTHER; PTHR45891; ZINC FINGER HOMEOBOX PROTEIN; 1. DR PANTHER; PTHR45891:SF1; ZINC FINGER HOMEOBOX PROTEIN 2; 1. DR Pfam; PF00046; Homeodomain; 3. DR SMART; SM00389; HOX; 3. DR SMART; SM00355; ZnF_C2H2; 15. DR SMART; SM00451; ZnF_U1; 7. DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 5. DR SUPFAM; SSF46689; Homeodomain-like; 3. DR PROSITE; PS00027; HOMEOBOX_1; 1. DR PROSITE; PS50071; HOMEOBOX_2; 3. DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 9. DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 5. PE 1: Evidence at protein level; KW Alternative splicing; Disease variant; DNA-binding; Homeobox; KW Metal-binding; Nucleus; Proteomics identification; Reference proteome; KW Repeat; Transcription; Transcription regulation; Zinc; Zinc-finger. FT CHAIN 1..2572 FT /note="Zinc finger homeobox protein 2" FT /id="PRO_0000047243" FT ZN_FING 453..476 FT /note="C2H2-type 1" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT ZN_FING 508..532 FT /note="C2H2-type 2" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT ZN_FING 821..845 FT /note="C2H2-type 3" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT ZN_FING 870..894 FT /note="C2H2-type 4" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT ZN_FING 1009..1032 FT /note="C2H2-type 5" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT ZN_FING 1191..1217 FT /note="C2H2-type 6" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT ZN_FING 1248..1272 FT /note="C2H2-type 7" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT ZN_FING 1480..1503 FT /note="C2H2-type 8" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT DNA_BIND 1595..1654 FT /note="Homeobox 1" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108" FT ZN_FING 1670..1696 FT /note="C2H2-type 9; degenerate" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT ZN_FING 1769..1791 FT /note="C2H2-type 10" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT DNA_BIND 1857..1916 FT /note="Homeobox 2" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108" FT DNA_BIND 2065..2124 FT /note="Homeobox 3" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00108" FT ZN_FING 2451..2471 FT /note="C2H2-type 11; degenerate" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT ZN_FING 2495..2519 FT /note="C2H2-type 12" FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00042" FT REGION 1..107 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 343..425 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 537..566 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 608..655 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 675..710 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 929..974 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 1061..1171 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 1269..1325 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 1389..1408 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 1415..1434 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 1528..1591 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 1696..1769 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 1820..1860 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 1912..2065 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 2268..2327 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 2398..2431 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 2551..2572 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 1..57 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 610..627 FT /note="Pro residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 693..710 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 954..968 FT /note="Basic and acidic residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 1095..1109 FT /note="Pro residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 1116..1138 FT /note="Pro residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 1275..1308 FT /note="Basic and acidic residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 1389..1403 FT /note="Pro residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 1529..1553 FT /note="Pro residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 1561..1581 FT /note="Basic and acidic residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 1697..1725 FT /note="Acidic residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 1983..2007 FT /note="Pro residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 2024..2040 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 2281..2296 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 2400..2423 FT /note="Pro residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 2558..2572 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT VAR_SEQ 854..862 FT /note="FLLDMEGAE -> RTETGLLIK (in isoform 2)" FT /evidence="ECO:0000303|PubMed:11214970" FT /id="VSP_039496" FT VAR_SEQ 863..2572 FT /note="Missing (in isoform 2)" FT /evidence="ECO:0000303|PubMed:11214970" FT /id="VSP_039497" FT VARIANT 1913 FT /note="R -> K (in MARSIS; dbSNP:rs1555344723)" FT /evidence="ECO:0000269|PubMed:29253101" FT /id="VAR_081116" FT CONFLICT 473 FT /note="R -> Q (in Ref. 1; BAA83008)" FT /evidence="ECO:0000305" FT CONFLICT 550 FT /note="P -> T (in Ref. 1; BAA83008)" FT /evidence="ECO:0000305" SQ SEQUENCE 2572 AA; 274176 MW; 239C8050F65B25C2 CRC64; MATLNSASTT GTTPSPGHNA PSLPSDTFSS STPSDPVTKD PPAASSTSEN MRSSEPGGQL LESGCGLVPP KEIGEPQEGP DCGHFPPNDP GVEKDKEQEE EEEGLPPMDL SNHLFFTAGG EAYLVAKLSL PGGSELLLPK GFPWGEAGIK EEPSLPFLAY PPPSHLTALH IQHGFDPIQG FSSSDQILSH DTSAPSPAAC EERHGAFWSY QLAPNPPGDP KDGPMGNSGG NHVAVFWLCL LCRLGFSKPQ AFMDHTQSHG VKLTPAQYQG LSGSPAVLQE GDEGCKALIS FLEPKLPARP SSDIPLDNSS TVNMEANVAQ TEDGPPEAEV QALILLDEEV MALSPPSPPT ATWDPSPTQA KESPVAAGEA GPDWFPEGQE EDGGLCPPLN QSSPTSKEGG TLPAPVGSPE DPSDPPQPYR LADDYTPAPA AFQGLSLSSH MSLLHSRNSC KTLKCPKCNW HYKYQQTLDV HMREKHPESN SHCSYCSAGG AHPRLARGES YNCGYKPYRC DVCNYSTTTK GNLSIHMQSD KHLANLQGFQ AGPGGQGSPP EASLPPSAGD KEPKTKSSWQ CKVCSYETNI SRNLRIHMTS EKHMQNVLML HQGLPLGLPP GLMGPGPPPP PGATPTSPPE LFQYFGPQAL GQPQTPLAGP GLRPDKPLEA QLLLNGFHHV GAPARKFPTS APGSLSPDAH LPPSQLLGSS SDSLPTSPPP DDSLSLKVFR CLVCQAFSTD SLELLLYHCS IGRSLPEAEW KEVAGDTHRC KLCCYGTQLK ANFQLHLKTD KHAQKYQLAA HLREGGGAMG TPSPASLGDG APYGSVSPLH LRCNICDFES NSKEKMQLHA RGAAHEENSQ IYKFLLDMEG AEAGAELGLY HCLLCAWETP SRLAVLQHLR TPAHRDAQAQ RRLQLLQNGP TTEEGLAALQ SILSFSHGQL RTPGKAPVTP LAEPPTPEKD AQNKTEQLAS EETENKTGPS RDSANQTTVY CCPYCSFLSP ESSQVRAHTL SQHAVQPKYR CPLCQEQLVG RPALHFHLSH LHNVVPECVE KLLLVATTVE MTFTTKVLSA PTLSPLDNGQ EPPTHGPEPT PSRDQAAEGP NLTPEASPDP LPEPPLASVE VPDKPSGSPG QPPSPAPSPV PEPDAQAEDV APPPTMAEEE EGTTGELRSA EPAPADSRHP LTYRKTTNFA LDKFLDPARP YKCTVCKESF TQKNILLVHY NSVSHLHKMK KAAIDPSAPA RGEAGAPPTT TAATDKPFKC TVCRVSYNQS STLEIHMRSV LHQTRSRGTK TDSKIEGPER SQEEPKEGET EGEVGTEKKG PDTSGFISGL PFLSPPPPPL DLHRFPAPLF TPPVLPPFPL VPESLLKLQQ QQLLLPFYLH DLKVGPKLTL AGPAPVLSLP AATPPPPPQP PKAELAEREW ERPPMAKEGN EAGPSSPPDP LPNEAARTAA KALLENFGFE LVIQYNEGKQ AVPPPPTPPP PEALGGGDKL ACGACGKLFS NMLILKTHEE HVHRRFLPFE ALSRYAAQFR KSYDSLYPPL AEPPKPPDGS LDSPVPHLGP PFLVPEPEAG GTRAPEERSR AGGHWPIEEE ESSRGNLPPL VPAGRRFSRT KFTEFQTQAL QSFFETSAYP KDGEVERLAS LLGLASRVVV VWFQNARQKA RKNACEGGSM PTGGGTGGAS GCRRCHATFS CVFELVRHLK KCYDDQTLEE EEEEAERGEE EEEVEEEEVE EEQGLEPPAG PEGPLPEPPD GEELSQAEAT KAGGKEPEEK ATPSPSPAHT CDQCAISFSS QDLLTSHRRL HFLPSLQPSA PPQLLDLPLL VFGERNPLVA ATSPMPGPPL KRKHEDGSLS PTGSEAGGGG EGEPPRDKRL RTTILPEQLE ILYRWYMQDS NPTRKMLDCI SEEVGLKKRV VQVWFQNTRA RERKGQFRST PGGVPSPAVK PPATATPASL PKFNLLLGKV DDGTGREAPK REAPAFPYPT ATLASGPQPF LPPGKEATTP TPEPPLPLLP PPPPSEEEGP EEPPKASPES EACSLSAGDL SDSSASSLAE PESPGAGGTS GGPGGGTGVP DGMGQRRYRT QMSSLQLKIM KACYEAYRTP TMQECEVLGE EIGLPKRVIQ VWFQNARAKE KKAKLQGTAA GSTGGSSEGL LAAQRTDCPY CDVKYDFYVS CRGHLFSRQH LAKLKEAVRA QLKSESKCYD LAPAPEAPPA LKAPPATTPA SMPLGAAPTL PRLAPVLLSG PALAQPPLGN LAPFNSGPAA SSGLLGLATS VLPTTTVVQT AGPGRPLPQR PMPDQTNTST AGTTDPVPGP PTEPLGDKVS SERKPVAGPT SSSNDALKNL KALKTTVPAL LGGQFLPFPL PPAGGTAPPA VFGPQLQGAY FQQLYGMKKG LFPMNPMIPQ TLIGLLPNAL LQPPPQPPEP TATAPPKPPE LPAPGEGEAG EVDELLTGST GISTVDVTHR YLCRQCKMAF DGEAPATAHQ RSFCFFGRGS GGSMPPPLRV PICTYHCLAC EVLLSGREAL ASHLRSSAHR RKAAPPQGGP PISITNAATA ASAAVAFAKE EARLPHTDSN PKTTTTSTLL AL //