ID   E2RK30_CANFA            Unreviewed;      4075 AA.
AC   E2RK30;
DT   30-NOV-2010, integrated into UniProtKB/TrEMBL.
DT   30-NOV-2010, sequence version 1.
DT   05-SEP-2012, entry version 14.
DE   SubName: Full=Uncharacterized protein;
GN   Name=PKHD1;
OS   Canis familiaris (Dog) (Canis lupus familiaris).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Canidae;
OC   Canis.
OX   NCBI_TaxID=9615;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Boxer;
RX   PubMed=16341006; DOI=10.1038/nature04338;
RG   Broad Sequencing Platform;
RA   Lindblad-Toh K., Wade C.M., Mikkelsen T.S., Karlsson E.K., Jaffe D.B.,
RA   Kamal M., Clamp M., Chang J.L., Kulbokas E.J. III, Zody M.C.,
RA   Mauceli E., Xie X., Breen M., Wayne R.K., Ostrander E.A.,
RA   Ponting C.P., Galibert F., Smith D.R., deJong P.J., Kirkness E.F.,
RA   Alvarez P., Biagi T., Brockman W., Butler J., Chin C.-W., Cook A.,
RA   Cuff J., Daly M.J., DeCaprio D., Gnerre S., Grabherr M., Kellis M.,
RA   Kleber M., Bardeleben C., Goodstadt L., Heger A., Hitte C., Kim L.,
RA   Koepfli K.-P., Parker H.G., Pollinger J.P., Searle S.M.J.,
RA   Sutter N.B., Thomas R., Webber C., Baldwin J., Abebe A.,
RA   Abouelleil A., Aftuck L., Ait-Zahra M., Aldredge T., Allen N., An P.,
RA   Anderson S., Antoine C., Arachchi H., Aslam A., Ayotte L.,
RA   Bachantsang P., Barry A., Bayul T., Benamara M., Berlin A.,
RA   Bessette D., Blitshteyn B., Bloom T., Blye J., Boguslavskiy L.,
RA   Bonnet C., Boukhgalter B., Brown A., Cahill P., Calixte N.,
RA   Camarata J., Cheshatsang Y., Chu J., Citroen M., Collymore A.,
RA   Cooke P., Dawoe T., Daza R., Decktor K., DeGray S., Dhargay N.,
RA   Dooley K., Dooley K., Dorje P., Dorjee K., Dorris L., Duffey N.,
RA   Dupes A., Egbiremolen O., Elong R., Falk J., Farina A., Faro S.,
RA   Ferguson D., Ferreira P., Fisher S., FitzGerald M., Foley K.,
RA   Foley C., Franke A., Friedrich D., Gage D., Garber M., Gearin G.,
RA   Giannoukos G., Goode T., Goyette A., Graham J., Grandbois E.,
RA   Gyaltsen K., Hafez N., Hagopian D., Hagos B., Hall J., Healy C.,
RA   Hegarty R., Honan T., Horn A., Houde N., Hughes L., Hunnicutt L.,
RA   Husby M., Jester B., Jones C., Kamat A., Kanga B., Kells C.,
RA   Khazanovich D., Kieu A.C., Kisner P., Kumar M., Lance K., Landers T.,
RA   Lara M., Lee W., Leger J.-P., Lennon N., Leuper L., LeVine S., Liu J.,
RA   Liu X., Lokyitsang Y., Lokyitsang T., Lui A., Macdonald J., Major J.,
RA   Marabella R., Maru K., Matthews C., McDonough S., Mehta T.,
RA   Meldrim J., Melnikov A., Meneus L., Mihalev A., Mihova T., Miller K.,
RA   Mittelman R., Mlenga V., Mulrain L., Munson G., Navidi A., Naylor J.,
RA   Nguyen T., Nguyen N., Nguyen C., Nguyen T., Nicol R., Norbu N.,
RA   Norbu C., Novod N., Nyima T., Olandt P., O'Neill B., O'Neill K.,
RA   Osman S., Oyono L., Patti C., Perrin D., Phunkhang P., Pierre F.,
RA   Priest M., Rachupka A., Raghuraman S., Rameau R., Ray V., Raymond C.,
RA   Rege F., Rise C., Rogers J., Rogov P., Sahalie J., Settipalli S.,
RA   Sharpe T., Shea T., Sheehan M., Sherpa N., Shi J., Shih D., Sloan J.,
RA   Smith C., Sparrow T., Stalker J., Stange-Thomann N., Stavropoulos S.,
RA   Stone C., Stone S., Sykes S., Tchuinga P., Tenzing P., Tesfaye S.,
RA   Thoulutsang D., Thoulutsang Y., Topham K., Topping I., Tsamla T.,
RA   Vassiliev H., Venkataraman V., Vo A., Wangchuk T., Wangdi T.,
RA   Weiand M., Wilkinson J., Wilson A., Yadav S., Yang S., Yang X.,
RA   Young G., Yu Q., Zainoun J., Zembek L., Zimmer A., Lander E.S.;
RT   "Genome sequence, comparative analysis and haplotype structure of the
RT   domestic dog.";
RL   Nature 438:803-819(2005).
RN   [2]
RP   IDENTIFICATION.
RC   STRAIN=Boxer;
RG   Ensembl;
RL   Submitted (JUL-2011) to UniProtKB.
CC   -!- CAUTION: The sequence shown here is derived from an Ensembl
CC       automatic analysis pipeline and should be considered as
CC       preliminary data.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   Ensembl; ENSCAFT00000003416; ENSCAFP00000003171; ENSCAFG00000002165.
DR   GeneTree; ENSGT00530000062974; -.
DR   OMA; FDYGAML; -.
DR   GO; GO:0016324; C:apical plasma membrane; IEA:Compara.
DR   GO; GO:0005813; C:centrosome; IEA:Compara.
DR   GO; GO:0005932; C:microtubule basal body; IEA:Compara.
DR   GO; GO:0072686; C:mitotic spindle; IEA:Compara.
DR   GO; GO:0072372; C:primary cilium; IEA:Compara.
DR   GO; GO:0042384; P:cilium assembly; IEA:Compara.
DR   GO; GO:0001822; P:kidney development; IEA:Compara.
DR   GO; GO:0010824; P:regulation of centrosome duplication; IEA:Compara.
DR   Gene3D; G3DSA:2.60.40.10; Ig-like_fold; 9.
DR   Gene3D; G3DSA:2.160.20.10; Pectin_lyas_fold; 4.
DR   InterPro; IPR019316; G8_domain.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR014756; Ig_E-set.
DR   InterPro; IPR002909; IPT_TIG_rcpt.
DR   InterPro; IPR006626; PbH1.
DR   InterPro; IPR012334; Pectin_lyas_fold.
DR   InterPro; IPR011050; Pectin_lyase_fold/virulence.
DR   Pfam; PF10162; G8; 2.
DR   Pfam; PF01833; TIG; 7.
DR   SMART; SM00429; IPT; 5.
DR   SMART; SM00710; PbH1; 9.
DR   SUPFAM; SSF81296; Ig_E-set; 7.
DR   SUPFAM; SSF51126; Pectin_lyas_like; 2.
DR   PROSITE; PS51484; G8; 2.
PE   4: Predicted;
KW   Complete proteome; Reference proteome.
SQ SEQUENCE 4075 AA; 447675 MW; 4C330A6816DD3C5F CRC64; MIVWLISLMS IEILLLAGPA LSFHIEPKEG SLAGGTWITV IFDGLELEQL YPTNGSQLEI HLVNVAVPAL PSIPCDVSPV FLDLPVVMCR TRSLLPEVHE GLYYLEAHAG GQVVGSPSPG LQDCCTFKFS REQTPIVYQV TPPSGVPGKM IHVYGWIITG RSETFDFDAE YIDSPLILEA QGDKWVTACS LINRQTGSCY PLQENHGLGT LQCRVEGNYI GSQNVSFSVF NKGKSMVHKN AWLVSAKLDL FLYQTYSEIL SVFPETGSLG GKTDIIITGD FFDNPALVTI AGVPCDIRHM SPRKIECTTR APGKRARLTA PQAGNRGLLF EVGEAVEGLD LTVATPGYRW QIVPNASSPF GFWFKEGQPF RARLSGFFVA PETNNYTFWI QADNQASLYF SQSEDPRMKV KVASIRVGTA DWFDAWEQDR NEGVWQRKTP KLELVGGTRY YLEAEHYGRT PSRGMRIGVQ IHNTWLNPDV VSTYLREKHQ IRVRAQRLPE IQMLTVSGRG SFFLTWDNVT SQPIPENATA HQIQTAIEEL LAVKCKLEPL SANILLWLGF EQGPEGSSFE GDLTSGTEPF CGRFSLHQPR RLVLTPPAAQ KDYWLDRYTH LCIAYKGRMK NILKVTVFFT TDFQNFIKKN ITCDWNLVGT RPNSWQFTCT DLWETCVHHS MYLQPPLVDS PVLVHRIDLF PLSQETSGFY VDEIIIADTN ITVSQADSET AHPGGNLVES LCVVGSPPVY NISFWLVGCG QELPLITASI VPTGGEARRS GLVQVTTQRI QKTSPPLGGY FHIQLPTSVI PDVPVHISAS HLRKLLQNNA DNFTSRYLNS SDLTVMEDLK SCYEHEWTLS WSSQVGDLPN FIRVSDANLT GVNPAATVRV VYDGGVFLGP VFGDMLVTAN QHTQVVVRVN DIPAHCSGSC SFQYLEGSTP QIHSAWYSLD GDISLLIYIF GINFSGDPQA LEIMVNKTNC KVIFSNQTNV ICQTDLLPVG MHRLFMVVRP FGRAINTSGE ALFLSVEPRL DAVEPSRAAE IGGLWATIQG SGLEDVSLVL FGSQSCAINI TTSNSRRIQC RVPPRGKDGP VVNLTVVSGD HSAVLPMAFT YVSSLNPVIT SLSRNRSNIA GGDTLFIRMA LLVNYTDLDV EVCIQNTLAP AHVQMPQGLE VVLPPLPAGL YSISVSINGI SIRSPGVDLH IQYITEVFSI EPCCGSLLGG TILSISGIGF IRDPTLVWVF VGNRSCDILN STETVIWCET PPAALLPDSN IPAIPVPMEI WAGNVSFARE SLLNLSFTFL YEAAMTPVVT AMRGEIINNS LRFYVEGNNL SNSVILLGVS HCDLETQTLR NNVSLSGCSF PLHSLEAGFY PLQVRQKQMG FANMSAVPQQ YVITPWIMAI SPTHGSACGG TVLTVRGLAL SSRKRSVQVD LLGPFTCVIL SLGHQTILCQ INKVNDSFPD VSFTLNVTVI VNGLPSECQG NCTLFLQEET TPIVDSLTTN ISGSLTMVLI RGRKLGITAV EPMVFVDDHL PCIVTFFNAS YVICWISDLT PGLHYVSVFH ARNGYACFGN VSRHFYILPQ VFHYFPKNFS IHGGSLLTVE GTALRGKNST LVYVGQQACL TVSISTELIQ CIVPAGNGSV GLVIEVDGLS YQMGVIGYSS AFTPRLLSIS QTDDVLTFAV AQVSGAENVD IFIGMSPCVG ISGNHTVLQC VVSSLPAGEY PVRGYDRMRG WASSVLVFTS TATISGVTEN FGCLGGRLVH VFGAGFSPGN 
VSAAVCGAPC QVLANATVSA FSCLVLPLNV SLAFLCGLKH SEESCEASSS TYVQCDLTVT VGTETLPQSW PYLYICEESP QCLFAPDHWT ESTFPWFSGL FISPKVERDE VLIYNSSCNI AMETEAKMEC ETPNQPITAK ITEIRKSRGQ STQGNFSLQF CLRWSRTHSW FPERVPQDGD NVTVENGQLL LLDTNTSILN LLHIKGGKLI FMDPGPIELR AHAILISDGG ELRIGSEDKP FQGKAEIKLY GSSHSTPFFP YGVKFLAVRN GTLSLHGLLP EVTFTHLQAA AYAGDTVLAL EDAVDWHPGD EAVIISRIGV GGAKPMEEIV IVEAVHNTDL YLRSPLRYSH NFTENWVAGV LHILKVTVVL LSRSITIQGN LTAERMKHLA SCQEASDSEG NLQDCLYSKS EKMLGSRDLG ARVIVQSFPE EPSRVQLRGV QFRDLGQAFR KHVSALTLVG AMRDSYVQGC TVWSSFNRGL SMSMTLGLKV DSNIFYNILG HALLVGTDMD IKYISWEAAP EKKPDWSEQG NIIRNNVIIS ISGTEGLSSP EMLTPSGIYI LNPTNVVEGN RVYVAGLGYF FHLVTSQTSQ APLLSFTQNI AHSCTRYGLF IYPQFQPPWD DGRGPTLFQN FTVWGSAGGA RISRSSNLHL KNFQVYSCRD FGIDILESDA NTSVTDSLLL GHFAHKGSLC MSAGIKTPKR WELIISNTTF VNFDLTDCVS IRTCSGCSRG QGGFTVKTNQ LKFINSPNLV AFPFPHAAIL EDLDGSLSGR NRSHILASME TLSASCLVNL SFSQIVPGSV CGEDVIFHHM SIGLANAPNV SYDLTITDSR NKTTTVNYVR DTLSNLYGWM ALLLDQETYS LQFETPWISR SLQYSATFGS FAPGNYLLLV HTVLWPYPDI LVRCGSQEGR SLPSLPLPGQ DQGCDWFFNT QLRQLIYLVS GEGQVQVTLQ VKEGVPPTIS ASTSAPESAL KWSLPEAWTG IEEGWGGHNH TIPGPGDDIL ILPNRTVLVD TNLPFLKGLY VMGTLEFPVD RSNVLSVACM VIAGGELKVG TLDNPLEKEQ KLLILLRASE GIFCDRLNGI HIDPGTIGVY GKVQLHGACP KKSWTRLAAD IASGNERIIV EDAVDWRPHD KIVLSSSSYE PHEAEILTVK EVQAHHVKIY ERLKYRHIGS VHVMEDGRCI RLAAEVGLLT RNIQIQPDIS CRARLLVGSF RNSSSKEFSG VLQLSNVEIQ NFGSPLYSSI EFTNASAGSW IISSSLHQSC SGGIRAAASH GIILNDNIVF GTVGHGIDLE GQNFSLSNNL VVLMTQSAWS TVWVAGIKAN QAKDINLYGN VVAGSERIGF HIQGHRCSSP EARWSDNVAH SSLHGLHLYK ENGLDNCTGI SGFLAFKNFD YGAMLHVENS VEIENITLVD NSIGLLATVY VSSVPKSHIE NVQIVLRNSV IIATSSSFDC IQDRVKPRSA NLTSSDRAPS NPRGGRVGIL WPIFTSEPNW WPQEPWHRVR NGHSTSGILK LQDVTFSNFV KSCYSDDLDI CILPNVENTG IMHPIMAEGT RMLKIKDKNK FYFPPLQARK GLGILVCPES DCENPRKYLF KDLDGRALGL PPPVSVFPKT EAEWTGSFFN TGTFREEQKC TYRALIQGYI CKQSDQAILI LDNADATWAM QKLYPVVSVT RGFVDTFSSV NADAPCSTSG SASTFYSILP TREITKICFV DQTPQVLRFF LLGNRSTSKL LLAVFYHELQ NPRVFIGESF IPPIMVQSTS SLLDESIGSN YFSILDNLLY VVLQGQEPIE IHSGVSIHLA LTVMFSVLEK 
GWEIIILERL TDFLQVSQDQ IRFIHEMPGN EATLKAIADN KAKRKRNCPT VTCASPYRVG QRRPLMTEMS SYRVPSPTIM ETASKVIVIE IGDLPTIRST RLISYLTSNK LQNLAHQIIT AQQTGVLENV LNMTIGALLV TQPKGVTDYG NASSFKTGNF IYIRPYALSV LVQPSDGEVG KELTVQPRLV FLDKQNQRIE SLGPPSEPWA ISVSLEGTSD PVLKGCTQAE SQDGYVSFSN LAVLISGSNW HFIFTVTSPP VGANFTARSR SFTVLPAAPS EKSSIILAVS LCSVASWLAL CCLVCCWFRK SKSRKIKSED ISEFKTNDQK SHIHMSSKHP RSQETKKEDT MMGEDMKIKV IMDKVNQLPH QSLNGVSRRK VSRRAVREEG SSREEDVVPA PRIISITSQG HTCVPGSPDQ QIYLQEAGNW KEAQEQLVSY QLAGQDQRLL LCPDLRRERQ QLQGQSQLGQ EGGSVGLSQE KKASGGATQA SCPHLVHPET IQEQL //