Supplementary material S1: E2-like domain family multiple alignments Alignments of E2 domain families displaying significant structure and/or sequence divergence surrounding the active site. Sequences in alignments are identified by gi number and species name. Results from the JPRED secondary structure prediction program are given at the top of the alignment, and residue consensus is given at varying thresholds below the alignment. Eukaryotic proteomes with unassigned gi numbers have been assigned temporary ids. --------------------------------------------------------- Table of Contents 1.AKTIP 2.Tsg101 (UEV) 3.RWD 4.Apg10 5.Apg3 6.Ufc1 7.UbcI 8.UBE2W 9.BRUCE-like 10.Ubc6p --------------------------------------------------------- Note: Eukaryotic proteomes with unassigned gi numbers were assigned temporary ids. These temporary ids are prefixed by the species abbreviations. The species and their respective temporary id prefixes are as follows: Naegleria gruberi: Ngru Phytophthora sojae: Psoj Phytophthora ramorum: Pram Phaeodactylum tricornutum: Ptri Chlamydomonas reinhardtii: Crei Thalassiosira pseudonana: Tpseu Monosiga brevicola: Mbre Nematostella vectens: Nvec Branchiostoma floridae:Bflo Note: The temporary ids are not linked to proteins 1. AKTIP FINAL -HHHHHHHH---------EEE-------EEEEEEEE-----------EEEEEEE---------------------------------------------------EEEEE---EE---------EEEEE-------------HHHHHHHHHHHH-------------------------HHHHHHHHH---HHHHHHHHHHHHH-------------------EEE------------------ 124429462 Paramecium_tetraurelia IIKFEYKLLSVYAHKFS-IILLPKTEQITEWHGMISISYGLYAK--GKFKFVLNFPKTFP--ESIP---------------------------------------KIRFL-CPLLHPLID-EYNNVDVDTLIPK-WKYGQDCMVYNLLNKLYEMFIDVSYY------------DCVTSFNPRAAQLFSNDINSYAEEVKKQVAQSE--QQLYDNR-----DDFVLKLKE---PTYNTQTILQKL 58258487 Cryptococcus_neoformans_var_neoformans_JEC21 EIAQEYASLRVPNGCPKGMFLVPTEETLLRWHGVLFLHHGPYSG--SILRFTVDFPSNYP--QSGP---------------------------------------TVRFN-SDVFHPLVDARTKIWHPRGNKMQ-WRPKVDHI-SRLLHELKSSFSRVTLD----------LIEEREAVNKHVWSLYQHSRQTFISLTAQRAVHSASQTVLFPAV-----YSQSPSSTT---PPRRRQTSLNSL 71003864 Ustilago_maydis_521 DISFQYAQLCLASNCPLGIYVQLDDDDTHLWHCVIFVSNGPFHG--GIFRFDIVFPPTYP--TSTP---------------------------------------QVYFP-PTLLHPLVDPDSGRLLLASRFQA-WKPRVNFV-THVLHFVKEVFTEEFLQ----------GLRERAVANTEVWRMFRSHRGLFNKLALQSVALSTAPSSLYDVSGGSGFPNPSIGLNR--VQPRSVEGKVGRG 154421289 Trichomonas_vaginalis_G3 ERLLLQDFRACLQQVRKGVLVQPSVANTFQWQVTMMPNNGIYK---DQILTYLIIFDNFP--ASVP---------------------------------------KIVFQ-PGIVHPLIYPQTRVFDTSEMFKE-WNVSVRVY-TLINYIYDSFIEILIPG-------------NREVPNPEAASKMRRGGDAYAKKLIEQLPPPPSPTELSELNVPK--NWNKQKERI---AHILSS------------ 123415931 Trichomonas_vaginalis_G3 SQCLLLEFQACVEQLPKEIVVQPSIRTMRQWQVCMTPNNGFYN---GKIIIYNIYLENYP--NNIP---------------------------------------DVIFQ-SGIVHPLINPHSTNFASQLLINE-WNRYTRIY-ELIQAIYNSFVEIPQLK---------------QVPNPEAYNILKSPDAKNTILG--QLKTPS-AAENHEHNEPK--KWTPQKEKL---CHVLYD------------ 23509112 Plasmodium_falciparum_3D7 NYSILTEYSFLMNEIPRNFYCLPQIDNLLIWDVFIMLYSTVYKN--AKLKLQIRLSHNYP--NTCP---------------------------------------EVFFI-TPIFHPLVNIQTGKLNLGTSLSN-WDPSCHYM-SLIFLYIKNLFYLQEEY------------NKETVENQEALFLLNNDKDEFIKNVQKYINQGN--KKIYDHM-----ENCMFNFN----QKEEHIEIKDKL Mbre1000000752 Monosiga_brevicollis EDRVKAMCHRSCQPSKSRRSLAPCL--AIAWDAVYFVREGIYRG--GIFKLRLLIPPTYP-DKVVP---------------------------------------RIFFQ-SNVFHPRVNDITGELDMSRYYVS-WNPDVDRL-KHLFKYVHKMFRKIDAD---------------NAVNEGAAAMFRVSMEAFEAPVQRCVKASQ--ATAHTP------TGNSIELG----HSDNRDGIMQAI 66816191 Dictyostelium_discoideum_AX4 NYRLLFEYKKLSDLMIRGLYVIPSFNSIHEWHCVLFIRNRIYRS--AIIRFNVVIPDDFP--ESCP---------------------------------------SIKFL-TPIFHPLIGPE-GEIDLSHQFPD-WSAEKYMI-AHVLCYIKSIFHNPDKY-------------DFPPLNTDAHNLMKNNPDSFQKRVSEYVEDSI--KNVYDNP-----QDNAICFSKEWKDKNQLENAKKQI VIIb#55.m05100 Toxoplasma_gondii LYSLLIEYSQTSQNSPEGVYCIPSWDNLRVWDGVILLRHGIYQG--GIFKFRIKVPPSYP--ADPP---------------------------------------GVEFV-SRVFHPLVDPETKTLNMKPQFAT-WRPDKDYL-PMLLLYLKSIFYKREFL--------KGTDAEDAWLNPEAGKTFREDKKNFLEKVKACVEESQ--AKVYDKE-----ENFVFNFSE---FHREKKPIIQAL Pram1000008760 Phytophthora_ramorum DYGLMIEYKHLRQHVPSGIYVLPSFDHSRVWYGAIFVHAGLYRN--GIFKFTIFLPESYNGPGTYP---------------------------------------RIVFN-TNVFHPYVYEDTKELDLKPKFPD-WDPELHYM-VAVLTYLKGIFYMKDFP------------DLIQVANGAALDMFRHDPENYVSKVEECVDESL--TNVYNNE-----QGSTIRFTK---HNPAHDNLRQEL Ptri1000004528 Phaeodactylum_tricornutum DYKVTIEYKHLKSHAPGGVYLIPALHDLRHFYGIIFVRRGPFTN--GIFKFELTLSPEYNDNNQHP---------------------------------------RIAFS-SSVYNPHVHYESGELDIKSAYPR-WDPSRHYL-VTVLTFLKKIFYAKSFE--------------DAKANAEARELATKYPAEFRQKVDACVLASQ--KQVFQND-----ETSTAQFSE---EELSHRVLLDLL Tpseu1000010538 Thalassiosira_pseudonana DYKLSIEYKHLKQHCPGGVYLVPSSTSLRLFHGVIFVRRGSFTN--GIFKFTLECPPRYNDVDCHP---------------------------------------KVVFS-SYVYNPHVHPETGELDLKSAYPQ-WDPAKHFL-VTVLTYVKRIFYIKEYV--------EEGEAEKKYPNQEALRLFKGDGEGYRRRVQECVRESQ--RSVF-------------------------------- Nvec1000015413 Nematostella_vectensis EQFILAEYKAVYRNPQSGVYVIPSLKSLQVWHGIIFLRAGPYRD--GIFRFFIFLQDDFP--DSRP---------------------------------------LVKFT-SNVFHPQIH-ENGVFNLEVAFPN-WHRGKHCI-WQILKYMKSCFYSTDTW---------------GGVNQDAVNVVHSSLEEFKEKARNCALESQ---KIFDEE-----TESSGKFD---MSYV--------- Nvec1000020327 Nematostella_vectensis EQFILAEYKAVYRNPQSGVYVIPSLKSLQEVIGAM-GRHTVVRDRTSSISTTNMTRKSFP--DSRP---------------------------------------LVKFT-SNVFHPQIH-ENGVFNLEVAFPN-WHRGKHCI-WQILKYMKSCFYSTDTW---------------GGVNQDAVNVVHSSLEEFKEKARNCALESQ---KIFDEE-----TESSGKFD---MSYV--------- 17933724 Drosophila_melanogaster EYKILAEYKMIESEKLSGIYVIPSYANSLQWFGVFFGRQGLYAE--SVFRFTILLPDRFPDDKSLP---------------------------------------SIIFQ-QDVIHPHVCPYTHSLDVSHAFPE-WRCGEDHL-WQLLKYLQVIFSDPLDS--------IRGIEVDKLKNSEAAELLMNNKEEYVARVQENIKESK--EHIFDTPPTE--DPHYIVFEK--FQQDVHGPVLERI 17566194 Caenorhabditis_elegans EQALAAEFVEVCRAPIDGIFVSPSANNKFLWFGVIFVRKGIFGG--GIFRFNIHIPLEFPDASDLP---------------------------------------RVIFEQSNLFHPLICTKSKELCLNRSFPEGWKKEKHSL-RRVLVVLQRSFYSYDVD-------------TDKCINPEASVLYKEHRDKFREIAKECVEASR--SMVYDELAEQEHDPNGIRLLP--WDALTHKAAREKM 24656026 Drosophila_melanogaster GYHILAEYNLV-KEELKNIYAIPSYACGLHWFGVIFVHSGIYAG--SVFRFSILLPENFPADISLP---------------------------------------TVVFS-TEVLHPHICPQNKTLDLAHFLNE-WRKDEHHI-WHVLRYIQAIFADPEGSICTGQSSSGDLVIMDEVRNMNALNMLAKSRPEYIKRVQEQAILSR--NLIYDRPPTE--DPHYIIVEP--YCAERHLKFMDQL 58381786 Anopheles_gambiae_str_PEST ----------LQSEDLGGIYVTPSYENPFLWFGVIFVRSGMYKD--GVFRFTISLPNRFPNDSTVP---------------------------------------VVAFQ-SDVFHPMVNPSDGVLNLSDTFPK-WQSGDSHI-WQMLKFVQFILQNLDDH----------TIPSEHVVNNEAYQLLMENRAEFLLRVEQCVEDSQ--RKLYDLPAQP--DRFYICFDR--FNPDVHGPVLQSM 91084089 Tribolium_castaneum EYVILAEYKMIQSENIQGVYVIPSRENPFIWFGVIFVRSGPYED--GVFRFNVVLDENFP-DSEHP---------------------------------------KVVFL-SEMFHPVVHPDTNEVHLLKAFPT-WNKADQHI-WQVLKYIHWMFYNLSPT-------------IAHSVNPEAADLFANNQEAFRNKVKEIVKTSQ--EKIYDEPPVE--DKHYLVFEP--YNPEVHEQAKSTM 66510005 Apis_mellifera EYNILSEYNMLCSQDLKGIYVIPSARNSLLWFGVQFVRQGLYQG--GIFRFDITLPQNFP-NGECP---------------------------------------KVTFQ-TPIFHPLIDPVSGELNTLWGFPE-WRKS-NRI-WQLIQFITKILSKVDLK--------------MNPVNHEAYNLLENNFEVFRDRVKKCVRESL--NKVYNSSTVD--DPHYIMFSP--YVDELYNSIKREI Nvec1000022094 Nematostella_vectensis EYTLMAEYNQLRSQRLPGVYVLPAAKSALVWYGVIFIRMGLYQD--GIFKFQMTIPENFP-DGDCP---------------------------------------TLVFK-PTIFHPVVNIETGELDVRRAFPR-WRRNINHL-WQVLLYAKRIFYKIDSR---------------DPLNPEAAEMYQNDKDRYKQKVNECLRRCH--NELH-LAVAD--DPHAIKFVE--LTPEKQDDVKNQI Bflo1000043060 Branchiostoma_floridae EYSLMAEYNLLQRQRIPGIYVIPSAKSPLFWFGVVFVRQGLFQE--GVFKFYVIVPDNYP-DGDCP---------------------------------------RVIFD-PPIFHPVVDIETGELDVKRGFTK-WRRNVNHI-WQVLLYVRRIFYKIDTK---------------NPLNPEAAVLYDKELDVFRRKVTETIRRCN--EHLYDPPTTD--DPHAIRFTT--WDPSVHEDAKKQV 115929982 Strongylocentrotus_purpuratus EYALLAEYNQLHKQKLPGVYVIPSAKSPLHWFGVLFIRQGLYQE--GVFKFDLLIPENYP-DGDCP---------------------------------------RLIFH-PAVFHPIIDPISGELDVKRAFTK-WKRNINHL-WQVLLYARRVFYKIDTK---------------SPLNQEAATLYEEDCDMFKSRVCDTVELSK--LKRYDPPYSD--DPHAIKFSR--WDESKHGETLRTI 114600887 Pan_troglodytes EYSLLAEFTLVVKQKLPGVYVQPSYRSALMWFGVIFIQHGLYQD--GVFKFTVYIPDNYP-DGDCP---------------------------------------RLVFD-IPVFHPLVDPTSGELDVKRAFAK-WRRNHNHI-WQVLMYARRVFYKIDRA---------------SPLNPEAAVLYEKDIQLFKSKVVDSVKLCT--ARLFDQPKIE--DLYAISFSP--WNPSVHDEAGEKI 58865430 Rattus_norvegicus EYSLLAEFTMVVKQKLPGVYVQPSYRSALVWFGVIFIRHGLYQD--GVFKFTVYIPDNYP-DGDCP---------------------------------------RLLFD-IPVFHPLVDPTSGELDVKRAFAK-WRRNHNHI-WQVLMYARRVFYKIDTT---------------SPLNPEAAVLYEKDIQLFKSKVVDSVKVCT--ARLFDQPKIE--DPYAISFSP--WNPSVHDEAREKM 6753918 Mus_musculus EYSLLAEFTLVVKQKLPGVYVQPSYRSALVWFGVIFIRHGLYQD--GVFKFTVYIPDNYP-DGDCP---------------------------------------RLLFD-IPVFHPLVDPTSGELDVKRAFAK-WRRNHNHI-WQVLMYARRVFYKIDTT---------------SPLNPEAAVLYEKDIQLFKSKVVDSVKVCT--ARLFDQPKIE--DPYAISFSP--WNPSVHDEAREKM 114662500 Pan_troglodytes EYSLLAEFTLVVKQKLPGVYVQPSYRSALMWFGVIFIRHGLYQD--GVFKFTVYIPDNYP-DGDCPLSRGTSVLCSSFRHAIRALFLCSGSAREAGCLPSCSTHQRLVFD-IPVFHPLVDPTSGELDVKRAFAK-WRRNHNHI-WQVLMYARRVFYKIDTA---------------SPLNPEAAVLYEKDIQLFKSKVVDSVKVCT--ARLFDQPKIE--DPYAISFSP--WNPSVHDEAREKM 61743933 Homo_sapiens EYSLLAEFTLVVKQKLPGVYVQPSYRSALMWFGVIFIRHGLYQD--GVFKFTVYIPDNYP-DGDCP---------------------------------------RLVFD-IPVFHPLVDPTSGELDVKRAFAK-WRRNHNHI-WQVLMYARRVFYKIDTA---------------SPLNPEAAVLYEKDIQLFKSKVVDSVKVCT--ARLFDQPKIE--DPYAISFSP--WNPSVHDEAREKM 114662502 Pan_troglodytes EYSLLAEFTLVVKQKLPGVYVQPSYRSALMWFGVIFIRHGLYQD--GVFKFTVYIPDNYP-DGDCP---------------------------------------RLVFD-IPVFHPLVDPTSGELDVKRAFAK-WRRNHNHI-WQVLMYARRVFYKIDTA---------------SPLNPEAAVLYEKDIQLFKSKVVDSVKVCT--ARLFDQPKIE--DPYAISFSP--WNPSVHDEAREKM 114662514 Pan_troglodytes EYSLLAEFTLVVKQKLPGVYVQPSYRSALMWFGVIFIRHGLYQD--GVFKFTVYIPDNYP-DGDCP---------------------------------------RLVFD-IPVFHPLVDPTSGELDVKRAFAK-WRRNHNHI-WQVLMYARRVFYKIDTA---------------SPLNPEAAVLYEKDIQLFKSKVVDSVKVCT--ARLFDQPKIE--DPYAIR------------------ 41056191 Danio_rerio EYSLLAEFTLVIKQKLPGIYVQPSYRSALMWFGVIFIRHGLYQD--GVFKFTVYIPDNYP-DGDCP---------------------------------------KVVFD-TPVFHPLVDPVSGELDVRRAFTK-WRRNHNHI-WQVLMYARTIFYKINTM---------------EPLNPEAAVLYDKDIQLFKSKVVDSVKLCN--SHLFDQPKID--DPYAISFSP--WNPAVHEEAKEKM 47216620 Tetraodon_nigroviridis EYSLLAEFTLVIKQKLPGIYVQPSYKSALMWFGVIFIRHGLYQD--GVFKFTVYIPDNYP-DGDCP---------------------------------------KLVFD-IPVFHPLVDPVSGELDVRKTFTK-WRRNHNHI-WQVLMYARTIFYKINTS---------------EPLNPEAAVLYEKDVHLFKSKVVDSVRLCN--SHLFDPPKID--DPYAISFSP--WNPAVHEEAKEHM consensus/100% ....................h.........h.sh.......h....u.hp.......pas.....P........................................l.F....hhpP.l......h.....b...Wp.....h...hh..h...h........................N..s...h......a...s...h..s......a................................ consensus/95% ............p.....hhh.P...p...a.shh..p.s.a....u.h+h.h.h..pas...p.P........................................l.F....lhpP.l......hp....h.p.Wp...p.h...lh.bhp..h........................N..s..hh..p...a...s...h..s.....ha................................ consensus/90% .b.h.hbh..l.p....slal.Ps.ps.b.W.uhhhlp.G.a....ulh+Fpl.hs.paP...s.P........................................l.F....lhHP.lp..s..lsh...hsp.Wp...p.l..plL.ahp.hF...p....................N.cA..hh.pp...a.pbs.pph..s....plas.......p.....h................. consensus/85% pb.lbhch..l.pp...GlYl.Ps.ps.b.WaGllFl+pG.aps..ulF+Fpl.hs.paP...s.P.......................................pl.F...slaHP.lp..s.plsl...asp.Wc...p.l..plL.ahp.hF.p.p..................shN.cAh.hhpps.p.abpbs.ppl..sp...plas.......p...h.h................. consensus/80% -h.lbhEap.l.pp...GlYlbPohcs.b.WaGVlFl+pGhYps..GlF+Fpl.lPpsaP..ss.P.......................................pl.Fp..slFHP.ls.pospLslpp.asp.Wc.s.p.l.hplL.ah+phFbp.s.................pshN.-Ah.lhppsbp.Fbp+V.pslb.sp...plas.......ps..hphp....p.......bp.h consensus/75% -h.lhhEap.l.pppb.GlYlbPShcs.b.WaGVlFlRpGlYps..GlF+Fsl.lPpsaP..ss.P.......................................plhFp..slFHPhls.pospLslppsFsp.Wc.s.pal.hplL.Yh+plFhp.s.................pshN.-Ah.lhppsbp.Fbp+V.cslb.sp...plaD.......ss.slphs....p...p..hbpph consensus/70% -YplhhEapbl.pppbsGlYVbPShcs.hhWaGVIFlRpGlYps..GlF+Fsl.lPcsaP.ssshP.......................................plhFp.ssVFHPlVcspoGcLclppsFsc.W+.sppal.hplLbYh+plFYpb-.................pslNsEAhsLacpsbp.Fbp+V.csVc.sp...plaDps.....-s.sIpFs....s...+p.hbpph 2. Tsg101 (UEV) FINAL -----HHHHHHHHHHHHHH--------E--------------EEEEEE---EEEEEE------EEEEE-----------EEEEEEEE---------EEEEE--------EEE-EE-----------EEEEEEE-----------HHHHHHHHHHHH---- 51247846 Mus_musculus YSQRQ---DHELQALEAIYGSDFQDLRPDARGRVREPPEIN--LVLYPQGLAGE---------------------EVYVQVELRVKCPPTYPDVVPEIDLKNAKG------LSNESVNL--LKSHLEELAKKQC--------GEVMIFELAHHVQSFLSEHNKSGPSSG 65305040 Theileria_annulata KKHFSNYEMLMCDINGLFGKY--HNFSCSMSTSF---DCHR----LNISGTVPYTFS------------------GFTLNAPLLIQVFSDYPFSCPSFFV-PSRN------TKIVKNHP--NVDLRGNVTLKYLDEWNH---TSKLVQAVDHLCYAFSKISPIITFSNS 71033227 Theileria_parva_strain_Muguga FLLKTNYEMLMIDLDGLFLKY--HNFKCSMSTSF---DGHR----LNIVGLVPYTFS------------------GFTVNAPLVIQVLSDYPFSTPLFWIRGHN-------IKIVKNHP--NVDLRGNVTLKYLDEWNH---TSKLVQAVDCLHYAFNKMSPILTFSNS 67466719 Entamoeba_histolytica_HM_1IMSS YNNFD---SVEQDIRSIHVKY--PQFRVDTWTSR---ISGK--QLLVLTGFLPILFN------------------GRKFGIPLLIGFPYDYPLSPPEIICNISEG------MEIVKKHP--EVDENGVI-RKVGDEWNP---SSDLLMVLESLANSFGRYPPVRQAQNS 67466467 Entamoeba_histolytica_HM_1IMSS YIYFK---RIEIEIQAIQKAF---PMSPTVLPHP---LNYL--QYATLVGTIPIQYL------------------GANYNIPIMIMFPYDYPMKPPFFFTDPSPD------MVIVPQHP--YAMDDRTIIHPLLQRWND---SSNSLDVLVCLQLDFSNYPPLKKRKEN 67481843 Entamoeba_histolytica_HM_1IMSS YLYYL---RVRVEITSVMQYY---KFSATVRTY----ADGI--ILASLVGTIPIVYR------------------GSQFCLPLCIMYPYDYPLSPPLFFTDPTPE------MEVVPGHP--YAMPNIVICHPILDRWSE---NVNTLSVLQVFVKDFSYMPPLRMKTSV 70834758 Trypanosoma_brucei YHKPDVLRRIANDLSLLCNMY---HLSCRVSPW----GGTQ-QLKLCVYGGLPISVRCC----VEAAEQTKNDGGKSKFVLPVQIWLTQQFPIDPPSIFIHCNEPG-----CKVLSNHK--YVDVTGRCHTPELAGWRPT--SSSLVSVISGLKELLEEENISPLCVDR 71659407 Trypanosoma_cruzi_strain_CL_Brener HCKPEVMGRILHDVENLCNAF---HFYCRDATW----GATQ-QEKLCIYGGLPINIKKSNSDHTSLTPDSREQAPPDRYVLPLQIWLTHLYPIEPPLVFLLSAEQG-----CRIASNHK--YVDATGRCHTPELAAWHPV--SSSLCDVVKRLCQLLSAEGLIPLCFGE 71656365 Trypanosoma_cruzi_strain_CL_Brener HCKPEVLGRILHDVENLCNTF---HFYCRDATW----GVTQ-QEKLCIYGGLPITIKKSNSDHTSLTPDSREQAPPHRYVLPLQIWLTHLYPIEPPLVFLLSAEQG-----CRIASNHK--YVDATGRCHTPELAAWHPV--SSSLCDVVKKLCQLLSAEGLIPLCFVE 71413390 Trypanosoma_cruzi_strain_CL_Brener HCKPEVLGRILHDVENLCNTF---HFYCRDATW----GVTQ-QEKLCIYGGLPITIKKSNSDHTSLTPDSREQAPPHRYVLPLQIWLTHLYPIEPPLVFLLSAEQG-----CRIASNHK--YVDATGRCHTPELAAWHPV--SSSLCDVVKKLCQLLSAEGSIPLCFGE 71419864 Trypanosoma_cruzi_strain_CL_Brener HCKPEVLGRILHDVENLCNTF---HFYCRDATW----GVTQ-QEKLCIYGGLPITIKKSNSDHTSLTPDSREQAPPHRYVLPLQIWLTHLYPIEPPLVFLLSAEQG-----CRIASNHK--YVDATGRCHTPELAAWHPV--SSSLCDVVKKLCQLLSAEGSIPLCFVE 71408683 Trypanosoma_cruzi_strain_CL_Brener HCKPEVMGRILHDVENLCNTF---HFYCRDANW----GVTQ-QEKLCIYGGLPITIKKSNSDHTSLTPDSREQAPPHRYVLPLQIWLTRLYPIEPPLVFLLSAEQG-----CRIASNHK--YVDATGRCHTPELAAWHPV--SSSLCDVVKKLCQLLSAEGLIPLCFGE 71666894 Trypanosoma_cruzi_strain_CL_Brener HCKPEVLGRILHDVENLCNTF---HFYCRDATW----GVTQ-QEKLCIYGGLPITIKKSNSDHTSLTPDGREQAPPHRYVLPLQIWLTRLYPIEPPLVFLLSAEQG-----CRIASNHK--YVDATGRCHTPELAAWHPV--SSSLCDVVKKLCQLLSAEGLIPLCFGE 71666892 Trypanosoma_cruzi_strain_CL_Brener HCKPEVLGRILHDVENLCNTF---HFYCRDATW----GVTQ-QEKLCIYGGLPITIKKSNSDHTSLTPDSREQAPPHRYVLPLQIWLTRLYPIEPPLVFLLSAEQG-----CRIASNHK--YVDATGRCHTPELAAWHPV--SSSLCDVVKKLCQLLSAEGLIPLCFGE 42569747 Arabidopsis_thaliana YTDPDQKWLIRKHLTSLLQDY--PNFELSTDTFNHNNGAKV--QLFCLEGSLRIRSS-----------------TTQLPTVQLTIWIHENYPLTPPLVFINPNS-------IPIRNNHP--FINSSGYTKSRYIETWEHP--RCNLLDFIRNLKKVLANDHPFLHTDSL 42759852 Saccharomyces_cerevisiae YNDGR---TTFHDSLALLDNF--HSLRPRTRVFTHSDGTPQ--LLLSIYGTISTGED-----------------GSSPHSIPVIMWVPSMYPVKPPFISINLENFDMNTISSSLPI-QE--YIDSNGWIALPILHCWDPA--AMNLIMVVQELMSLL-HEPPQDQAPSL 45184818 Ashbya_gossypii_ATCC_10895 YYDPR---TTFHDVVIALRAY--KKLRPRTRVFTDEAGRSK--LLLCLYGKVG--------------------------NVDVLIWIPFEYPNVAPHAYIDLEANKG----RSIIVNQR---LDGDGLFYLPILGHWHPQ--NCNVVKLVQDLEQAIMAQEPLRPAAGA 50289071 Candida_glabrata_CBS_138 YKQPK---LVFHDAVQTLTEF--KNLRPRTRVFTDEDGSPR--LLLCLYGGIPIEQ---------------------GLDVPVLIWIPESYPIAKPLLFIDLELLDKD---LQLATNES---VEPDGRVHTRLLRQWNPQ--SANLFNVIQDLADMCNAIAPIQPVTFS Bflo1000012054 Branchiostoma_floridae YTYSS---KTREETAHVISRF--PTLEAAVGEGLAEAGHIGSTVRLCLEGAVPIRHK------------------GKHYGIPMVFVLPRKFPHDPPTCLVRPTCTETGTAPIVESKVCL--CVETSGKITFPYLEDWNYR--TSNLTTLIEVITAHFGERPPVTDEVLA 66807691 Dictyostelium_discoideum_AX4 YRDPL---RISKDLKETFHLF------PNLSPFYENIPNRV--NLICIKGTIPICFK------------------GINYYLPIIVWVPLNYPQEFPTMVLDPTPE------MRIVKNHH--HVNLQGLVYHPYISSWSS---NSTMETRVSQQQQPQPPQNNISPPPYG 116056391 Ostreococcus_tauri SGNAR---VVREQLIDLVETY--PSLTIEEDEFYHVDGSSE--RLLCVRGTIPIDYA------------------NARYNIPVRAYAPSDFPRAAPMFFVTPTSD------MIVKPNHG--CVDASGTVALERVMRWDAR--TGRLSEAAAALARVFSVDPPLFSKPRV 47212486 Tetraodon_nigroviridis YRYPA---ETQADLRRVRSRF--SDLRLHVDYYRFPNKEKK--QLVYLAGTVPVLYE------------------GSCYNIPVSIWLHQTHPVSHPRCYVCPSVS------MVINPACS--CADAAGLLHLDGLRNWTGG--ASSLSLLVSEMVQVFQKDMPLYARSAP 68356050 Danio_rerio YCYPD---EVLEDIRNLSADF--PHLQLYVDYYLYPNNDKK--KLVYLGGTVPVAYE------------------GGKYNIPVCIWIHETHPKNPPRCFVCPSPS------MVINTKSS--NVDANGRVLLHCLSNWKIG--WSNLPIVLEEMIAAFQRETPLFATYPA 68368272 Danio_rerio YCYPD---EVLEDIRNLSADF--PHLQLYVDYYLYPNNDKK--KLVYLGGTVPVAYE------------------GGKYNIPVCIWIHETHPKNPPRCFVCPSPS------MVINTKSS--NVDANGRVLLHCLSNWKIG--WSNLPIVLEEMIAAFQRETPLFATYPA 58264686 Cryptococcus_neoformans_var_neoformans_JEC21 FPARE---TITQEVLYVLQQR--RTLAVKTDAFTFDSGHTA--LLLLLHGTLPVTYR------------------GAIYQIPIHLWVPHEYPRAPPLIFVMPTKD------MGVRK-SR--EVEPSGRVREEVVEGWWRAWEVKNLDMLLKHLADVFSAAPPVYAKPST 46098756 Ustilago_maydis_521 YTDPD---RVFADVDRALIAV--SSLSPKTEVFTFNDGRTQ--LLISLDGTIPVEFR------------------NTTYNIPVAYWIPRDYPREPPMAFVAPTPD------MAIRK-GP--NVDPSGEIGGDYLSRWRSKPEACNLLDLIHDCQHMFGREPPVYAKPKP 70998933 Aspergillus_fumigatus_Af293 YQDPN---RTYYDVANVLAQY--PSLSPRTDVYTYETGFSA--LLLHLTGTIPVSFR------------------GTVYKFPIAVWIPNTYPREPPMVYVTPTQD------MAVRV-GQ--HVTLEGRVYHHYLAHWAEAWDRSNLVDFLMILREVFAKEPPVKYKQPQ 85093619 Neurospora_crassa_OR74A YHDVN---RTYNDVAQVLSHY--PSLSPRTDVHTFPNGASA--LLVHLSGTLPVVFR------------------GTTYRFPISVWIPHAYPREAPLVYVTPTEH------IMVRP-GQ--HVDPQGQVYHPYLAGWSTYWDKSTILDFLAILRDVFAKEPPVIARPPP 42547092 Gibberella_zeae_PH_1 YHDVN---RAYNDVAQALDRF--SSLSPRTDVHTFSNGANA--LLLHLSGTLPVNFR------------------GTTYRFPLSIWVPHAYPREPPMIYVVPTET------MMIRP-GQ--HIDPQGFVYHPYLVRWAEFWDKSNLRDFLNILTDVFAKEPPVVARQPQ Mbre1000007539 Monosiga_brevicollis YREPD---LVLSHLMELTSVY--EDLKIVPGTFYFDNGQQA--DLLYAKGTIPVRVK------------------GKIYNFPVYIWLFREHPTYAPMCYVVPTRS------MKLPDPGKNSHVGPDGRVYLPYLHNWNRS--TSTLRELVDNMCAVFSQHTPLYSRAQP Tpseu1000001553 Thalassiosira_pseudonana YRDPQ---RVLRDAETLLNSPLGHHLRPTTEPLMLNDGTST-PPVLMLSGTLPMTYR------------------GVTYNLPIDMYLPPPYPLRPPTVFVRPVAS------MAIKENHR--HVGLDGKVYLPYLHEWRPQ--SHDLRELAVWMSSTFGDEPPCYAKPAN Ptri1000004948 Phaeodactylum_tricornutum YRDPA---RVDRDATAVLRSSVGEHLTPIASELYEDKGSHS--TVLVLQGTIAMDFR------------------GTTYQLLMDVYLPGGYPQRPPVCYVRLADH------MYLKENHR--HVASDGKVYLPYLHEWTPQ--QHNLVELVIQMSSVFSADPPVFSRAPA Ptri1000008763 Phaeodactylum_tricornutum YRDPA---RVDRDATALLRSSVGERLTPIAAELYEDNGSHS--TVLVLQGMIAVDFR------------------GTTYRQLMEIYLPGRYPQRPPVCYVRLAEH------IYLKNNHE--HVGSDGKVDIPYLDEWTSH--HHNLVELVIQMSSVFSADPPVFSRTSA 18399596 Arabidopsis_thaliana YEESN-KWLIRQHLLNLISSY--PSLEPKTASFMHNDGRSV--NLLQADGTIPMPFH------------------GVTYNIPVIIWLLESYPRHPPCVYVNPTAD------MIIKRPHA--HVTPSGLVSLPYLQNWVYP--SSNLVDLVSDLSAAFARDPPLYSRRRP 15240732 Arabidopsis_thaliana ENTKS---LIRQHLLNLISSY--TSLDPKTATFTHNDGRSV--ILLQADGTIPMPFQ------------------GVSYNIPVVIWLLESYPQYPPCVYVNPTRD------MIIKRPHS--NVSPSGLVSLPYLQNWIYP--SSNLVDLASHLSAAFSRDPPLYSQRRP Psoj1000006153 Phytophthora_sojae YPQSS---RVRGDVYNLLGQI--PSLQPNCGTFAHNDGTTS--TLLNLEGTIPIFYR------------------GSQYNIPVEFWVVETYPLAPPVCFVRPTAD------MMVKPGHP--HVTSDGYVKIPYTSDWRP---DFTLLELVAHMCSIFGNMPPVFRRPAN Pram1000004476 Phytophthora_ramorum YPQSS---RVRGDVYNLLGQI--PSLQPNCGTF----GSTL----LNLEGTIPIFYR------------------SNQYNIPVEFWVVETYPLAPPVCFVRPTAD------MMVKPGHP--HVTSDGYVKIPYTSDWRS---DFTLLELVAHMCSIFGNMPPVFRRPAN 68405668 Danio_rerio YDHRT---EVLSEISLVLSHY--QHLEPVLEKFVFNDGTAK--NLINLTGTIQVFYE------------------RKQYNIPVTLWLRESYPRTAPICYLKPTCE------MVVVT-SK--YVNSNGEIRMPYLNEWKHT--KCDLHSLIQVMMATFSEVPPLRMHLDQ 68405637 Danio_rerio YDHRT---EVLSEISLVLSHY--QHLEPVLEKFVFNDGTAK--NLINLTGTIQVFYE------------------RKQYNIPVTLWLRESYPRTAPICYLKPTCE------MVVVT-SK--YVNSNGEIMMPYLDEWKHT--KCDLHSLIQVMMATFSEVPPLRMHLDQ Bflo1000036388 Branchiostoma_floridae ---------TQLDVRNAVTNF--KDLKVHVDVFSTPGASKR--ELLCLKGTVPVIYK------------------EGTYNIPLKVWLFEDHPNASPVCYIVPTNN------MRINDRCK--HVNANGKVQLPYLDDWKDA--NSDLFSLIQVMRIV------------- 71982582 Caenorhabditis_elegans GKYAD---SAKKDIIGALSQF--KDLSPGTDTFMFPDGKRR--TAFRLKGTIPVYYK------------------GACYNIPVTVYLWDTHPYYAPICYVNPTST------MVIKE-SE--HVNKEGKVFLPYLNEWRFP--GYDLSGLLQVMAMVFQEKCPVFARSAA Bflo1000042438 Branchiostoma_floridae YKHKD---VTQKEVVATLKHF--EDLKPVVDTYVFNDGSQT--ELLCLCGTIPVKIK------------------ANTYNIPVCIWLTEFHPEIPPLVYVRPTGN------MVINE-SK--HVDMNGRVYMPYLHEWTHVS-TKCYSGTNSHM---------------- 23943814 Homo_sapiens YKFRD---LTVEELRNVNVFF--PHFKYSMDTYVFKDSSQK--DLLNFTGTIPVMYQ------------------GNTYNIPIRFWILDSHPFAPPICFLKPTAN------MGILV-GK--HVDAQGRIYLPYLQNWSHP--KSVIVGLIKEMIAKFQEELPMYSLSSS 114636481 Pan_troglodytes YKFRD---LTVEELKNVNVFF--PHFKYSMDTYVFKDSSQK--DLLNFTGTIPVMYQ------------------GNTYNIPIRFWILDSHPFAPPICFLKPTAN------MGILV-GK--HVDAQGRIYLPYLQNWSHP--KSVIVGLIKEMIAKFQEELPMYSLSSS 114636489 Pan_troglodytes YKFRD---LTVEELKNVNVFF--PHFKYSMDTYVFKDSSQK--DLLNFTGTIPVMYQ------------------GNTYNIPIRFWILDSHPFAPPICFLKPTAN------MGILV-GK--HVDAQGRIYLPYLQNWSHP--KSVIVGLIKEMIAKFQEELPMYSLSSS 114636495 Pan_troglodytes YKFRD---LTVEELKNVNVFF--PHFKYSMDTYVFKDSSQK--DLLNFTGTIPVMYQ------------------GNTYNIPIRFWILDSHPFAPPICFLKPTAN------MGILV-GK--HVDAQGRIYLPYLQNWSHP--KSVIVGLIKEMIAKFQEELPMYSLSSS 109461971 Rattus_norvegicus YKFRD---LTVEELKTVNMSF--PHFRYSMDTYVFKDTSQK--DLLNFTGTIPVMYQ------------------GKTYNIPIRFWILDSHPFAPPICFLKPTAN------MEISV-GK--HVDAKGRIYLPYLQNWSHP--KSVIVGLIKEMIAKFQEELPLYSVPSS 109458775 Rattus_norvegicus YKFRD---LTVEELKTVNMSF--PHFRYSMDTYVFKDTSQK--DLLNFTGTIPVMYQ------------------GKTYNIPIRFWILDSHPFAPPICFLKPTAN------MEISV-GK--HVDAKGRIYLPYLQNWSHP--KSVIVGLIKEMIAKFQEELPLYSVPSS 70780379 Mus_musculus YKFRD---LTVEELKNVSVSF--PHFRYSVDTYVFKDTSQK--DLLNFTGTIPVMYQ------------------GKTYNIPIRFWILDSHPFAPPICFLKPTAN------MEISV-GK--HVDAKGRIYLPYLQNWSHP--KSAIVGLIKEMIAKFQEELPLYSIPSS 47215244 Tetraodon_nigroviridis YKFRD---VAIEELQKFHRIF--PEMIPSTGTYTFTDSTQK--DLLKLIGNLPVQYE------------------GRTYNFPVQLWLLDSFPFTPPICLLRPTAN------MVIRE-GK--HVDARGRIFLPGLQNWDYP--KSSVVSLLKEMTAKFEEDPPLSSKAPA 68405599 Danio_rerio YKFRD---VAVEELQKVYRVY--PDMKIMAGTYTSSDSLQK--DLLKLVGNIPVVYQ------------------GRFYNIPILLWLLDSFPFTPPICYLRPTSS------MVIRE-GK--HVDSKGRIHLPALHNWDHP--KSSVNALLAEMIGKFEEEPPLGTKSSA Nvec1000024634 Nematostella_vectensis HPHAE---LCKRQILAAMAVY--KDLRPSMQKFVHNDGRES--ELLSLDGTIPVSFR------------------GSTYNIPVCIFLQETHPFIPPLVYVRPTST------MAIKV-SK--HVDNNGRVFLPYLTDWSHP--RSEIAGLIQILCCVFAEEPPVYAKPNN ci0100154515 Ciona_intestinalis YTQLD---LAKKDALSLMHHY--KDLQPKMDRFIYNDGSTK--NLMSLCGTIPVNYK------------------GTTYNIPIAIWLQESHPQLPPLCFVKPTSN------MQVKQ-GK--HVDANGRVYLPYLNEWTPH--RHTLIGLTQVLVAMFGEEPPVFAKPSG Bflo1000047714 Branchiostoma_floridae YKNKD---VTKREVLQVFTSY--GDLKPNLQPHIFNDGSRK--DLLTLEGTIPVQYR------------------GSTYNIPVCMYLMETHPFNPPLCYVRPTAT------MEIKT-GK--HVDSSGRVYLPYLHEWKHG--KSDLIGLIEVMRIVFGEEPPVFAKSAA 47203297 Tetraodon_nigroviridis YKYRD---LTVREITNVISQY--KDLKPVMDAYVFNDGSSR--DLMSLTGTIPISYR------------------GNVYNIPVCLWLLDTYPFNPPICFVKPTSA------MMIKT-GK--HIDANGKIYLPYLHEWKHVSLSFSLCVFGVDGVFVVANQTSLCFSVTA 68364140 Danio_rerio YKYRD---LTARDITNVTNIY--KDLKPMMDNYVFNDGSTK--ELLSLTGTVPVNYR------------------GNIYNIPVCLWLLDTYPYNPPICFVKPTSA------MMIKP-GK--HVDANGKVYLPYLHEWKPP--QSDLYGLIQVMIVVFGEEPPVFSRPSS 68364142 Danio_rerio YKYRD---LTARDITNVTNIY--KDLKPMMDNYVFNDGSTK--ELLSLTGTVPVNYR------------------GNIYNIPVCLWLLDTYPYNPPICFVKPTSA------MMIKP-GK--HVDANGKVYLPYLHEWKPP--QSDLYGLIQVMIVVFGEEPPVFSRPSS 50344832 Danio_rerio YKYRD---LTVREITNVISQY--KDLKPVMDAYVFNDGSSR--DLMSLTGTVPVSYR------------------GNVYNIPVCLWLLDTYPYNPPICFVKPTSA------MMIKT-GK--HIDANGKIYLPYLHEWKHP--QSDLYGLIQVMIVVFGEEPPVFSRPTT 47217083 Tetraodon_nigroviridis YKYRD---LTVREITNVISQY--KDLKPVMDAYVFNDGSSR--DLMSLTGTIPISYR------------------GNVYNIPVCLWLLDTYPFNPPICFVKPTSA------MMIKT-GK--HIDANGKIYLPYLHEWKHP--QSDLYGLIQVMIVVFGEEPPVFSRPTI 5454140 Homo_sapiens YKYRD---LTVRETVNVITLY--KDLKPVLDSYVFNDGSSR--ELMNLTGTIPVPYR------------------GNTYNIPICLWLLDTYPYNPPICFVKPTSS------MTIKT-GK--HVDANGKIYLPYLHEWKHP--QSDLLGLIQVMIVVFGDEPPVFSRPI- 114636476 Pan_troglodytes YKYRD---LTVRETVNVITLY--KDLKPVLDSYVFNDGSSR--ELMNLTGTIPVPYR------------------GNTYNIPICLWLLDTYPYNPPICFVKPTSS------MTIKT-GK--HVDANGKIYLPYLHEWKHP--QSDLLGLIQVMIVVFGDEPPVFSRPIS 114636478 Pan_troglodytes YKYRD---LTVRETVNVITLY--KDLKPVLDSYVFNDGSSR--ELMNLTGTIPVPYR------------------GNTYNIPICLWLLDTYPYNPPICFVKPTSS------MTIKT-GK--HVDANGKIYLPYLHEWKHP--QSDLLGLIQVMIVVFGDEPPVFSRPIS 11230780 Mus_musculus YKYRD---LTVRQTVNVIAMY--KDLKPVLDSYVFNDGSSR--ELVNLTGTIPVRYR------------------GNIYNIPICLWLLDTYPYNPPICFVKPTSS------MTIKT-GK--HVDANGKIYLPYLHDWKHP--RSELLELIQIMIVIFGEEPPVFSRPTV 48374087 Rattus_norvegicus YKYRD---LTVRQTVNVIAMY--KDLKPVLDSYVFNDGSSR--ELVNLTGTIPVRYR------------------GNIYNIPICLWLLDTYPYNPPICFVKPTSS------MTIKT-GK--HVDANGKIYLPYLHDWKHP--RSELLELIQIMIVIFGEEPPVFSRPTV 17648053 Drosophila_melanogaster YKYVA---ATKKDVVDVVTSF--RSLTYDLQRFVFNDGSSK--ELFTIQGTIPVVYK------------------NNTYYIPICIWLMDTHPQNAPMCFVKPTPT------MQIKV-SM--YVDHNGKVYLPYLHDWQPH--SSDLLSLIQVMIVTFGDHPPVYSKPKE 58388004 Anopheles_gambiae_str_PEST YRDPT---TTKKDVMNALRIY--HGLQHRVEEYVFNDGSTK--MLLNLHGTIPVKYK------------------GNTYNIPICIWLMDTHPKNAPICYVKPTPE------MRIKV-SA--YVDFNGKIYLPYLHDWNPK--NADLLDLIQIMSVTFGEVPPVYSKGPE 91080041 Tribolium_castaneum YHNPD---VTFRDVLNAVNHY--QGLLPFHEYYTFTDGSTM--ELVNLTGTIPVIYK------------------GNTYNIPICIWLMDTHPKNAPICYVKPTAD------MSIKP-SM--FVDQNGKIYLPYLHDWKTHDGTSDLLGLIQVMIVTFGDQPPVFARPKD 66531777 Apis_mellifera YQNSD---ITKKHVMKVLNLY--KGLKCNVEPFVFNDGSRK--ELFNLQGTIPVSYK------------------GNYYNIPICIWLMDTHPNNAPMCYVKPTAD------MSIKV-SM--FVDHNGKIYLPYLHDWLPH--SSDLLSLIQIMIVTFGEQPPVYAKPRQ consensus/100% ............p................................h...G.h..............................h.h.h...aP...P.h....................................................................... consensus/95% ............p...h........h...........s.......h.h.G.lsh........................h.h.l.hbh...aP..sP.hhl...........h.l........h..pG.h....h..W........h..hh..h...h............ consensus/90% a...........c...h...h...ph...........s.......h.h.Gslsh.hp.....................a.hPl.hhl...aP..sP.hal.ss........h.l........ls.pGbh....l..W.......sl..hl..h...h.....h...... consensus/85% ap..p......p-h..h...a...ph......a....s.p....bhph.GslPl.hp..................s..YslPl.hal.p.aPb.sP.hal.ss........h.l........ls.pGbl...bL..Wp....ppsl..hl..h...h....sh...... consensus/80% ap..p......p-l..l...a...php.....a....usp....hlpl.GslPl.hp..................s..YslPl.hal.ppaPb.sPhhalpss.p......h.l........VsspGbl..PhL.pWp....pssl..llp.h...h...sPl...... consensus/75% Yp..p...bs.p-l.plh..a...phps..ssa...suspp...llsl.GolPl.ap..................sppYslPlplWl.csaPb.sPhhalpPo.p......h.l.s..p..aVsssGcl.bPhLppWp....pssLh.llp.h...F.ppsPl..p.s. consensus/70% Yp..s...bs.p-l.slhs.a...phps..ssa...sGspp..pLlslpGTlPl.ac..................uppYslPlplWl.csaPb.PPhhalpPoss......M.I.s..p..aV-ssGcl.hPhLppWp....pusLhsllp.h...FspcsPlb.pss. 3. RWD FINAL -HHHHHHHHHHHHH----EEEE-----------------------EEEEEEE-----------------EEEEEEEE-----------------------------------E-----------EEEE-----------HHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHHH--- 45550424 Drosophila_melanogaster MDALQDEVESLEAILMDDVCIKRT---------------PSGEVEQIETTVLPL-TGEEEEQQ-----YVCVTLQVH-----------------------------------PTPGYP-EESPTFKLL----RPRGLD-DARLEAIRSACNAKIKESI--GFPVVFDLIEVVREHLSG-SNLPSGQCVVCLYGFADGDEFTRTECFHYLHSYC- 91094661 Tribolium_castaneum NERVQEEIEALEAILMDDISVSYG---------------DNGCPELVKSTIFPS-TADDTDKQ-----YVCVTLEVK-----------------------------------LPGDYP-DCEPLVQLR----NPRGLD-DTTLNHLYRAIKDKCNEFI--GQPVIYELIELIRENLTE-SNLPTCHCAVCLYGFSEGDSFTKTQCFHYFHSYC- 66565937 Apis_mellifera DERVTDEIEALKAILLDDELNIKE--------------NDRGEPEYIETVLFPS-TGEDSQSQ-----YVCVTLIVQ-----------------------------------LPAGYP-DISPTINLR----NPRGLD-ENTVRLMQSDAEAKCKNFI--GQPVMFELIELIREHLTR-SNLPTDQCAICLYGFREGDEFTKTECYHYFHSHC- Bflo1000014876 Branchiostoma_floridae NSTLQVELEILESIYLDELLVEHG---------------DSGSPSRVEITIHPA-TADNVEAQ-----FVRLTLSIT-----------------------------------LPREYP-QELPSLAIQ----NPRGLS-ELQVNSLYKSLQHLAQERQ--GESMLYEIIEFAKDSLTH-NNLPSCECAICLYPFNEMDDFTKTECYHYFHCHC- Bflo1000014871 Branchiostoma_floridae NSTLQVELEILESIYLDELLVEHG---------------DSGSPSRVEITLHPA-TADNVEAQ-----FVRLTLSIT-----------------------------------LPREYP-QELPSLAIQ----NPRGLS-ELQVNSLYKSLQHLAQDRQ--GESMLYEIIEFAKDSLTH-NNLPSCECAICLYPFNEMDDFTKTECYHYFHCHC- 114583343 Pan_troglodytes DWVLPSEVEVLESIYLDELQVIKG--------------NGRTSPWEIYITLHPA-TAEDQDSQ-----YVCFTLVLQ-----------------------------------VPAEYP-HEVPQISIR----NPRGLS-DEQIHTILQVLGHVAKAGL--GTAMLYELIEKGKEILTD-NNIPHGQCVICLYGFQAVGVQCPVCREPLVYDLA- 10946616 Mus_musculus DWVLPSEVEVLESIYLDELQVMKG---------------NGRSPWEIFITLHPA-TAEVQDSQ-----FVCFTLVLR-----------------------------------IPVQYP-HEVPQISIR----NPRGLS-DEQIHKISQALGHVAKEGL--GTAMLYELIEKGKEILTD-NNIPHGQCVICLYGFQEKEAFTKTPCYHYFHCHC- 114583341 Pan_troglodytes DWVLPSEVEVLESIYLDELQVIKG--------------NGRTSPWEIYITLHPA-TAEDQDSQ-----YVCFTLVLQ-----------------------------------VPAEYP-HEVPQISIR----NPRGLS-DEQIHTILQVLGHVAKAGL--GTAMLYELIEKGKEILTD-NNIPHGQCVICLYGFQEKEAFTKTPCYHYFHCHC- 34878787 Homo_sapiens DWVLPSEVEVLESIYLDELQVIKG--------------NGRTSPWEIYITLHPA-TAEDQDSQ-----YVCFTLVLQ-----------------------------------VPAEYP-HEVPQISIR----NPRGLS-DEQIHTILQVLGHVAKAGL--GTAMLYELIEKGKEILTD-NNIPHGQCVICLYGFQEKEAFTKTPCYHYFHCHC- 47217977 Tetraodon_nigroviridis ---LLSEIEVLQSIYLDELDVDRR----------------EDGAWEVSLVLHPS-TAEDSVSQ-----FVRLTLTLTLDQQVYLPPAGVSKCVWSVLTTSICSLSLSLTLVLFSPQYP-LTSPNISIH----NPRGLS-DDKLSSVQKCLQLEAQSCL--GSPVLYQLIEKAKEILTE-SNIPHGSCAICLYDFQEGEAFTKTSCYHYFHCHCL 41056245 Danio_rerio ESDVLSEIEVLQSIYLDELNVTQN----------D------EGGWTVSLVLHPS-TAEDCLSQ-----FVRLTLTMD-----------------------------------LDSQYP-YSSPYISIH----NPRGLS-DDKLLSLQKSLQMEAEECV--GTPVLYQLIERAKEILTD-SNIPHGNCVICLYDFKEGEVFTKTSCYHYFHSHCL 42733778 Dictyostelium_discoideum KDQQDMELEALQAIFNDEFITMDPIIINRNMDTIIEGIKPIRESARFRITIKPY-VGDDEK------CFVSIYLVVG-----------------------------------FPEKYP-VVLPSIQVL----VNKGLP-QKKAIELEEKLIRESQGKI--GNIMIFDLCEIAKDFLNE-NNFEPTQSIHHDFRSHQQKISIEISTNEQQQQQL- 30694992 Arabidopsis_thaliana NELLSEEITALSAIFQEDCKVVSD----------------SRSPPQIAIKLRPY-SKDMGYED----TDISAMLIVR-----------------------------------CLPGYP-YKCPKLQIT----PEQGLT-TADAEKLLSLLEDQANSNAREGRVMIFNLVEAAQEFLSEIIPESHDEESVPCLTAHRSTQFIEQPMLSNIAKSCS 50424915 Debaryomyces_hansenii_CBS767 EHRQQDEVNSIASIYGDIFEDITP----------SGLVWNKKPSPHFKILLLSS-ECADRPT-------ISLILDIE-----------------------------------FTSTYP-LSPPKIKIL----EPKNIL-KTRLQAIEKRIKDLIKEYP--EEEVSFTIICDVKDMLDD-FQQTTEK---VLSLEEERELRLKNERLQLEKLEDM 50305443 Kluyveromyces_lactis_NRRL_Y_1140 YEIQKLETEALQSIYMDDFTDLTK----------RHSSWDKHPQIIFEIGLRSQ-ESDPAE--------CSLNLHVV-----------------------------------LPSTYP-HAAPQITFK----NVINVP-DGQLNKLRQEIKEIHQRSK--GQEIIYEIIIATQDVLED-SQKHVVTDSLEDQRLQRIRDEEARMKLEEEQRQK- 45184741 Ashbya_gossypii_ATCC_10895 YEIQQNELEAMESIYAGDFVDLTQ----------RKASWDKKPQRVFEISLRSI-EKEPAE--------SSLTLHFV-----------------------------------LVSTYP-HAAPQITFK----KVQNVL-DSQLALLRDDIRRIHKSAR--GQEILFEITTRVQEVLDE-SQANANTQSLEAERLQRLEDEKVKLKLAEQENKE- 6320489 Saccharomyces_cerevisiae YEIQCNELEAIRSIYMDDFTDLTK----------RKSSWDKQPQIIFEITLRSV-DKEPVE--------SSITLHFA-----------------------------------MTPMYP-YTAPEIEFK----NVQNVM-DSQLQMLKSEFKKIHNTSR--GQEIIFEITSFTQEKLDE-FQNVVNTQSLEDDRLQRIKETKEQLEKEEREKQQ- 50288841 Candida_glabrata_CBS_138 YEIQCNELEAIKSIYMDDFEDRTR----------KKSKWDKQPQIQFDISLRSM-DKDPEE--------LSLVVHFA-----------------------------------MTPMYP-HTAPEISFG----ERSNVT-DTQIKVLENRLKEIFKELK--GQEMVFEITSYIQEKLDE-FQNNVTTSSLEDERLQRIKEEEEKMIREEEERKR- 46121513 Gibberella_zeae_PH_1 EELQQNEILALEAIYGDDFVMHSE----------TQSAWKKTEPH-FDIRIKAS---KDED--------FACTLSFV-----------------------------------MTATYP-KSPPLVTLK-----KHDLK-EVTQFKIQKFLETKPKIFAQDEQEMIDQIVEGVRDILED-AAQAKADGKHLPSLEEERERHEAYLAKLAQEKKE- 85103595 Neurospora_crassa_OR74A QEVQESEVMVLQAIYGEDFTQHEA----------AHGAWQK-SEPRFDIKIKPS---SDQE--------LSVTLGVV-----------------------------------MVATYP-KTPPLLTIK----DDHSLR-ESTKFKIQKFVETQPKIYAQAEQEMIDQIVEGIRDILEE-AAQKKVQGLEIPSLEEERAAHEAELARLAQSEKE- 63054755 Schizosaccharomyces_pombe_972h_ KEIQENEIEALKAIFMDDFEELKV-----------RNAWNVTNGHVYCIHLCSR-SANSKS-------IAKLDLCIE-----------------------------------LGRSYP-YVKPVIKLQ----NGENVL-NSQIRFLLDKLDTKAKDLL--GEEMIFELASIVQDYLND-WQSDLSS--QFASLEEERAVQLKHDRERAEVDLQL 70998472 Aspergillus_fumigatus_Af293 REIHQNEAEALRSIYGDDFEEIEH----------RPSAWQQSADVAFKLHLRAS---SNPE--------VRVDLLVE-----------------------------------LPTTYP-KTYPNLSLG----NSENIR-HRARLKIQDIIRNKPKELL--GSEMIYELAVSIQDVLED-VAQAQAQDKDLPSLEEERMVQEAAANQRAELERQE Mbre1000006089 Monosiga_brevicollis AQEQADELTALESIFIDELEILKR------------------DPICYNIHIVSEESPSADDPN----LTSKVTLNIT-----------------------------------YTATYP-EEGPNWKLI----DMSNIS-DELATRLDEVMLTTIEENL--GMGMAFAMQAALKEAVDD-YNAENRDA----AVRERQAQKEAEEQAELERLTAG 116056613 Ostreococcus_tauri EEEQAMEIEALESIFMDDMKRLPE---------NSGDGIAHASSSCYQVEISAYGENDSEPEEGKERDDAVLGLVFA-----------------------------------HTERYP-DEPPLLKCR----SVRGLR-EGELGEISALVLEQSETSV--GQAMIFDLTQTCKDWMRR-RAGASDF-----VDEETEEELRARLEHEAEERLRQ 66822007 Dictyostelium_discoideum_AX4 SEEKDMEVEALSAIYMDHFNSIDS--------------------DHVQITLLPN-PGGDEP------NFVAIILDII-----------------------------------FSVDYP-NSIPKIDLI----PHLGLE-KEDILELQGKVIQEAENNI--GMSMIFILCGLIKEWVDE-NNIDPD--------LEESEQSSSSEAEDDERPFEG 15218023 Arabidopsis_thaliana KQEQEMEIEALEAILMDEFKEIHS----------S-ESGLNTSNRCFQITVTPQ-DDELEELAI---PPVQLALVFS-----------------------------------HTENYP-DEAPLLDVK----SIRGIH-VSDLTILKEKLEQEASENL--GMAMIYTLVSSAKDWLSE-HYG---------QDDAAEFAEVEAAKEDEVIVPHG Pram1000014635 Phytophthora_ramorum KEEQAMEVEALEAIYMDEFTKLSD------------------EPLTYQVHVVPN-Q-DGEN------NFVAMLLKAE-----------------------------------IPDTYP-DVEPKLDII----VKKGLA-DSQLKEIKQLLDQQVEENM--GMAMMYTLCEAVREYLDG------SE---HQEMLRRMELKKKKEDKVEADKLEQ Psoj1000001657 Phytophthora_sojae -----MEVEALEAIYMDDFTKLSD------------------EPLTYQVHVVPN-QD--GENN-----FVALLLKAE-----------------------------------IPETYP-DVEPKIEVV----VKKGLA-DSQVKEIKQLLAQQVEENM--GMAMMYTLSEAVREYLVE-NNREGNDGSEHQEMLRRMELKKKKEEKVEADKLE- ci0100146916 Ciona_intestinalis EEEQRDELEALESIYPDSFTIVST------------------DPTAFSLKISTE-ESILDADK----AIISVTLKFT-----------------------------------YAKKYP-DEAPLFEVL----EEENFPFENTGEKLEVLIEEQIEENL--GCVMVFTIMSAVQEFLNQ-EDDRFKQ-----EIQEEEERKEREMIEEQERKCAG 28572074 Drosophila_melanogaster KEDQTNEVEALDSIYCGDMEILAT-----------------EPHHKFQIPIATE-EYSSEEPE----KGLACKLVFT-----------------------------------FTATYP-DGAPVVEIE----EPENFE-DMFETRLLEHLQKTIEENL--GMEMIFSLVSSAQEWLNE-RWDEHKF-----HQEELREQKLREIEEEERKKFEG 58383719 Anopheles_gambiae_str_PEST HEDQCNEIEALDSIYCGELEVLET------------------EPHKFRLPIATT-EYDPEVET----EGLSCKLLFT-----------------------------------YTAKYP-DTAPLVEIE----EPSNFH-DGYEEELLEHIHKTIEENL--GIEMIFSLVSSAQEWLNV-KYDELKN-----AAETAKEEAKRKVEEEEMKKFEG 48115538 Apis_mellifera KDEQLNEIEALESIYCGELEVLAT-----------------EPFYTFSIPIKTE-EYESGTEN-----GLSCCLQFT-----------------------------------YTEKYP-DEPLLISIE----EPENFE-EESSEKLKKHLIEQMSENL--GMVMVFTLVSAAQEWLNV-QWDKIKL-----NREECEAQKLREEEEAERRKFEG 91091940 Tribolium_castaneum TEEQKGEIEALESIYFGDLTLLGT-----------------EPYHKFSVQIKSE-EYDPETEN----TGLACDMVFT-----------------------------------YTPKYP-DEAPVIELE----NCDNFE-DGYEAQLLDFLKEQVQENL--GMVMIFTLVSSAQEWLNV-RWEGVKK-----ERDEEAARKLREEEEAERKRFEG 47218791 Tetraodon_nigroviridis AEEQRNELEAIESIYPESFTVLSD------------------DPTSFTITVTSD-PGDSGE-------TVEATIKFT-----------------------------------YVEKYP-DEPPLWEIH----SQENLE-ERDAQDILTLLQQQVEENL--GMVMIFTLVTAVQEKLNE-IVDVMKN-----EEKTEGGEAGRQG-QADGSLLCQ 68435817 Danio_rerio GEEQRNELEAIESIYPDSFTVLSD------------------APTSFTITVTSD-TGENEE-------TLELTLKFT-----------------------------------YVEKYP-DEPPLWEIF----SQENLE-DSDAEEILTLLKQQAEENL--GMVMIFTLVTAVQEKLNE-IIDQIKS-----RREEEKQRKQKEAEEAEKR---- 21735427 Mus_musculus GEEQRNELEALESIYPDSFTVLSE------------------SPPSFTITVTSE-AGENDE-------TVQTTLKFT-----------------------------------YSEKYP-DETPLYEIF----SQENLE-DNDVSDILKLLALQAEENL--GMVMIFTLVTAVQEKLNE-IVDQIKT-----RREEEKKQKEKEAEEAEKKLFHG 55953123 Homo_sapiens GEEQRNELEALESIYPDSFTVLSE------------------NPPSFTITVTSE-AGENDE-------TVQTTLKFT-----------------------------------YSEKYP-DEAPLYEIF----SQENLE-DNDVSDILKLLALQAEENL--GMVMIFTLVTAVQEKLNE-IVDQIKT-----RREEEKKQKEKEAEEAEKQLFHG 114609025 Pan_troglodytes GEEQRNELEALESIYPDSFTVLSE------------------NPPSFTITVTSE-AGENDE-------TVQTTLKFT-----------------------------------YSEKYP-DEAPLYEIF----SQENLE-DNDVSDILKLLALQAEENL--GMVMIFTLVTAVQEKLNE-IVDQIKT-----RREEEKKQKEKEAEEAEKQLFHG 22129753 Rattus_norvegicus GEEQRNELEALESIYPDSFTVLSE------------------NPPSFTITVTSE-AGENDE-------TVQTTLKFT-----------------------------------YSEKYP-DEAPLYEIF----SQENLE-DNDVSDILKLLALQAEENL--GMVMIFTLVTAVQEKLNE-IVDQIKT-----RREEEKKQKEKEAEEAEKKLFHG Nvec1000002214 Nematostella_vectensis EEEQRHEIEAIESIYPEEFTIISE-----------------SAPHSFQIHLESS-CEDKEDNTI---ITVSVQLQFT-----------------------------------FVEKYP-DEPPVVEVT----SSEGLE-DDDINQLTELLVQQSEENL--GMVMVFTLVSCAQEKLEE-IAEGIKK-----HRQEERIRKQKEVEEAEKRKFTG Bflo1000027816 Branchiostoma_floridae EEDQRNEIEALESIYPDIFEILET------------------EPPCFRLSVLAE-ADSYEECD-----PLGVDLQFT-----------------------------------YVPTYP-DTPPDMEVL----SPQNLT-EEDVSTIQELLQQQAEENL--GMVMVFTLVSAVQERLSE-LVEEKKK-----QAEEERDRKQREEEEKEKKRFEG Psoj1000006922 Phytophthora_sojae RELQAQELEVLQAIYDQDLETHSS-----------------TPLYTYIFAIRLL-CEATPGSA----TTAEVLLHFD-----------------------------------LTRAYPLKQPPNITVE----PKHGLS-DTETQKLQRGMEQLALEKL--GDAVVYDLVVFATDFIQD-HLKDQSS--FFDQMMTRQQDRETREKQAEDALLQQ Pram1000005575 Phytophthora_ramorum RELQAQELEVLQAIFDRDLVSQSA-----------------TPLYTYIFAIRLA-CEATPGSA----TTAEVLLHFD-----------------------------------LARAYPLQQPPNITVE----PKHGLS-DTETQKLQRGMEALALEKV--GDAVVYDLVVFATDFIQE-HLKDQSS--FFDQMMTRQQDRETREKQEEDALLQE 46100672 Ustilago_maydis_521 TELQRTEIEAIESILDQDFTRVEQ---------KAWKGAAVSQLHEFQVVIRPD-EERLKP-------LVSAYVSFR-----------------------------------LPKNYP-LVTPTIIVKLNDGRHKGLS-ASHLTSLGGALNRKAKSLI--GAEMIWELITVAQDFISLNNTIPKEVKDGAPSLSLEEEMQKRAKEQQERQRAEQ 17537697 Caenorhabditis_elegans QHLQEEEKLALDAVYLNQITYIKA-------------HWHVWVPTNCHILLKAL-DSCFLNGDPLGKSKLSVILHVK-----------------------------------CSEDYP-QRKPAVDLL----DPQGLS-KEDVQNLLTILRQMADTWE--GCVVIAELAHRVREFLTD-HTPRPAGS-FHDDMLANKVRTEAEKQRKRLDTEQK 58262126 Cryptococcus_neoformans_var_neoformans_JEC21 KALQEEELESLRAIYPDEWHDIPP-------TKTAWGTEVDAGWWEVKICAMED-------------ERVNVILKGK-----------------------------------MVQAYP-HQVPPLLLR----EPEYLT-ANHVQQLHKIIQDKARSKV--GEVMIFELIDTVRDFISE-NHAPLPSPGDVNLMEEKARREEAQRAAEEASRATE Ib#25.m01824 Toxoplasma_gondii --EQALELEALEALFTREEEFEKL------------------SPTSFRLSLLPCQEGEGTD-------HVAVTLLVE-----------------------------------YVPTYP-DDPPHWEIQ----SSKGLD-TEALEELKKEVSEAMKREV--GAPMMYTIAEFVQDWLRE-RNKPQQS--MYDQMMSRENAAVVEDESDEEEEEGD 28828088 Dictyostelium_discoideum NQSQLEEIECLKSIYRDEFEELPT----------------EAVCRRFKITLIPH-PTNQYQ------NFCAVRLYVK-----------------------------------YSPNYP-VSLPTIELE----KVRGLS-DDLIGELSFLLSQKMAP----GEIIIFELCQAVQEFLLL-YNKETVS--LHEEMIKRLNTNINRVNSSDDINNNN 58393600 Anopheles_gambiae_str_PEST RERQENELEVIKSIFHDEVEDLRP-------------RTGKWKPLELRLHLTPQ-RGSAKE------AYVKADLYVT-----------------------------------CSPKYP-KCPPKLELK----HAVGLS-DSLVRELTEKLEQLADELK--GEVMIFELANTVQAFLHQ-HNVPPKGS-FYDEMLANQQKQALARQNTLQAEETL 17137328 Drosophila_melanogaster RERQAQELEVIKSIFGCDVEDLRP----------Q-ANPSLWKPTDIRIQLTPL-RDSSNGLE----TYVCTKLHVT-----------------------------------CPSKYP-KLPPKISLE----ESKGMS-DQLLEALRNQLQAQSQELR--GEVMIYELAQTVQAFLLE-HNKPPKGS-FYDQMLQDKQKRDQELQDIQRQRESL ci0100139746 Ciona_intestinalis QERQENELELLHSIFGEEIEDLRK----------L-DKWKVVRPPEIILYLHPQ-EGMSGKKE----TFTKIDLKIK-----------------------------------FTPTYP-DSFPECKLE----NAKGLS-EDAINDLESGIKNLYKSLQ--GEVMVLEIAQFVKEFLHT-HNVPQCNS-VYEEMVKNKQRQLEKAAQEEEKKKEI Nvec1000001936 Nematostella_vectensis TERRNNELEALQAIYMDDFKDIRD---------------KPEDQPKVVLKLTPL-QSMWAKE-----VHSRVELVIE-----------------------------------YTPRYP-DQCPKLSLQ----KAVGLS-KENLNDLQRELECLAAELV--GEVMAHHLAHHVQGFLHA-HNKPRLS--FYDEMMTNKKNEEARLQAELENKKQK 115954900 Strongylocentrotus_purpuratus RERQENEVEVLKAIFMDDFQDKTR---------------DPSQGLELHLRLMPQ-QGMSGSGD----VHVTADMTII-----------------------------------CPPRYP-EQMPTVTIE----NGRGIS-KEKLKQIKKKVDIMAKKLR--GEVMILELAQHVQQFLHS-HNVPGPKS-FYEEMMSNQKRAEEKVAREQKKKMEL 115746620 Strongylocentrotus_purpuratus RERQENEVEVLKAIFMDDFKDKTR---------------DPSQGLELHLRLMPQ-QGMSGSGD----VHVTADMTII-----------------------------------CPPRYP-EQMPTVTIE----NGRGIS-KEKLKQIKKKVDIMAKKLR--GE--ILEMDEEIQRRQEAMKELKKKKDLTKQDSDETKDSRERHNSDPPGSQSPT Bflo1000005297 Branchiostoma_floridae EERQENEVELLSAVYVDDFKDLRD-----------QDAWKIKRAPEVCLTLVPQ-QSMHGTAD----VYAKVDLHIK-----------------------------------CPPNYP-DVEPELDLK----NSKGLS-DDNLRQLKHELHKMAQQNL--GEVMLLDLAQHVQTFLHA-HNKPPRGS-FYEEMMSNKRRQEEKVAREIQKKLDV Bflo1000010868 Branchiostoma_floridae EERQENEVELLSAVYVDDFKDLRD-----------QDAWKIKRAPEVCLTLVPQ-QSMHGTAD----VYAKVDLHIK-----------------------------------CPPNYP-DVEPELDLK----NSKGLS-DDNLRQLKHELHKMAQQNL--GEVMLLDLAQHVQTFLHA-HNKPPRGS-FYEEMMSNKRRQEEKVAREIQKKLDV 47230269 Tetraodon_nigroviridis TIQQDNELEALASIFGDDFQDLRN-----------KDPWKVKRPPEVYLCLRPN-GLNNGQG-----CYATVDLQVK-----------------------------------CPPSYP-DVPPELELK----NVKGLS-NEKVQNLQNELTKLAAARC--GEVMIYELADHIQGFLSE-HNKPPPRS-FHEEMLKNQQRQQEKRAQEEQQRMDQ 114656310 Pan_troglodytes PQRQDHELQALEAIYGADFQDLRP-----------DACGPVKEPPEINLVLYPQ-GLTGEE------VYVKVDLRVK-----------------------------------CPPTYP-DVVPEIELK----NAKGLS-NESVNLLKSRLEELAKKHC--GEVMIFELAYHVQSFLSE-HNKPPPKS-FHEEMLERRAQEEQQRLLEAKRKEEQ 114656308 Pan_troglodytes PQRQDHELQALEAIYGADFQDLRP-----------DACGPVKEPPEINLVLYPQ-GLTGEE------VYVKVDLRVK-----------------------------------CPPTYP-DVVPEIELK----NAKGLS-NESVNLLKSRLEELAKKHC--GEVMIFELAYHVQSFLSE-HNKPPPKS-FHEEMLERRAQEEQQRLLEAKRKEEQ 114656306 Pan_troglodytes PQRQDHELQALEAIYGADFQDLRP-----------DACGPVKEPPEINLVLYPQ-GLTGEE------VYVKVDLRVK-----------------------------------CPPTYP-DVVPEIELK----NAKGLS-NESVNLLKSRLEELAKKHC--GEVMIFELAYHVQSFLSE-HNKPPPKS-FHEEMLERRAQEEQQRLLEAKRKEEQ 114656304 Pan_troglodytes PQRQDHELQALEAIYGADFQDLRP-----------DACGPVKEPPEINLVLYPQ-GLTGEE------VYVKVDLRVK-----------------------------------CPPTYP-DVVPEIELK----NAKGLS-NESVNLLKSRLEELAKKHC--GEVMIFELAYHVQSFLSE-HNKPPPKS-FHEEMLERRAQEEQQRLLEAKRKEEQ 65287717 Homo_sapiens PQRQDHELQALEAIYGADFQDLRP-----------DACGPVKEPPEINLVLYPQ-GLTGEE------VYVKVDLRVK-----------------------------------CPPTYP-DVVPEIELK----NAKGLS-NESVNLLKSRLEELAKKHC--GEVMIFELAYHVQSFLSE-HNKPPPKS-FHEEMLERRAQEEQQRLLEAKRKEEQ 62645579 Rattus_norvegicus SQRQDHELQALEAIYGSDFQDLRP-----------DARGRVREPPEINLVLYPQ-GLAGEE------VYVQVELRVK-----------------------------------CPPTYP-DVVPEIELK----NTKGLS-NESVNLLKSHLEELAKKQC--GEVMIFELAHHVQSFLSE-HNKPPPKS-FHEEMLERQAQEKQQRLLEARQKEEQ 7305017 Mus_musculus SQRQDHELQALEAIYGSDFQDLRP-----------DARGRVREPPEINLVLYPQ-GLAGEE------VYVQVELQVK-----------------------------------CPPTYP-DVVPEIELK----NAKGLS-NESVNLLKSHLEELAKKQC--GEVMIFELAHHVQSFLSE-HNKPPPKS-FHEEMLERQAQEKQQRLLEARRKEEQ consensus/100% ......E...h.ulh...............................h...h......................h..........................................YP......h.h.........h........h...h.................h....p......................................... consensus/95% ......Elbhlpulh..ph...........................h.h.l.s....................l.h....................................hs..YP....P.h.h........sh..p.....l...h..........G..hhhpl....p..lp..................................... consensus/90% ...b..ElbhlpuIa..ph...........................h.l.l.s................h.h.L.h....................................hs..YP....P.hpl.......psl..p.....l.p.l....p.....G..hhhpl...hpp.lp.................pp.................. consensus/85% .p.b..ElbhlpuIa.sphp.bp......................ph.l.l.s................h.h.L.h....................................hs.pYP.p..P.hplb....p.pslp.p.....lbp.l..b.pp....G..hlaplh..hppbLp................bpp......p......p..p. consensus/80% .p.b.pElbhLpuIa.s-hp.bp......................phpl.l.s................hph.L.hp...................................hs.pYP.p..P.lplb....p.cslp.ppphp.lbp.l.pbhpp....G..hlaplhp.hppbLpp.......p.......bppb..b..p..b..bpp.p. consensus/75% .pbQppElEhLcuIa.c-hp.lp....................s.phpl.l.s....ps.p........lphpL.hp...................................hs.pYP.c.sP.lplb....s.cslp.cpplpplbp.lppbhpp.h..GbsMlapLhp.hp-hLpc.......p.....b.bpcb.pbppp..b..bpp.p. consensus/70% .pbQppElEuLcuIY.--hpslp...................ps.phpl.l.sp...ps.p........lphsLphp...................................hs.pYP.c.sP.lplb....sscsLp.cpplpplbp.lppbhcp.h..GbsMlapLhp.hp-hLs-.ps....p.....bbbpcbppbppp..bpbbppbp. 4. Apg10 FINAL ---------HHHHHHHHHHHHHHHHH-------------E----------------------------EEE--------------------------------------------------EEEEEEEEEEE---EEEEEEEEE------------HHHHHH----------------------EEEE----------EEEE---HHHHHHHHHH--E----------------EEEEEEEE----EE---------------- 19113870 Schizosaccharomyces_pombe_972h_ LILA-QKLSKGGISCELIEFDECILK-------LEWHTELL-------------------DKN--DSLLYEQEEDD--------------------------------------ILSLMNPMITMHAWIRDSPSFEVPQFFFQP--YANGSDPLTKMEQIFELL-------EGSSQN-LAYD-ALAIGDCPGTV-GIAWYIHPCRTRDYFEMLQIDKEDPK-------------YLSLWLLYI-HQVLSPLTQPIIKAVDDAEK 111067784 Phaeosphaeria_nodorum_SN15 LSAF-PHLTAEEFAQACSDLRRRYRERDSSR--GDWQSVEII------------PSTDTPYLR--IAKELREDAKHDAGFHVDSEDDEVEEDDDE------------------VLNTPRKIPALIQYDILLSPVYQVPVLYFGIS-DLQHRY-PPTMTTLYEHL--IPSQFKAQAENTGVMG-GVTINDHPATN-RPVFFIHPCQTAEVMEASVGDKAITAEE-----------YLLMWIGAL-GGCVGLSVPLALMRQETSNT 145255833 Aspergillus_niger_CBS_51388 ----------------------MNEE-------LQWKASE---------------------------EDPVRRAG---------------------------------------TLSIPLSILKVSSNIKQEALIRAPETSVRL--QDNHPG-PLGIDAVYQYL--VPEQYRKELQSIGIMG---------DSG-TPAFFVHPCQTTDAMRHIARQLCLTPEI-----------YLIIWLGLV-GNRLGLQLPKEFFTVEGMET 123446557 Trichomonas_vaginalis_G3 ----------MDFQTKIKNFVKTFSK-------DGWILKESN-------------------GK--TYATNERRVL---------------------------------------PVEINDETVFVTYYIDVDEVFQAPVLSAYF--YTESGH-RLTYDELLKIM-----PEKLDMNS-------VSERIHPITG-IPLFFIHPCKTIEYITPIEHHGID---------------FMNAWIGVY-GPLFLYRLPI---------- 56758348 Schistosoma_japonicum MDQD-GQLTLTAYKNAIHTLKNNLICNS-----DEWFIDEFV-----------------GHPD--ELEMVHRKTLQLSNNSCFVA-----------------------------SSSSISEEILAEYRVVYSQSYKVPVLLIRF--QTKSGR-LLHHNKCWSEN---FRSSLFSLSQVPLPLFALSEIEHPHLG-IPFYEFHPCRTAAVMKEVLSSEPVQYHNPV--------KYMIMWLSLI-ASPFGLHLPQKCLFT----- 71984851 Caenorhabditis_elegans ------MLTEQQFRDEIRGFVDKMNENN-----FHWQLKE--------------------------IGKYARETHL--------------------------------------KTLSDGRVITSETHILYNSTYQVPTIWFNF--FENNGS-PLPFRTVIRDV-LNISETEESEAS-IRS--RISHYEHPFMG-VLYYNIHPCNTSNIMKELNTDRS----------------YLMSWFSVY-GQQIGLKLPDRVL------- 39587070 Caenorhabditis_briggsae ------MLSEQKFREELQKFVEQMSQ-------NKWNWQL-----------------KEVKIR--IMIFSAKFKFQNGKYARETHL----------------------------ETLSSGKIVSADIHILYNSTYQVPTLWFNF--FENNGA-PIPFEEVVRDI-LKIPESEESDSS-LRQ--RISHYEHPILG-VLYYNIHPCNTEKIMKELKTDS-----------------YLISWLSVY-GQQLGLRLPDSL-------- 118779191 Anopheles_gambiae_str_PEST -------MKLSEFTLYCDQLVHASRNCV-----DRWRWVR---------------EKEPDFLC--LETVRHVRSTESNPQTEMLEGMAELDEEDDPGLAAVS------------GGQPTARMLAFEYHVLYCESYEVPILLFNI--YEKGGA-RLNLEEAWEVL---QISGCVPPDE-RYS--AITMVHHPVLY-RPYLKLHPCKTAE-LVGSLSGSTN---------------PVLSFITTY-APYVNLELDELARALQSDSK 67469027 Entamoeba_histolytica_HM_1IMSS -----MNFNKKEFNFQAKKIVNCLNKLGK----YQWKWDTR------------------------GFIIHSCELGELLQVDDNHCNEEIYEEKDDT------------------ICQIQQKRYYFIHHILYNTSYQVPQFGVSLF-DKKENK-YITQLEQATTL-LSHLSGIEKCDNNNPLNLFITMNEHPLIEDYYMFFIHPCGTSSSLLPIIEFSLSD--------------YLICYLSSY-GPYLCCFENWLSVQHKLITE 110761124 Apis_mellifera MDGP-GTITWEEFLGDAEKFVQMSNYIS-----DGWDLRGNK------------NIPGEVYVVR-RQKQYIYNNSNNYTALENNDLNFMLKKEDPFE-----------------ATCPIERPLITEHHVLWSMSYSVPVLYFNGWKSDFPGINSISMEEAQSFV--------CDREL-KYK--DLSQAIHPILG-TPFLYLHPCMSHELLQITSKSKN----------------KLVSWLSTV-APAALNLKLRPEYYQLTI-- 108873613 Aedes_aegypti -MTT-GTLTEEQFRDAAEQFLERSHQIG-----DEWELVK------------------SENGT--YLTKLLKQTLKLEQKAGTADEDEELAVDDPSLGK---------------STPNDEEVYQFEYHVVYSVSYQVPVLYFNA--YKSDGT-MLQLEEAWQGF---RDLASESREQ-LRR--TLTQMEHPILF-RPFLALHPCQTAQVLSNVATGCGN---------------LLVAFISSY-GPFVNLNLDVRYALISEFIL 108869693 Aedes_aegypti -MTT-GTLTEEQFRDAAEQFLERSHLIG-----DGWELVK------------------SENGT--YLTKRLKQTLKLEEKAETADEDEELIVDDPSLGK---------------PTPNDEDVYQFEYHVVYSVSYQVPVLYFNA--YKSDGT-MLRLEEAWQGF---RDLASESREQ-LRQ--TLTQMEHPILF-RPFLALHPCRTAQVLGNVATGCGN---------------LLVAFVSSY-GPFVNLNLDVRYAALRK--- 125810412 Drosophila_pseudoobscura ---M-SEMTWKDFLSQAQEFLKISEQLG-----DGWVLHE-------------------KDAN--EPNTFLKYSHKIKCQE---------------------------------SKDSDFTLINVEYHIVYSVSYQVPMLFFQA--HRSDGS-LLDLEATWRVF-----MPDTASKD-LYQ--MLTQTEHPVLF-RPFMALHPCRTAEVLDQFGQPSAN---------------RVLTFISLY-GPHVKLQLANAYGLSK---- 73853344 Drosophila_melanogaster FSIM-GDLSWKDFLSQAKQFLEISQKLG-----DNWILEQ-------------------KDSN--EPNTYLKCSQKIKCRG---------------------------------GKDNSAELVSVEYHVVFSVSYQVPMLFFQA--HRSDGS-LLDVEATWRMF-----MPESKASD-LHQ--ILTQMDHPVLF-RPFMALHPCRTAEVLKQFGKPSCN---------------QVLTFISLY-GPHVQLHLQNAYGLSQEYT- Psoj1000014309 Phytophthora_sojae LSYE-QFCTEAELLQERSHEVASAQTVGSGRSVATWEWRLGNRQHL--------DGDSYLVSTGNVVLYQSGDGQDVKDDKDLGGDIDELLLVEEESLIEDDEVQTP-------LQCKGAKTALVEFHIVYHTIYQTPVLYFRA--LAVDGT-PLPASVVTRDV-------HFPGSD-GRST-FVAMEEHPVLG-KPYSFLHPCETAAAMQLLQAQVQPSATDTKTPCDVEVPQYLASWLSLV-QPLTGIS-PVDYYSVGLDFD 60475073 Dictyostelium_discoideum_AX4 ------MLTSKDFRDQAINLIKKWNNIIDE---IPWQWNQINEL----------NNESKGYFT--TKRYHKINNNNNNNNNNNIENKNNNNIENFEEIKETIDDSSTTIIKSN-NNNNENNIIIFQFDIIYSKSYQVPVLYLNGF-SSFDSS-PLSWNEIWNNL---PLSNLDKNQQ-STIP-YITQVEHPILG-NPCYQLHPCETDNLMKLILLKEKDYNDNND-KKEYFKDYYLLSWLSII-GPMVNIKIPFDLLKNNNI-- 30680336 Arabidopsis_thaliana EVSD-GRLTVEGFSVASRAFADKWKIHNQSF--PPWSWVPLINRTLLVS-----KKQEEGYLS--LEKIIILSSLEEEIPEDESLNVATDCLEKEETVDHTIL-----------VPTMENEAHYYDFHIVYSASYKVPVLYFRG--YCSGGE-PLALDVIKKDV---PSCSVSLLLE-SKWT-FITQEEHPYLN-RPWFKLHPCGTEDWIKLLSQSSSSSGCQMP------IVLYLVSWFSVV-GQVVGLRIPLEMLN------ 125590884 Oryza_sativa_japonica_cultivar_group PCIF-GVLTCDDMDQALNRAGGK----------AGNKGAEAA------------LTAEEGYLA--LEGVYRNHGGSQVQSCSGNLHFYDYHVVYSFSYK---------------VQSCSGNLHFYDYHVVYSFSYKVPVLYFQG--HQSGGQ-LLTLDEIKEDL---PSLSLKLLGE-SRWT-FITREEHPHFS-RPWFTLHPCGTSDCMKLLLEGMQDKDQQVR---------YLPAWLTVV-GQAVGLKIPLGLHCNS---- 125579467 Oryza_sativa_japonica_cultivar_group SIGE-GTLSLGDFVASAKALI------------EKWKEE--------------------GYLA--LEGVYRNPGGRHVSIVLVYLHVLQEQIGDSSNFDDADIVSDDAW-----AQSSSESVHIYDYHVVYSFSYKVPVLYFQG--HQAGGQ-LLTLDEIKEDL---PSHSLKLLGE-SKWT-FITREEHPHFS-RPWFTLHPCGTSDCMKLLLEGVENKDQHVQ---------YLPAWLSVV-GQAVGLKIPLELYASGLKTQ Bflo1000028874 Branchiostoma_floridae CSFL-ADMQRWQFLAEGVQVGIWWHAK------HSWEAGNTNFPPQPGGKPRSQVDQAGGSFL-VK-KDVVRPTSERVCVKNCVPLEDREDEDDTDVLDVEDSMEEADINTVQ-CEVKETQYYRYQYHVLYSTSYCVPVLYFIA--SRPDGS-LASLDEVWSSV---PASFQSRLQD-DRWT-FLTQQEHPVLG-CPMFQLHPCHTATMMQPLMQETQDEEMGRC--------NYLVSWLSTV-GPVVGLNLPLSYAHIRHVT- Bflo1000011200 Branchiostoma_floridae -MAA-GTLTWCRFKDEASQLLRHSERIK-----DGWEWTEVE------------VDQAGGSFL-VK-KDVVRPTSERVCVKNCVPLDDTDVLDVEGSMEEADINTVQ-------CEVKETQYYRYQYHVLYSTSYCVPVLYFIA--SRPDGS-LASLDEVWSSV---PASFQSRLQD-DRWT-FLTQQEHPVLG-CPMFQLHPCHTATMMQPLMQETQDEEMGSC--------SYLVSWLSTV-GPVVGLNLPLSYAHIGHVT- Nvec1000017690 Nematostella_vectensis -IVS-GVPVKYLVKRSIVAIESHKAEDG-----ERSLQYLEGLE----------ASESYVSDDAGHSQISKESVTSPPAFPHFAHFFECELISI--------------------ASRAKDSYWIFGYHGLYSPSYSVPVLYFTA--NKQDGT-PLSLEEVWSNI---PHVYHQRLQY-EKWT-FLTQQEHPILG-APCFQLHPCHTADLMKNTSPTGSFTAQGSK----QIGTNYLVSWLSQV-GPVVGLNLPLQYCSSQEGNC 80751161 Danio_rerio KPAS-CFLDENTFRLCCRLFLQHSESIQ-----DGWIWEQI-------------KGSDEGFMK--KTVLIPVKSSLLDKQHESIQAENTELPTDDFEADAEDETAGVD------AVCESHAVLRYEYHVLYSCSYQIPVLYFRA--SALDGR-SLSLEEVWSNV---HPNYRQRLKQ-EPWD-TLTQQEHPLLG-QPFFMLHPCRTEEFMKPALELAHAQNRRVN---------YIVSWLSVV-GPVVGLDVPLSFSTAVSAPD 118104401 Gallus_gallus VEED-FFLEEKRFKQYCEEFVKHSQQIG-----DGWEWRTT-------------KDLGDGYLS--KTHFQVTNKSISPDLKKENGSTEQTLLTHVEESSDDSQVA---------GVCATEEVIRYEYHVLYSSSYQVPVLYFRA--CFLDGR-PLTLDEIWKSV---HVCYQARLQE-GPWD-TITQQEHPLLG-QPFFVLHPCRTNEFMCSILAGSQKDNRHRN---------YIVLWLSTV-GPVVGLTLPLSYAKLEPEEN 109464245 Rattus_norvegicus -MED-EFFGEESFQHCCAEFIRHSQQIG-----DGWEWRTA-------------KECSDGYMC--KTQFQIKKETPTPHRETPASVHTCLPTEENLELPVDDSEA---------VGPAAAAVIRQEYHVLYSCSYQVPVLYFRA--SFLDGR-PLALEDIWEGV---HECYKLRLLQ-GPWD-TITQQEHPILG-QPFFVLHPCKTNEFMTAVLKNSQKINRNVN---------YITSWLSIV-GPVVGLNLPLSYAKATSQSE 146413429 Pichia_guilliermondii_ATCC_6260 -------MNLAQFNHAISTFHKFVADNL----------------------------------------YHQNKPVPVTGYCKHAENRKYS------------------------FMVMEINEYLIESVITFHPFYQVPVMYFRVL-SNSNNQ-ALSIDQIAKIF-----------SD-FSPT-SVTLDSPEVVP-GVWFCIHPCETAQQM----ETCSTTNPEE----------YLRLWYAFY-GVGATFPTISVRPTIND--- 62286623 Mus_musculus -MED-EFFGEKSFQHYCAEFIRHSQQIG-----DGWEWRTA-------------KECSDGYMC--KTQFRIKNEASTPHVGTPASVLTCLPTEENLELPMDDSEVT--------RPAAVAEVIKHEYHVLYSCSYQVPVLYFRA--SFLDGR-PLALEDIWEGV---HECYKPRLLQ-GPWD-TITQQEHPILG-QPFFVLHPCKTNEFMTAVLKNSQKINRNVN---------YITSWLSLV-GPVVGLNLPLSYAKATSQSE 73952324 Canis_lupus_familiaris MEED-EFFGEKTFQHFCAEFIKHSQQIG-----DGWEWRTS-------------KDCSEGYMC--KTHFQIKPGMPMSPPGTSAHIQTCLPMEEALELPLDDFEKT--------ETTTGSEVIKYEYHVLYSCSYQVPVLYFRA--SFLDGR-PLALKDIWEGT---HECYKTRLLQ-EPWD-TITQQEHPILG-QPFFVLHPCKTNEFMTPVLKNSRKINRSVN---------YITSWLSVV-GPVVGLNLPLSYARTASQDE 109077825 Macaca_mulatta MEED-EFIGEKTFQHYCAEFIKHSQQIG-----DSWEWRPS-------------KDCSDGYMC--KIHFQIKSGSVMSHPGASTHGQTCPPMEEAFELPSDDCEVI--------ETAAASKVIKYEYHVLYSCSYQVPVLYFRA--SFLDGR-PLTLKDIWEGV---HECYKTRLLQ-GPWD-TITQQEHPILG-QPFFVLHPCKTNEFMAPVLKNSRKINKNVN---------YITSWLSIV-GPVVGLNLPLSYAKATSQAE 18594496 Homo_sapiens MEED-EFIGEKTFQRYCAEFIKHSQQIG-----DSWEWRPS-------------KDCSDGYMC--KIHFQIKNGSVMSHLGASTHGQTCLPMEEAFELPLDDCEVI--------ETAAASEVIKYEYHVLYSCSYQVPVLYFRA--SFLDGR-PLTLKDIWEGV---HECYKMRLLQ-GPWD-TITQQEHPILG-QPFFVLHPCKTNEFMTPVLKNSQKINKNVN---------YITSWLSIV-GPVVGLNLPLSYAKATSQDE 55624036 Pan_troglodytes MEED-EFIGEKTFQRYCAEFIKHSQQIG-----DSWEWRPS-------------KDCSDGYMC--KIHFQIKNGSVMSHLGASTHGQTCLPMEEAFELPLDDCEVI--------ETAAASEVIKYEYHVLYSCSHQVPVLYFRA--SFLDGR-PLTLKDIWEGV---HECYKMRLLQ-GPWD-TITQQEHPILG-QPFFVLHPCKTNEFMTPVLKNSQKINKNVN---------YITSWLSIV-GPVVGLNLPLSYAKATSQDE 119616289 Homo_sapiens MEED-EFIGEKTFQRYCAEFIKHSQQIG-----DSWEWRPS-------------KDCSDGYMC--KIHFQIKNGSVMSHLGASTHGQTCLPMEEAFELPLDDCEVI--------ETAAASEVIKYEYHVLYSCSYQVPVLYFRA--SFLDGR-PLTLKDIWEGV---HECYKMRLLQ-GPWD-TITQQEHPILG-QPFFVLHPCKTNEFMTPVLKNSQKINKNVN---------YITSWLSIV-GPVVGLNLPLSYAKAMSQDE 134085799 Bos_taurus MEEN-EFFGEKTFQHYCAEFIKHSQKIG-----DGWEWRAS-------------KDCSDGYMC--KTHFQVKNGTAVSHQGTSDHVQTFLPVEEALDPPLDDLEVN--------ETTTAAEVIKYEYHVLYSCSYQVPVLYFRA--SFLDGR-PLALKDIWEGV---HECYKMRLLQ-GPWD-TITQQEHPILG-QPFFVLHPCKTSEFMTPVLKNSQKINRNVN---------YITSWLSMV-GPAVGLNLPLSYATAASQDE consensus/100% .... ..............................................................................................................................p...p.P...h...................p.. ................................h..hHPC.o...h.........................h..ah..h .................... consensus/95% .... .......h...................................................................................................................l..p..apsP.h.h........s......p...p.. ...................ls...........hh.lHPC.T...h........................bh..ahs.h ...h................ consensus/90% .... ..h....h......h...............W.........................................................................................p.pl.hs.saplP.hhh.......ss...h.hp.h.p.h ...........p.......lob.pHP......hh.lHPC.T.phh........................bl..ahu.h u..h.h.............. consensus/85% .... ..hs...a...h..h.p.............W....................................................................................p.h.hpaallas.oYpVPhLaFp......sup..lshc.hbp.h ...........p.......lob.-HPhh...Phh.lHPC.T.phhp.h.....................hl..alo.h G..lslpls..h........ consensus/80% .... ..hsb.pF...h.ph.p..pp.........Wbh.................................pps..........................................s...phhbhpaHllYssSYpVPlLaFph.....sGp..Lshcphhp.h ...........p...b...lob.-HPhls..Pha.lHPCpTsphhp.h.p.p.................Yl.sWlS.h GshlsLpls..h........ consensus/75% .... ..hsbppFp..h.phhpp.pp.........Wbh.................................pps..........................................s...phhbhcaHllYSsSYpVPVLaFph.....sGp..Lshcplhpsl ........p..p...b...lTb.EHPhls.pPha.lHPCpTsphhp.l.pssp................Yl.sWlS.h GshlsLpls..h........ consensus/70% .... ..hsbppFpp.h.phlcp.pp.......s.Wbhp.......................s..b.pbb.pps.........................................ss...phhbh-aHllYSsSYpVPVLYFpu..p..sGp..Lslcclhpsl ......p.pb.p...bs..lTbbEHPlLs.pPaa.LHPCpTschhp.l.psspp...............Yl.uWlShl GshVuLplPhphh....... 5. Apg3 FINAL ----HHHHHHHH---EEEE---------EEEE--EE-------HHHHHHHHHHH--------EE----------------------------EEEEEE------E---------------------------------------------------------EEE------------------------------------------------------------------------------------------------------------------------------------------------------------------EEEEEEEEEEEE----EEEEEEEEE----------------EEE-----------E---------EEEEE------------------------EEEE----HHHHHHHHHHHHH---------------------------------------------------------------------------------EEEEEEEEE-EEEE--EEEE-------- 119600075 Homo_sapiens MQNVINTVKGKALEVAEYLTPV-LKES-KFKETGVITP-----EEFVAAGDHLVHHC--PTWQWATGE------------ELKVKAYLPTGKQFLVTKN-VPCYKRCKQM-EY------SDELEAIIE---------------------EDDG------DGGWVDTYHNT--GIT------GITEAVKEI-------------TLENKDNIRLQDCSAL-------------------------------CEEEEDEDEGEAADMEEYEE----------SGLLETDEATLDT------RKIVEACKAKTDAG---------GEDAILQTRTYDLYITYDKYYQTPRLWLFG--YDEQRQ--PLTVEHMYEDI---SQDHVKK---------TVTIENHPHLP--------------PPPMCSVHPCRHAEVMKKIIETVAEGGGELG------------------------------------------------------------------------VHMYPSLYVRLV-AKWLLTIFFLR-NL------ 126137846 Pichia_stipitis_CBS_6054 ------MLRSKLSSLREYLTPI-NHNS-NFSTTGEISPEE---FVQAGDYLVYKF----PTWQWSTCP------------KNLQKSFLPADKQFLITRH-VPSYQRASNYLTG--GNVDFDDDEEYED----------------------DEE------GDGWVKSRKVV-----------SVTKPQDGQ----------------EEEPQEINDIDDL-------------------------------IDVTAEGAEDDGDNLEDFDD--------------------LDI------------------------------RNDGGSVRKYDMYITYLTSYRVPKMYLVG--FNANGI--PLTPDQMFEDI---NADYKDK---------TATIENLPVAH--------------NTTSVSIHPCKHSLVMKVLMKHSKLKKTREE----------------------------------VNHLSEELGKTNISDKSQEDTGKDAEAAAGPEAEDSIRVDQYLVIFLKFI-ASVTPGIEYDY-TMDA---- 70998182 Aspergillus_fumigatus_Af293 ----MNILHSTLSTWRDRLAPV-SRTS-TFRTTGQITP-----EEFVLAGDYLVYKF--PTWSWADASS-----------PAKRVSYLPPGKQFLVTRG-VPCHRRLNENFAG-----DAGHEDEIVR------DMLSGA---------DADD------DDGWLRTGGGR--DLA------EKQAE-RIK------------DVRTVDESGNMGEQE---------------------------------------DDEEDIPDMEDDDD---------------DEEAIIRE------PA---------------------GKSTTQPTRTYNLYITYSNFYRTPRLYLSG--YLSPSE--PLPPHLMMEDI---VGDYKDK---------TVTLEDFPWFD-------------GGVKMASVHPCRHASVMKTLLDRADAALKIRRDKLKQAHSADQ------------ANRINSERGLEGLVDETRGLSLNEQQGHAAGGDEWEVLQHDEEDQVAIRVDQYLVVFLKFI-ASVTPGIEHDF-TMGV---- 123505459 Trichomonas_vaginalis_G3 ---MKNQIHQKWMSFVNKYNSV-PHTS-TLIKDGKLTP-----EEFVAAGDCLIANC--PVWSWCSAP------------EGHEVDYLPKDKQYLINRR-VVCQKRATDL--------SKMMQEEVDI-------------------------------GDGWCQAGEAA------------------------------------------AQAALEI-------------------------------------NDDEEAVDLDEIDI----------------DEIEPEV-----------------------------QEVDVIDYRTYDISICYDKFYNVAHIFLYG--VNNEGV--PLTLEQMYQDI---SADYADK---------TVTYENHPFTA---------------TKNLSIHPCQHGHVMVRLVERLDHPEKFCA-------------------------------------------------------------------------PMYYFIFLKFI-HTVIPTIDISTPTLDFEA-- 65303096 Theileria_annulata ----MRRAYEVFTSSFSFLFD--SEGS-SLGLKGSLNT-----SNFVEYGDDLVSIN--KQWKWAGKT------------PSMFDSNLPEDKQYLCCDN-VACLRLLKSDITL----------------------------------------------HESWLVS------------------------------------------------------------------------------------------EPHDNEAD-NLYEE----------------DPASSVE----------------------------------TSWRTYSLTITYDRYNETARFWLRG--YNQFGL--PLSKDEMFEDV---PEVYVGK---------TVTVERHPFTG---------------FLNLTVHPCNQKEIVKLENQNFRLELVKFK--------------------------------------------------------------------------------------------------------- 71027271 Theileria_parva_strain_Muguga ----MRRAYEVFTSSFSFLFD--NEGS-ALGLKGSLNT-----SNFVEYGDGLVSIN--KRWRWAGKT------------QSMYDTNLPEDKQYLYCDN-VACLKLPKPESVL----------------------------------------------HESWVVP------------------------------------------------------------------------------------------QLQDNVPD-NLYEE----------------DPVTSVE----------------------------------TTWRTYSLTITYDRYNETARFWLRG--YNQFGL--PLSKDEMFEDV---PEVYVGK---------TVTVERHPFTG---------------FLNLTVHPCNQKEIVKREEQNLRL--------------------------------------------------------------------------------------------------------------- 134065134 Leishmania_braziliensis -MSIRRSLYEGFKHVHNKLHNV-KTTS-DFQTTGRLTP-----KEFVEAGDELVHKN--PVWQWIGG-------------PESVQDYLPKEKKCIVYRG-APCNERAPVD--------HSSASPETVD-------------------------------EDGFVLTEVPQ------------------------------------------VALPTTV-------------------------------------IEEDKVLTWDEHDD--------------DSGEEDLIA-----------------------------TAKDNSSLRIYDVYIVYDRYYQTPRMYLVG--YTSDHVT-PLTLDQMKEDV---YRSNYGK---------TVTIDPHPILS---------------IPCISIHPCRHAETMRSLMHRMQENYDREK------------------------------------------------------------ANVPNAEPFVFPTHLAMLLFLKFI-STVLPTIQYDV-SSGFNL-- 68128829 Leishmania_major -MSLRRSLYEGFKNVHNKLHNV-KTTS-DFQTTGRLTP-----KEFVEAGDELVQKN--PVWQWVGG-------------PESVQDYLPKEKKCIVYRG-APSTERAPVD--------PTNASPEAVD-------------------------------EDDFVLTEAPK------------------------------------------VALPATV-------------------------------------IEEEKVLTWDEDDD--------------DSGEDDVVA-----------------------------TATDNSNMRVYDVYIVYDKYYQTPRMYLVG--YASDHVT-PLSMDQMKEDV---YRSNYGK---------TVTIDPHPVLS---------------IPCISIHPCRHAETMRSLIHRMQENYNREK------------------------------------------------------------ANDPSAEPFVFPTHLALLLFLKFI-STVLPAIQYDV-SSGFHL-- 146097479 Leishmania_infantum_JPCM5 -MSIRRSLYEGFKNVHNKLHNV-KTTS-DFQTTGRLTP-----KEFVEAGDELVQKN--PVWQWVGG-------------PESVQDYLPKEKKCIVYRG-APCTERAPVD--------STNASPEAVD-------------------------------EDDFVLTEATQ------------------------------------------VALPATA-------------------------------------LEEEKVLTWDEDDD--------------DSGEEDVVA-----------------------------TATDNSNLRVYDVYIVYDKYYQTPRMYLVG--YASDHVT-PLSMGQMKEDV---YRSNYGK---------TVTIDPHPVLS---------------IPCISIHPCRHAETMRSLMHRMQENYNREK------------------------------------------------------------ANDPNAEPFVFPTHLALLLFLKFI-STVLPTIQYDV-SSGFHL-- 71421296 Trypanosoma_cruzi_strain_CL_Brener --MNKRGLYEKYKKLYNCLNGV-KTVS-NFQVTGTLTP-----LEFVEAGDELVQKM--PVWAWAEG-------------EEGIQPFLPPRKKYLVYHG-APCYQRGPDA--------DSLGENEMEG-------------------------------EDGWVTTHAER------------------------------------------QPSKNVV-------------------------------------MAPAKTINWDEEDD-------------EDQDHA---------------------------------EDIGERKCRLYDVYMVYDQYYQTPRIFLIG--YAEDHTT-LLTKDEMMQDV---YASNREK---------TVSIDPHPFLK---------------AACISIHPCRHAETMKRLIQHMKTRYEAEG------------------------------------------------------------ANEAEKDAFVFPTHMAFFLFLKFI-SSVVPTIEYDL-STSIDM-- 84043456 Trypanosoma_brucei_TREU927 --MNKQSLYEGFKKVYNSVVGV-KTTS-SFHETGTLTP-----MEFIQAGDELLHKM--PVWSWAEG-------------PENIQPFLPPNKKYLVYRG-APCYERAAVA--------GNDDADEIVE-----------------------DD------DDEWITTHANR-----------------------------------VLKATTEIAAEKTI-------------------------------------NWDDDDDD-DDANNNNNVVVVDSSRKDEGDDDEDADR-----------------------------DQTERRRCRLYDVYMVYDQYYQTPRIFLIG--YAEDHVT-PLTTSEMMEDV---YPVNRER---------TVSIDPHPFLQ---------------AACISIHPCRHAETMRRMIQHMKQRFEESS---------------------------------------------------------------PETAKFVFPTHMALFLFLKFI-SSAVPSIEYDL-STGIDI-- 19113456 Schizosaccharomyces_pombe_972h_ ---MAQRLTSAFLNWREHITPA-SKTS-DFENTGMISP-----EEFVLAGDYLVSKF--PTWSWECG--------------DRIRGFLPKDKQYLVTRH-VFCVQRNINIGV-----------------------------------------------NEEWVDIETDD-TRNK------DDDQDDDAI-------------SSIHSDTSDIASAERL------------------------------KGQSKELSDSGPLP-LKDEED----------------DDQMVSP------------------------------VIKEDEGRYYDLYIVYDKYYRTPRLFLRG--WNAGGQ--LLTMKDIYEDV---SGEHAGK---------TVTMEPFPHYH-------------SHNTMASVHPCKHASVLLKLIKQHRERNDPIR------------------------------------------------------------------------VDQYMVLFLKFV-STMLPYFEIDY-TIQA---- 111057680 Phaeosphaeria_nodorum_SN15 --MAYNFVRSYIDTFRERTAAI-SHTS-TFRETGQITP-----EEFVLAGDFLVFKF--PSWQWGDASS-----------AGKRVPYLPEGKQYLMTRG-VPCHRRLDDNFAG-----EAGQDETIVG------DGFAGS---------EGAD------DDGWLRTGGMA--ASQ------EAKVRDVRT----------------VDESGNLGAA----------------------------------------EEEDEIPDMEDMDD---------------DEEAIIRD------PQG--------------------GSSSTAPLRTYTLYICYSAHYRTPRLYLSG--YGATSV--PLKPQEMMEDI---VGDYKDK---------TVTIEDFPFFE-------------HALKTASVHPCKHASVMKVLLDRADAALKLRLAKLKAGKDIGK--------------LDTGMEGLVDDTRLLKLSEQAKGKEGDEAKDDWEVLSEGGDDEVAIRVDQYLVVFLKFM-ASVTPGIEHDF-TMGV---- 67901462 Aspergillus_nidulans_FGSC_A4 SKPKMNILHSTLSTWRDRLAPV-SRTS-TFRTTGQITP-----EEFVLAGDYLVYKF--PSWSWGDASS-----------PSKRVSYLPPGKQFLVTRG-VPCHRRLNENFAG-----DAHLDDEIVR-DFLSGGAGDSD---------GGDD------NDGWLRTGGGG--KRH------ESTIRDVRT----------------VDESGKEEAEV---------------------------------------EEEEDIPDMEDDDD----------------EEAIIRE------PAG--------------------TTSTTQPTRTYNLYITYSNFYRTPRLYLSG--YLSPSE--PLPPKLMMEDI---VGDYKDK---------TVTLEDFPWFD-------------GSLQMASVHPCRHASVMKTLLDRADAALKLRREKIKQTAASSSPQEKAKLAQPESGLEGLVDDIKGLSLGDAQKQCQGDKGQAGGDEWEVLQHDE--EEEVAIRVDQYLVVFLKFI-ASVTPGIEHDF-TMGYCVYN 145235341 Aspergillus_niger_CBS_51388 ----MNILHSTLSTWRDRLAPV-SRTS-TFRTTGQITP-----EEFVLAGDYLVYKF--PSWSWADAAT-----------PAKRVSYLPPGKQFLVTRG-VPCHRRLNDNFAG-----DAGHEDEIVR------GMLAGE---------GDND------DDGWLRTGGGD--NAA------SSSSSARIG------------DVRTVDEAGNMGEVEE--------------------------------------PEEDEIPDMEDEDD---------------DEEAIIRE------P----------------------VTGTTQPTRTYNLYITYANFYRTPRLYLSG--YLSASE--PLPPQLMMEDI---VGDYKDK---------TVTLEDFPWFE-------------GSVKMATVHPCRHASVMKTLLDRADAALKIRRDKLKAAATAHA--------------------------------------EPDAKVAAGAGGLEGLVDDVAIRVDQYLVVFLKFI-ASVTPGIEHDF-TMGV---- 121713242 Aspergillus_clavatus_NRRL_1 ----MNILHSTLSTWRDRLAPV-SRTS-TFRTTGQITP-----EEFVLAGDYLVYKF--PSWSWADASS-----------PAKRVSYLPPGKQFLVTRG-VPCHRRLNENFAG-----DAGHEDEIVR---------DML---------AGDD------GDGWLRTGGGR--DLA------EGHADRTGD-------------VRTVDEAGNIGEREEG-----------------------------------DDEEEEEIPDMEDEDD---------------DEEAIIRE------PAG--------------------TSSTTQPIRTYNLYITYSNFYRTPRLYLSG--YLSPSE--PLPPHLMMEDI---VGDYKDK---------TVTLEDFPWFD-------------GGVKMATVHPCRHASVMKTLLDRADAALKIRRDKLKQAHSPAE----ASRISAERGLEGLVDETRGLSLGEQQQQGG---GGSGGDEWEVLQHDE--EDQVAIRVDQYLVVFLKFI-ASVTPGIEHDF-TMGV---- 119479681 Neosartorya_fischeri_NRRL_181 ----MNILHSTLSTWRDRLAPV-SRTS-TFRTTGQITP-----EEFVLAGDYLVYKF--PTWAWADASS-----------PAKRVSYLPPGKQFLVTRG-VPCHRRLNENFAG-----DAGHEDEIVR------DMLSGA---------DADD------GDGWLRTGGGR--DLA------EKQAERIKD-------------VRTVDESGNMGERE---------------------------------------DDEEDIPDMEDDDD---------------DEEAIIRE------PA---------------------GKSTTQPIRTYNLYITYSNFYRTPRLYLSG--YLSPSE--PLPPHLMMEDI---VGDYKDK---------TVTLEDFPWFD-------------GGLKMASVHPCRHASVMKTLLDRADAALKIRRDKLKQAHSAAE----ANRINSERGLEGLVDETRGLSLNEQQG------HAAGGDEWEVLQHDE--EDQVAIRVDQYLVVFLKFI-ASVTPGIEHDF-TMGV---- 83772271 Aspergillus_oryzae ----MNILHSTLSTWRDRLAPV-SRTS-TFRNTGQITP-----EEFVLAGDYLVYKF--PSWSWADASN-----------PAKRVSYLPPGKQFLVTRG-VPCHRRLNDNFAG-----DAGHDDELVR-----DMLSGGT---------GGVD------DDGWLRTGGGQ--DSA------DRQENRIKD-------------VRTVDESGNMGERE---------------------------------------EEEDEIPDMEDEDD---------------DEEAIIRD------P----------------------ASGTTQPTRTYNLYITYSNFYRTPRLYMSG--YLSPSE--PLPPHLMMEDV---VGDYKDK---------TVTLEDFPWYD-------------GNVKMASVHPCRHASVMKTLLDRADAALKLRREKLKQAQSDPS-------KAPSVGESGLEGLVDDIKALSLSDQQQHGSDKSGGDEWEVLQHDE--EEQVAIRVDQYLVVFLKFI-ASVTPGIEHDF-TMGV---- 116193741 Chaetomium_globosum_CBS_14851 ----MNIIYSTVNSLRDRYTPA-SHTS-TFRNTGEITP-----EEFVAAGDYLVFKF--PSWTWSDAET-----------PAKRVAHLPPEKQYLVTRN-VPCHRRLNDDFAG-----DAGHEEAVVE----------GG---------KNSG------DDGWLRTGGLA--SSQ------PLKARDVRT----------------VDDAGNVADRGAI-------------------------------------DDEDDIPDMEEEED----------------DEAIIQD------G----------------------SHGKHSGSRTYSLYITYSPWYKTPRMYMLG--YQPNGQ--ALIPHLMMEDI---VGDYKDK---------TVTLEDFPFFA-------------TSVKTASIHPCKHAPVMKTLLDRADAALKLRKERQKAGLEVGS---------------NQGLEGLEAQVVKLAVSGTGTDVANDANDEWEEVQHDAADQDVAIRVDQYL-----FI-ASVTPGIEHDF-TMGV---- 85112065 Neurospora_crassa_OR74A ----MNFLRSTAATLLDKYTPV-SHTS-TFRNTGQITP-----EEFVAAGDYLTFKF--PSWSWADADS-----------PSKRLPFLPPGKQFLVTRH-VPCHRRLNDDFAG-----DAGHEEALVE---------GNK---------GGAD------DDGWLRTGSMT--SSQ------PLRVREVRT----------------VDDAGNVGDREVV--------------------------------------DEDDIPDMEDDDD----------------DEAIIRA---EGDNSNR-------------------QDNISTGKRTYTLYITYANAYKCPRMYMSG--YLSNGQ--PLPPHLMMEDI---VGDYKDK---------TVTLEDFPFFS-------------HSVKMASVHPCRHASVMKTLLDRADAALKLRREKMKAGQGSGS----------EQGMEGLVDEINKLDVSGAHA--NAVEAAPGEDAEWEEVPHDVADQEVAIRVDQYLVVFLKFI-ASVTPGIEHDF-TMGV---- 39970985 Magnaporthe_grisea_70_15 ----MNSLYSVVNTLRDRYAPV-SHTS-TFRQTGEITP-----EEFVAAGDYLVFKF--PTWSWGDADS-----------ESRRASHLPPGKQFLVTRN-VPCNRRLNENFAG-----DAGLEEAVVD-----------D---------GDEFKGSKGDDDGWLRTGGLS--SSQ------PLKAREVRT----------------VDDAGNAGERAQP-------------------------------------DDDDDIPDMEDEED----------------DEAIIRD------TD---------------------ASGQTSSRRTYTLYIMYSPYYRTPRLYLSG--YGANGQ--PLPPHNMMEDI---MGDYKDK---------TVTLEDFPFFA-------------NNIKMASVHPCKHAPVMKTLLDRADAALKLRREKLKTGDASGQ----------QAGLEGLADEFNKLGVSGKGD----ASKIDKNDEWEDIQHDDVADQEVAIRVDQYLVVFLKFI-ASVTPGIEHDF-TMGV---- 42548126 Gibberella_zeae_PH_1 ----MNFLYSTVNTLRDRYTPV-SHKS-TFRQTGQITP-----EEFVAAGDYLVYKF--PTWSWGDADS-----------PERRVSHLPPGKQFLVTRN-VPCHRRLNDDFAG-----DAGHEEALVN----DGDDFKGN---------AGDD------EDGWLRTGGLA--SSQ------PLKVKEVRT----------------VDDSGNVGDREVV--------------------------------------EDDEIPDMED-ED---------------DDEAIIRD------SG---------------------ADSKNSAHRTYTLYIMYSPYYRTPRLYLSG--YLANGQ--PLPPTDMTEDI---VGDYKDK---------TVTLEDFPFFA-------------NNIKMASVHPCKHASVMKTLLDRADAALRLRREKLRAGNASSS--------QAPSGMEGLVDEIGKLDVKGAQE-------AADKDEWEEVQEAEIDDQEVAIRVDQYLVVFLKFM-ASVTPGIEHDF-TMGV---- 50553884 Yarrowia_lipolytica_CLIB122 HKSTMLHVRAAASSLREYLTPV-SNTS-TFRTTGEITP-----DEFVKAGDYLVEKF--PTWSWASAS------------KSKVRDFLPPDKQVLVTRH-VPSHVRASTV---------SGPVTLGEE-------------------------------EDGWTSFGVAN----------------------------------------TKDADGDDT-------------------------------------EEIAEIAD-SDFEE-------------LDDDDDDAAE-----------------------------APATATDHRTYNLYIAYSTSYRVPKMFLSG--YSPEGS--PLTPEDMFEDI---IPEYRDK---------TVTIERPTFQD--------------NITMVAIHPCKHANVMRVLMERVEAKGDKDI------------------------------------------TRGVAKLGVADADDGEGEEEWEEVENSAMRVDQYLVTFLKFI-ASVTPGIEHDY-TMSA---- 146448928 Lodderomyces_elongisporus_NRRL_YB_4239 ------MLRSKLSSLREYLTPI-NHNS-NFLTTGEISP-----EEFVKAGDYLVYKF--PTWQWASCP------------KDLQKLFLPTDKQVLVTRH-VPSHQRANEY-FEGEFEVEIDEKDRDLA------------LGNESNLKNGENGENDDEAEYGWIRSGRSS--SEK------GTGEVLDPQ------------RVEEVNDIDELIDETAE---------------------------------GEEEEEEGEEGEGEGNGN---------GADFHADDDADYDD------LDI--------------------VQGSHSKLRRYDLYITYSTSYRVPKLYLVG--FDANGI--PLLPQQMFEDI---NSDYKDK---------TATIEQLPVAH--------------NTTSVSIHPCKHSSVMRVLMKHQRARREHEN---------------------------VAENMKRLSIGSEDHKEAMNHIRRLSAGSKELAQKQEEPNDSEIKVDLYLVIFLKFI-ASVTPGIEYDY-TMDA-L-- 50420719 Debaryomyces_hansenii_CBS767 ------MLRSKLSSLREYLTPI-RHTS-DFTTTGEISP-----EEFVEAGDYLVYKF--PTWQWSSAP------------DKLKKDFLPPDKQYLITKH-VSSYQRAVTYLGI---KSDLDEDEEEL--------------------------------EDGWVKSHKIN-----------NDPSRLKTD-------------ALGENSGGNTNDNDDT----------------------------------NEINDIDELIDENAEEQ-------------SSEDENDFEE------LV---------------------ETNANSNLRKYDLYITYSTSYRVPKMYLVG--FNSNGI--PLLPKQMFEDI---SGDYRDK---------TATIETLPVSY--------------NTMSVSIHPCKHSSVMKVLMAHAAASKKREN---------------------------------PADLTDLAEQTGSLNLKDKVPGRDFGEDVDIEENVPGIRVDQYLIIFLKFI-ASVTPGIEYDY-TMDA---- 146413535 Pichia_guilliermondii_ATCC_6260 RTYRLLMLRSKLSSLREYLTPI-NHNS-NYETSGEISP-----EEFVQAGDYLVYKF--PTWQWGSAP------------KKLQKDFLPPDKQFLITKH-VPSYQRAQSY---------LGNTEDLAE---------------------DEEEL-----DDGWVKSHRLT-----------------------------------------HEDPKRDI-------------------------------------ATDKKVPDIEDLDD------------FIDEDAEDADG------EEF--------------------QDLGNSNLRRYDLYITYSTSYRVPKMYLVG--FNDNGI--PLLPHQMFEDI---SGDYRDK---------TATIENLPVSF--------------NTTSVSIHPCKHLSVMKVLMKHAKTSREKAR----------------------------------------------EFEPEAFVTGENLEPATETTQDTGIRVDQYLVVFLKFI-ASVTPGIGYDY-TMDA---- 68473758 Candida_albicans_SC5314 -----MSLRSKLSSLREYLTPI-NHNS-NFVTTGEISP-----EEFVKAGDYLVYKF--PTWQWGNDCP-----------KNLQKSFLPPDKQYLVTRH-VPSYQRASNYLTGEDKKGANPEEDDEEE---------------------EEED------EEGWVKSKKIH-----------KVIDDTHDS---------------QINKGEEINDIDDF------------------------------IDENAEEQEHDQIGDHELDDD--------------EFDDLDIIN------------------------------DSKNNKLRRFDLYITYSTSYRVPKLYLVG--FDSNGI--PLLPQQMFEDI---NSDYKDK---------TATIENLPVAH--------------NTTSVSIHPCKHSSVMKVLMKHSKLNKKNLQ--------------------------------QKDESLSDDLSKLSVNEKKTQDEHSQINNDDKEEEEEGIRVDHYLIIFLKFI-ASVTPGIEYDY-TMDA---- 68473967 Candida_albicans_SC5314 -----MSLRSKLSSLREYLTPI-NHNS-NFVTTGEISP-----EEFVKAGDYLVYKF--PTWQWGNDCP-----------KNLQKSFLPPDKQYLVTRH-VPSYQRASNYLTGEDKKGANPEEDDEEE---------------------EEED------EEGWVKSKKIH-----------KVIDDTHDS---------------QINKGEEINDIDDF------------------------------IDENAEEQEHDQIGDHELDDD--------------EFDDLDIIN------------------------------DSKNNKLRRFDLYITYSTSYRVPKLYLVG--FDSNGI--PLLPQQMFEDI---NSDYKDK---------TATIENLPVAH--------------NTTSVSIHPCKHSSVMKVLMKHSKLNKKNLQ--------------------------------QKDESLSDDLSKLSVNEKKTQDEHSQINNDDKEEEEEGIRGDHYLIIFLKFI-ASDTPGMEYEY-TMDA---- 126030345 Saccharomyces_cerevisiae ----GSMIRSTLSSWREYLTPI-THKS-TFLTTGQITP-----EEFVQAGDYLCHMF--PTWKWNEESS-----------DISYRDFLPKNKQFLIIRK-VPCDKRAEQC-------VEVEGPDVIMK-------------------GFAEDG------DEDDVLEYIGS------------ETEHVQST------------------PAGGTKDSSID-------------------------------------DIDELIQDMEIKEE----------DENDDTEEFNAKG----------------------------GLAKDMAQERYYDLYIAYSTSYRVPKMYIVG--FNSNGS--PLSPEQMFEDI---SADYRTK---------TATIEKLPFYK-------------NSVLSVSIHPCKHANVMKILLDKVRVVRQRRR---------------------------------------------KELQEEQELDGVGDWEDLQDDIDDSLRVDQYLIVFLKFI-TSVTPSIQHDY-TMEG---- 50290141 Candida_glabrata_CBS_138 ------MIRSALSNWREYLTPV-SHKS-TFLTTGQITP-----EEFVQAGDYLCHMF--PTWKWNDMAD-----------DNKYRDFLPKDKQFLVIRK-VPCSERAQAVVTM-------DEIENGTS-------------------------------TDAFSAADDED-----------------------------------NDDDSIEIIPVSKS-----------------------------------SSGADNDVNDIDELME----------EMELEEDDDIVAN-------------------------------KTNEMLRYYDLFITYSTSYRVPKMYIVG--FNGNGT--PLTPKEMFEDI---TPDYRKK---------TATIEKLPFYK-------------RNVPSVSIHPCKHANVMKVLLDKISVVKERQR----------------------------------EEEMQKNAEVGAPKSAGSDDGDNENWEDLQQDIDDSLRVDLYLVVFLKFI-TSVTPTIQHDY-TMEG---- 50310697 Kluyveromyces_lactis_NRRL_Y_1140 ------MLRSTLSNWREYLTPI-SHTS-TFETSGQLTP-----EEFVKAGDYLVHMF--PTWKWNGNDFQ----------NVHHKDFLPNDKQFLVTKK-VPCKLRANNYLEL-----DDTETKDV---------------------------------GDGWALQEEHS-----------QDSERKSTN----------------NADDSELPEELEE--------------------------------LHIVDDGDDEEYDDQLYDN-------------EFADDDIVDI--------------------------------RPSTLRFYDLYITYSTSYRVPKMYLCG--YDNDGT--PLSPDQMFEDI---AADYRSK---------TATIEPLPFLK-------------GNNISVSIHPCKHANVMKVLMEKVRSSRSRAR-------------------------------------------------KVDPQTTDEDWEDLQSDVDDGLRVDQYLVVFLKFI-TSVTPGIEHDY-TMEG---- 45198343 Ashbya_gossypii_ATCC_10895 ------MLRSTLSNWREYLTPV-THQS-TFENTGQITP-----EEFIKAGDYLCHMF--PTWRWNQQQG-----------GMVYRDFLPQDRQFLVTRK-VPSNMRAADS-------VNVGGEEETSA-------------------------------GEYWVLQPQQE---------------------------------------SADGGEEIDI---------------------------------------DEMLQEMDIEDQ--------------SGEQDIIQL--------------------------------RPSHTRFYDLYITYSTSYRVPKMYLVG--FNNDGT--PLTPQQMFQDI---APDYRTK---------TATIEKLPFFK-------------SAVVAVSIHPCRHANVMRVLMEKVRAAKQKPV------------------------------------------------EEEQPDGPREDWEDLQDEVDSSLRVVQYLVVFLKFI-TSVTPGIEHDY-TMEG---- 124506667 Plasmodium_falciparum_3D7 QINVKHKIGDTCRKLYSYFKTV-NNTS-TFIQNGTLTP-----SEFVDSGDFLVYKF--KTWEWQEAD------------KDRVVPYLPENKQFLITKN-VPCKQRIKDL---------NNIVHDLI--------------------------------MIGYSQVMKKI--TIP------QIYMNIYLI-----------VNTQLMTKMYKMYKITSC-------------------------------------YSKNMLHDSNSFKN----------------------------------------------------NFIIIIKNFTCLNIITYDKYYQTPRIWLFG--YNENGD--PLKSEEIFEDI---LSDYSYK---------TVTVKIKIIFM------------------ILIRVLVHAEAILNVVNNWISEEKEPR---------------------------------------------------------IKKRINVYMYVFCFRHDLYLLFLLKFI-SGVIPTIEYDF-TTDIEIPR 68069687 Plasmodium_berghei_strain_ANKA ----MHKIGDAYRKVYSYFKPI-TNSS-SFIKNGTLTP-----SEFVDAGDFLTHKF--KTWEWMEAD------------DNRTVRHLPEKKQFLISRN-VPCKHRIKDL-------------NNIFH-------------------------------NLQLLLLFFFC-----------------------------------QEHDPAAIDPSRLY---------------------------------------------LEAFGN----------DRKIKNIKFKILIILCNIYDLYISSNH---------------HISDIISIRSYDISITYDAYYETPRIWLFG--YNENRH--PLKSEEIFEDI---LSDYSYK---------TVTVFYFELLN------------------LFIFFYRHAEIMLNVINNWIGEGKEPS-----------------------------------------------------MRTIIVIFHFIYLFIYFFRHDLYLLFLLKFI-SGVIPTIEYDF-TTDIEIPS 71003271 Ustilago_maydis_521 MN----ALQTHFWAVREYLSPV-LRES-KFKEHGRITP-----DEFVAAGDFLSYKF--PTWQWCAGS------------SSKARDYLPKDKQFLISCG-VPSLRRVSQI-EKGVGVGVKDDDEKLMSF--------------------GEEGGADAPEDDQWVATHFDD--QQT------GSSSQ-VAD----------------MLDIPDIGEDQLAPEHQLTEGQQDDLAARVAGVTIGHQSDSIHASASGDGM--GDIDDIPDIPDMDDETDELAAGVHEDEDPATAAP------PTHNAARTGWASAG---------DNGKLLSVRKYDCIITYDKYYQTPRMWLVG--YDEHGV--PLKPAQIFEDV---SSDYAQK---------TVTIEPFPHGHAGPDSSVTSSASAVGVATASIHPCKHASVMKKVIERMNASVIEEQRRAAACSGTAS---VAGEKKKKKGWSVSSAVKRVTGGATDGSSAPTAEAKEDGSTTAEAAATEAEDDVDGLRVDQYMIIFLKFM-ASIVPAIEIDA-T--QAL-- 58260390 Cryptococcus_neoformans_var_neoformans_JEC2 MNNPLLAIQSQYWAVRDYLSPV-LRES-KFKEHGRITP-----EEFVAAGDFLTFKF--PVWQWEKGE------------SSRARDFLPPDKQYLVTRN-VPCLRRATAV-DYTNADEDAEKLLSFLD---------------------DAEEAPGP--DDDWVATHINR--SPP------HRPTDMDEI--------------PDIPDSPTTAPTREM---------------------------AGLNVSSGGKLEEDEIPDIDDIPD-------MDEEGLEDLEDDAAVR---IVHPSEAEVNST--------------AGKNLLQVRTYDCIISYDKHYQTPRFWLFG--YDEHKN--PLTPAQVFQDV---PADHAFK---------TMTMESFPHSG---------------AQLASVHPCKHASVMKKFIDRMEAAQGPAP--------------TAEPETISSTSGTSGSAGGKEEKEKKKKWGLGSMVRKVTGGSVPKVEKDKDEVVTGVPVDFYLVIFLKFI-ASIVPTIEVDS-TTSTAL-- 66359970 Cryptosporidium_parvum_Iowa_II MQSITHRLADQFRSVVGNFIPIDCSNS-KFESDGFLTP-----KEFVDSGDYLILQF--PNWFWRSAS------------EEYIVRWLPQNKQYLHIDN-VPCRKRLDSS--------KLCISKNCLD-------------------ITSDSK------GDEWILPTNEN----------------------------------------------------------------------------------------VEKLGNININDD-----------------------------------------------------------VRYYDISVTYDKFFQTPRIWLFG--YNKEGY--PLSTEEMVEDI---ISDYATK---------TITLDPHPFTG---------------ILCVSIHPCNHSSLLKKMAKNHP-------------------------------------------------------------------------------PHLSIVILLKFI-TTVIPSIELDN-TIDIDIKF 67609529 Cryptosporidium_hominis_TU502 MQSITHRLADQFRSVVGNFIPIDCSNS-KFESDGFLTP-----KEFVDSGDYLILQF--PNWFWRSAS------------EEYIVRWLPQNKQYLHVDN-VPCRKRLDSSKLC------ISKNSFDIT---------------------SDNK------GDEWILPTNEN----------------------------------------------------------------------------------------VEKLGSININDD-----------------------------------------------------------VRYYDISVTYDKFFQTPRIWLFG--YNKEGY--PLSTEEMVEDI---ISDYATK---------TITLDPHPFTG---------------ILCVSIHPCNHSSLLKKMAKNHP-------------------------------------------------------------------------------PHLSIVILLKFI-TTVIPSIELDN-TIDIDIKF 56471771 Entamoeba_histolytica_HM_1IMSS LSTLKRKAYETYTKSVDLVKPT-LTES-QFIEKGVLTP-----EEFVNAGDFLVNKY--RTWQWVGASD-----------CKKTVDYLPADKQFLITRN-IRCATRAQGG-------PPTKTETLIVD-------------------------------GEEFEVPLEEK-------------------------------------PEEVFEEDSDDI-------------------------------------VDADDIVDADELSE--------------EADDTVATV--------------------------------DVGKTRTYDISIIYSHFDRTPKVWLLG--YDEDHK--PLTESQMFEDL---SATHAGQ---------TATTDTHPFLD---------------IKEIYIHPCRHAQVMKKRVDEMIADGKTPR------------------------------------------------------------------------VDLYLMIFLKFL-ATVIPTMEYDY-GKDF---- X#46.m01688 Toxoplasma_gondii GSNVTHRLADMGRNLVASFTSA-PTAS-SFISKGMLTP-----SEFVDAGDLLTHKF--PTWQWKGVGPT----------GKRASGWLPEDKQYLITKN-VPCYRRVRDM---------DDALNTRVG----------------------HDV------EGGWMLPLLNDEEREG------GSSGEAPDL-------TQSMQNLRLNAECGHELRKPAP---------------------------PTPTQTSASTASNQDLINFADIDC----------LVQEDDDPAAAEA------PSVVRTS----------------PDAEIVAARSYDLSITYDKYFQTPRIWLFG--YSENGV--PLLPEEIFEDI---LTDYAAK---------TVTVDPHPCTG---------------IPTASIHPCRHASVMKKVVDSWVESGVRPR------------------------------------------------------------------------HDLALLILLKFV-SSVIPTIEYDF-TMDVDMLI Tpseu1000007918 Thalassiosira_pseudonana ----------------EYLSPT-LKSS-AFLTRGVLTP-----EEFVKAGDELVYKC--PTWTWESGD------------PAKRKKHLPADKQYLATRG-VPCTARVSSLENVVAVSNHNEGCGAIGG---------------------LDDD------DGDWLVSQILT--TEE------DEFDILDGE--------------GEVMDGGVEGKVAQL-------------------SLGGDKDNSENNQGGADDDQEDEYADMADFED----------DNVLEDEAAVVAA---PVSTNATTKGEN--------------DNNHILKVRTYDLSITYDKYYQTPRVWLLG--YADDGSSRPLTGDEMMQDV---ISDYAHR---------TVTIENHPNIS---------------GAHASIHPCQHGAVMKTIVKNLTREEGSGG----------------------------------------------------------------------PSVEMYLFIFLKFV-SSMIPTINYDF-TMD----- Ptri1000002031 Phaeodactylum_tricornutum -------MMGHFWAAREYLTPT-LKTS-AFLEKGVLTP-----DEFVRAGDELVFRC--PTWSWQGNSRGSGSQ------ASATKTYLPAGKQYLVTRN-VPCQARVASM----------ETAMDLQR---------------------GEDD------EGDWLISNFIQ--HKE------RCIEDEFDI----------------LDETGEIMDVPKT-------------------------------ATLEESDGNDEYADMADFED----------DNVIRDDVATAVV------VD---------------------RDDNLIKTRTYDLSITYDKYYQTPRVWMMG--MSAEGQ--PLSGQEMMEDV---ISDYANK---------TVTIEAHPHVS---------------GPHASIHPCQHGKVMKTIVRNLMQSSTDGD--------------------------------------------------------------------EGPSVEMYIFIFLKFV-SSIIPTINYDF-TMDVTAST 116508016 Coprinopsis_cinerea_okayama7#130 MLSRVNLQRASLDPRPGFAWSP-MV---DLRRGAALHS-----TP--RRSYHASSAC--SVPLRD---------------PSKTRDFLPADKQYLVTRG-VPCLRRAQSLAYT----DADEDAERLVNFSGDEPTTSTPAASSSGGKSKDKDKNAAAADDDDWVETHAGRKVTQPGTLGAGEGLGEIPDV----------------DDEDDDDGLSKGM---------------GGVSLGGAGGGVGGGQGGAQETPDLDDIPDMEEDLE--------------EADEATATKPAAAAPAAGAAKTSAVVEP----------AKTNLLQVRTYDVMITYDKYYQTPRLWLIG--YDENRT--PLTPQQIFQDI---SADHAQK---------TVTIEQFIHST--------------SLQAASVHPCKHSSVMKKIIERMNDKVVAEQLAAKEAKEKAL---------GGASGKDAKEPKEKKKWLFGRKSSGGKEASSSSSAAGAAGDDKDSDEPEGMRVDFYLVVFLKFI-ASIVPTIEVDS-TTSF---- 39585989 Caenorhabditis_briggsae MQDIVNSFKSAALSIGETFTPV-LKES-KFRETGVLTP-----EEYVAAGDHLVHHC--PTWKWSTASD-----------PSKIRPFLPADKQFLITKN-VPCHKRCKQM-EY------DEKLEKIIN---------------------DDEGEYATGEESGWVDTHHYE--KEK------ETTAVVTET---------------SPAPAPPTSPESDD-------------------------------------DSDGEALDLDDLIE----------SGALDSDENDDDP------NRFVNEKAAKLNTSSGDA-----AGAEVEKIRTYDLHICYDKYYQVPRLFLMG--YDENRR--PLTVDQTYEDF---SADHSNK---------TITVETHPSMD---------------LQMPTVHPCKHAEMMKRLINQYAESGKELG------------------------------------------------------------------------VHEYLFLFLKFV-QAVIPTIEYDF-TRAIKL-- 17543646 Caenorhabditis_elegans MQNLVNNLKSAALQIGETFTPV-LRES-KFRETGVLTP-----EEYVAAGDHLVHHC--PTWKWAGASD-----------PSKIRTFLPIDKQFLITRN-VPCHKRCKQM-EY------DEKLEKIIN---------------------EEDGEYQTSDETGWVDTHHYE-----------KEHEKNEEK------------------EQSTAPPPAAP---------------------------------EDSDDDDEEPLDLDELGL--------------DDDEEDPNRFVVEKKPLAAGND----------------NSGEVEKVRTYDLHICYDKYYQVPRLFLMG--YDENRR--PLTVEQTYEDF---SADHSNK---------TITVEAHPSVD---------------LTMPTVHPCKHAEMMKRLINQYAESGKVLG------------------------------------------------------------------------VHEYLFLFLKFV-QAVIPTIEYDY-TRAIKL-- 56753163 Schistosoma_japonicum MDTVRQAVTRTALGLAEYVTPV-LKAT-KFRETGVITP-----EEFVAAGDFLVYHC--PTWQWSIG-------------DKPARPYLPKEKQYLVTRS-VPCYKRVKQMADH------HEEFEKVLE---------------------EEDG------DGGWVDTHHYA-TKYG------DLNPEVEKT---------NEMSVPSVSHSLKKSNLEET---------------------------------IISDCEDEEDEDMEAFLQ---------SGMLDECDPAAVKARTKVANDGNTSQAMADFGIDQNIDR----ENNGILQTRTYDLYITYDKYYQTPRLWLFG--YDEQHQ--PLTETQMYEDF---SQDHAKK---------TVTTEAHPHLS--------------GHMMPSIHPCRQADVMRKLIEAVADNGAELA------------------------------------------------------------------------VHQYLMVFLKFV-QAVIPTIEYDY-TRNFNLSS 91092526 Tribolium_castaneum MQNVINSVKGTALGVAEYLTPV-LKES-KFRETGVLTP-----EEFVAAGDHLVHHC--PTWQWAAGD------------ESKAKPYLPKDKQFLVTRN-VPCTRRCEQM-EY------SEDLERIIE---------------------SDDA------DNGWVDTHHYD-QDQT------TAVVEEKIS------------EMTLSNSKGESMEEAAL----------------------------NDEENDDEDDDDDEAADMEAFEE----------SGMLDDAQATTVI------QPKVTAKKELD------------EDGEIIHTRTYDLHITYDKYYQTPRLWLTG--YNEHHK--PLTVEELYQDV---SKDYAMK---------TVTMETHPHLS--------------VPKMASVHPCRHAEVMKSIIETVTEGGGELG------------------------------------------------------------------------VHMYLIIFLKFM-QSVIPTIEYDY-TQNFTM-- 66564768 Apis_mellifera MQSVINSVKGTALGVAEYLTPV-LKES-KFRETGVLTP-----EEFVAAGDHLVHHC--PTWQWATGD------------EDRVKSYLPKKKQFLLTRN-VPCTRRCKQI-EY------SKEQECIIE---------------------ADDP------EGGWVDTHHYD-MSLS------GIEERVTEMTLDETQLTQSIGTEENNGEENDDGDNNDD-----------------------------------DDDDDEDAADMEAFEV----------SGMLDEEDKYAAK---ITKKPIKDKWESF-------------SEGEIIHTRTYDLYITYDKYYQTPRLWLSG--YDENRK--PLTVEEMYEDV---SQDHAKK---------TVTMEIHPHIP--------------GPLMASVHPCRHAEVMKKIIETVMEGGRELG------------------------------------------------------------------------VHMYLIIFLKFV-QSVIPTIEYDY-TQNFMLNA 125978913 Drosophila_pseudoobscura MQSVLNSVKGTALNVAEYLTPV-LKES-KFRETGVLTP-----EEFVAAGDHLVHHC--PTWQWAAGD------------DTKTKPYLPKDKQFLITRN-VPCYRRCKQM-EY-------VGEETLVE---------------------EESG------DGGWVETHQLNDDGTT------QLEEKICEL------------TMEETKDEMHTPDSDKS---------------------------APGDAVDGEDEDDDEAIDMDAFEE----------SGMLDLVDPAVATTTRKLEAEPKSSPAAAATSEA--------SGDSVLHTRTYDLHITYDKYYQTPRLWVVG--YDEQRK--PLTVEQMYEDV---SQDHAKK---------TVTMESHPHLP--------------GPNMASVHPCRHADIMKKIIQTVEEGGGQLG------------------------------------------------------------------------VHLYLIIFLKFV-QTVIPTIEYDF-TQNFNMS- 21357935 Drosophila_melanogaster MQSVLNTVKGTALNVAEYLTPV-LKES-KFRETGVLTP-----EEFVAAGDHLVHHC--PTWQWAAGD------------ETKTKPYLPKDKQFLITRN-VPCYRRCKQM-EY-------VGEETLVE---------------------EESG------DGGWVETHQLN-DDGTT-----QLEDKICEL------------TMEETKEEMHTPDSDKS---------------------------APGAGGQAEDEDDDEAIDMDDFEE---------SGMLELVDPAVATT-TRKPEPEAKASPVAAASGDAEA------SGDSVLHTRTYDLHISYDKYYQTPRLWVVG--YDEQRK--PLTVEQMYEDV---SQDHAKK---------TVTMESHPHLP--------------GPNMASVHPCRHADIMKKIIQTVEEGGGQLG------------------------------------------------------------------------VHLYLIIFLKFV-QTVIPTIEYDF-TQNFNMS- 38047557 Drosophila_yakuba MQSVLNTVKGTALNVAEYLTPV-LKES-KFRETGVLTP-----EEFVAAGDHLVHHC--PTWQWAAGD------------ETKTKPYLPKDKQFLITRN-VPCYRRCKQM-EY-------VGEETLVE---------------------EESG------DGGWVETHQLN-DDGT-----TQLEDKICEL------------TMEETKEEMHTPDSDKS---------------------------APGAGAEAEDEDDDEAIDMDDFEE----------SGMLELVDPAVATTTRKPEAEAKASPVAAASGDAEA------SGDSVLHTRTYDLHISYDKYYQTPRLWVVG--YDEQRK--PLTVEQMSEDV---SQD------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ 58379448 Anopheles_gambiae_str_PEST MQNVLNSVKGTALGVAEYLTPV-LKES-KFRETGVLTP-----EEFIAAGDHLTHHC--PTWSWAVGD------------ESKIKPYLPKDKQFLITRN-VPCRRRCKQI-EF-------VGEENLVE---------------------ENDP------DGGWVETHHYNPDEAG----SSGLEDKVCEM--------KLDSSRIEDEPAADMDDPRNL--------------------------EDGDGDGGQDDDEDGAAIDMDEFEE-------SGLLEMVDPSNALLPA------PNEKPKPTVAASET---------EGDSVVRTRTYDLHITYDKYYQTPRLWVIG--YDENRK--LLSVEQMYDDV---SQDHAKK---------TVTMETHPHLP--------------GPNMASVHPCKHADIMKKIIQTVEEGGGELG------------------------------------------------------------------------VHMYLIIFLKFV-QTVIPTIEYDF-TQNFNI-- 108883736 Aedes_aegypti MQNVINSVKGTALGVAEYLTPV-LKES-KFRETGVLTP-----EEFVAAGDHLTHHC--PTWSWAVGD------------ETRIKPYLPKDKQFLITRN-VPCYRRCKQM-EY-------VGEETLVE---------------------ESDQ------DGGWVETHHFNPDGAS----GSGLEDKVCEM--------TLEGAKMDDDVAADMDDPRNL---------------------------EDGDGNGDDDDDEGAAIDMDEFEE----------SGMLDQVDPSLAT------VVPTENVPKKPSEQ---------DGDSVVHTRTYDLHITYDKYYQTPRLWVVG--YDENRK--PLTVEQMYEDV---SQDHAKK---------TVTMETHPHLP--------------GPNMASVHPCKHADIMKKIIQTVEEGGGELG------------------------------------------------------------------------VHMYLIIFLKFV-QTVIPTIEYDF-TQNFNITN ci0100142936 Ciona_intestinalis MQNVFNSVKSSALGVAELLTPV-LKES-KFKETGVLTP-----EEFVIAGDFLVHHC--PTWQWLAGD------------KTKTKAYLPQDKQYLMTRN-VPCHKRCKQM-EY------NEEHEAIID----------------------DIG------DGGWVDTHHNV--EVD------KAQEEIKEM-------------TLNKEKTNNVNLNEDD-------------------------------GDDDDDDDDEEALDMENYAE-----------MLESEDKATVDL------VAVKKANTSSTTN----------EDGGIIQTRTYDLHITYDKYYQTPRLWLSG--YNEDRK--PLSMVEMYEDI---SQDHVNK---------TVTIEPHPHLP--------------PPNMCSVHPCKHADVMKKIMQTVEDGGGKLE------------------------------------------------------------------------VHMYLLVFLKFV-QAVIPTIEYDY-TRHFSL-- 41053345 Danio_rerio MQNVINSVKGTALGVAEFLTPV-LKES-KFKETGVITP-----EEFVAAGDHLVHHC--PTWKWASGE------------EAKVKPYLPNDKQFLLTRN-VPCYKRCKQM-EY------SDELEAIIE---------------------EDDG------DGGWVDTFHNS--GVT------GVTEAVREI-------------SLDNKDNMNMNVKTGA----------------------------CGNSGDDDDDEEGEAADMEEYEE----------SGLLETDDATLDT------SKMADLSKTKAEAG---------GEDAILQTRTYDLYITYDKYYQTPRLWLFG--YDEDRQ--PLTVDQMYEDI---SQDHVKK---------TVTIENHPNLP--------------PPAMCSVHPCRHAEVMKKIIETVAEGGGELG------------------------------------------------------------------------VHMYLLIFLKFV-QAVIPTIEYDY-TRHFTM-- 148237896 Xenopus_laevis MQNVFNTVKGKALEVAEYLTPV-LKES-KFKETGVITP-----EEFLAAGDHLVHHC--PTWQWSAGE------------ESKIKPYLPNDKQFLMTKN-VPCYKRCKQM-EY------SDEQEAIIE---------------------EDDG------DGGWVDTFHHT--GLS------GVTEAVKEI-------------TLETQDCGKTTDNIAV--------------------------------CDDDDDDEGEAADMEDYEE----------SGLLENDDATVDT------SKIKEACKPKADLG---------GEDAILQTRTYDLYITYDKYYQTPRLWLFG--YDEQRR--PLAVENMYEDI---SQDHVKK---------TVTIENHPHLP--------------PPPMCSVHPCRHAEVMKKIIETVAEGGGELG------------------------------------------------------------------------VHMYLLIFLKFV-QAVIPTIEYDY-TRHFTM-- 58332690 Xenopus_tropicalis MQSVINTVKGKALEVAEYLTPV-LKES-KFKETGVITP-----EEFLAAGDHLVHHC--PTWQWSAGE------------ESKIKPYLPNDKQFLMTKN-VPCYKRCKQM-EY------SDEQEAIIE---------------------EDDG------DGGWVDTFHHS---LT------GVTEAVKEI-------------TLETQDCGKTTSNIAV--------------------------------DDDDDDDEGEAADMEDYEE----------SGLLDNDDATVDT------SKIKEACKPKADLG---------GEDAILQTRTYDLYITYDKYYQTPRLWLFG--YDEQRR--PLTVENMYEDI---SQDHVKK---------TVTIENHPHLP--------------PPPMCSVHPCRHAEVMKKIIETVAEGGGELG------------------------------------------------------------------------VHMYLLIFLKFV-QAVIPTIEYDY-TRHFTM-- 50729624 Gallus_gallus MQNVINTVKGKALEVAEYLTPV-LKES-KFKETGVITP-----EEFVAAGDHLVHHC--PTWQWASGE------------ELKVKAYLPTDKQFLVTKN-VPCYKRCKQM-EY------SDEQEAIIE---------------------EDDG------DGGWVDTFHNA--GIV------GATEAVKEI-------------TLDSKDNIKIPERSAS-------------------------------CEDDDDEDEGEAADMEEYEE----------SGLLETDDATLDT------RQIVEVKAKVDVG----------GEDAILQTRTYDLYITYDKYYQTPRLWLFG--YDEQRQ--PLTVEHMYEDI---SQDHVKK---------TVTIENHPHLP--------------PPPMCSVHPCRHAEVMKKIIETVAEGGGELG------------------------------------------------------------------------VHMYLLIFLKFV-QAVIPTIEYDY-TRHFTM-- 119600077 Homo_sapiens MQNVINTVKGKALEVAEYLTPV-LKES-KFKETGVITP-----EEFVAAGDHLVHHC--PTWQWATGE------------ELKVKAYLPTGKQFLVTKN-VPCYKRCKQM-EY------SDELEAIIE---------------------EDDG------DGGWVDTYHNT--GIT------GITEAVKEI-------------TLENKDNIRLQDCSAL-------------------------------CEEEEDEDEGEAADMEEYEE----------SGLLETDEATLDT------RKIVEACKAKTDAG---------GEDAILQTRTYDLYITYDKYYQTPRLWLFG--YDEQRQ--PLTVEHMYEDI---SQDHVKK---------TVTIENHPHLP--------------PPPMCSVHPCRHAEVMKKIIETVAEGGGELG------------------------------------------------------------------------VHMYPSLYVRLV-AKWLLTIFFVS-SYFLEICT 114588464 Pan_troglodytes MQNVINTVKGKALEVAEYLTPV-LKES-KFKETGVITP-----EEFVAAGDHLVHHC--PTWQWATGE------------ELKVKAYLPTGKQFLVTKN-VPCYKRCKQM-EY------SDELEAIIE---------------------EDDG------DGGWVDTYHNT--GIT------GITEAVKEI-------------TLENKDNIRLQDCSAL-------------------------------CEEEEDEDEGEAADMEEYEE----------SGLLETDEATLDT------RKIVEACKAKTDAG---------GEDAILQTRTYDLYITYDKYYQTPRLWLFG--YDEQRQ--PLTVEHMYEDI---SQDHVKK---------TVTIENHPHLP--------------PPPMCSVHPCRHAEVMKKIIETVAEGGGELG------------------------------------------------------------------------VHMYPSLYVRLV-AKWLLRFFFFFEKFSVTLMI 45708793 Homo_sapiens MQNVINTVKGKALEVAEYLTPV-LKES-KFKETGVITP-----EEFVAAGDHLVHHC--PTWQWATGE------------ELKVKAYLPTGKQFLVTKN-VPCYKRCKQM-EY------SDELEAIIE---------------------EDDG------DGGWVDTYHNT--GIT------GITEAVKEI-------------TLENKDNIRLQDCSAL-------------------------------CEEEEDEDEGEAADMEEYEE----------SGLLETDEATLDT------RKIVEACKAKTDAG---------GEDAILQTRTYDLYITYDKYYQTPRLWLFG--YDEQRQ--PLTVEHMYEDI---SQDHVKK---------TVTIENHPHLP--------------PPPMCSVHPCRHAEVMKKIIETVAEGGGELG------------------------------------------------------------------------VHMYPSLYVRLV-AKWLLTIFFFE-KFSVTLMI 126325703 Monodelphis_domestica MQNVINTVKGKALEVAEYLTPV-LKES-KFKETGVITP-----EEFVAAGDHLVHHC--PTWQWATGE------------DLKVKAYLPNGKQFLVTKN-VPCYKRCKQM-EY------SDELEAIIE---------------------EDDG------DGGWVDTYHNS--GIT------GVTEVVKEI-------------TLESKDSVKLRDCSAL-------------------------------CEEEEEEDEGEAADMEEYEE----------SGLLETDEATLDT------RKIVEANKVKNDAG---------GEDAILQTRTYDLYITYDKYYQTPRLWLFG--YDEQRQ--PLTVEHMYEDI---SQDHVKK---------TVTIENHPHLP--------------PPPMCSVHPCRHAEVMKKIIETVAEGGGELG------------------------------------------------------------------------VHMYLLIFLKFV-QAVIPTIEYDY-TRHFTM-- 74002684 Canis_lupus_familiaris --MCINTVKGKALEVAEYLTPV-LKES-KFKETGVITP-----EEFVAAGDHLVHHC--PTWQWATGE------------ELKVKAYLPTGKQFLVTKN-VPCYKRCKQM-EY------SDELEAIIE---------------------EDDG------DGGWVDTYHNA--GVT------GITEAVKEI-------------TLESKDSIKLQDCSAV-------------------------------CEEEEEEDEGEAADMEEYEE----------SGLLETDEATLDT------RKIVEACKAKTDAG---------GEDAILQTRTYDLYITYDKYYQTPRLWLFG--YDEQRQ--PLTVEHMYEDI---SQDHVKK---------TVTIENHPHLP--------------PPPMCSVHPCRHAEVMKKIIETVAEGGGELG------------------------------------------------------------------------VHMYLLIFLKFV-QAVIPTIEYDY-TRHFTM-- 13385890 Mus_musculus MQNVINTVKGKALEVAEYLTPV-LKES-KFKETGVITP-----EEFVAAGDHLVHHC--PTWQWATGE------------ELKVKAYLPTDKQFLVTKN-VPCYKRCKQM-EY------SDELEAIIE---------------------EDDG------DGGWVDTYHNT--GIT------GITEAVKEI-------------TLESKDSIKLQDCSAL-------------------------------CDEEDEEDEGEAADMEEYEE----------SGLLETDEATLDT------RKIVEACKAKADAG---------GEDAILQTRTYDLYITYDKYYQTPRLWLFG--YDEQRQ--PLTVEHMYEDI---SQDHVKK---------TVTIENHPHLP--------------PPPMCSVHPCRHAEVMKKIIETVAEGGGELG------------------------------------------------------------------------VHMYLLIFLKFV-QAVIPTIEYDY-TRHFTM-- 61211340 Rattus_norvegicus MQNVINTVKGKALEVAEYLTPV-LKES-KFKETGVITP-----EEFVAAGDHLVHHC--PTWQWATGE------------ELKVKAYLPTGKQFLVTKN-VPCYKRCKQM-EY------SDELEAIIE---------------------EDDG------DGGWVDTYHNT--GIT------GITEAVKEI-------------TLESKDSIKLQDCSVL-------------------------------CDEEEEEEEGEAADMEEYEE----------SGLLETDEATLDT------RRIVEACKAKADAG---------GEDAILQTRTYDLYITYDKYYQTPRLWLFG--YDEQRQ--PLTVEHMYEDI---SQDHVKK---------TVTIENHPHLP--------------PPPMCSVHPCRHAEVMKKIIETVAEGGGELG------------------------------------------------------------------------VHMYLLIFLKFV-QAVIPTIEYDY-TRHFTM-- 109033028 Macaca_mulatta MQNVINTVKGKALEVAEYLTPV-LKES-KFKETGVITP-----EEFVAAGDHLVHHC--PTWQWATGE------------ELKVKAYLPAGKQFLVTKN-VPCYKRCKQM-EY------SDELEAIIE---------------------EDDG------DGGWVDTYHNT--GIT------GITEAVKEI-------------TLENKDNIRLQDCSAL-------------------------------CEEEEEEDEGEAADMEEYEE----------SGLLETDEATLDT------SKIVEACKAKTDAG---------GEDAILQTRTYDLYITYDKYYQTPRLWLFG--YDEQRQ--PLTVEHMYEDI---SQDHVKK---------TVTIENHPHLP--------------PPPMCSVHPCRHAEVMKKIIETVAEGGGELG------------------------------------------------------------------------VHMYLLIFLKFV-QAVIPTIEYDY-TRHFTM-- 19526773 Homo_sapiens MQNVINTVKGKALEVAEYLTPV-LKES-KFKETGVITP-----EEFVAAGDHLVHHC--PTWQWATGE------------ELKVKAYLPTGKQFLVTKN-VPCYKRCKQM-EY------SDELEAIIE---------------------EDDG------DGGWVDTYHNT--GIT------GITEAVKEI-------------TLENKDNIRLQDCSAL-------------------------------CEEEEDEDEGEAADMEEYEE----------SGLLETDEATLDT------RKIVEACKAKTDAG---------GEDAILQTRTYDLYITYDKYYQTPRLWLFG--YDEQRQ--PLTVEHMYEDI---SQDHVKK---------TVTIENHPHLP--------------PPPMCSVHPCRHAEVMKKIIETVAEGGGELG------------------------------------------------------------------------VHMYLLIFLKFV-QAVIPTIEYDY-TRHFTM-- 115496284 Bos_taurus MQNVINTVKGKALEVAEYLTPV-LKES-KFKETGVITP-----EEFVAAGDHLVHHC--PTWQWATGE------------ELKVKAYLPSGKQFLVTKN-VPCYKRCKQM-EY------SDELEAIIE---------------------EDDG------DGGWVDTYHNT--GIA------GITEAVKEI-------------TLESKDSIKLQDCSAL-------------------------------CEEEEEEDEGEAADMEEYEE----------SGLLETDEATLDT------RKIVEACKAKTDAG---------GEDAILQTRTYDLYITYDKYYQTPRLWLFG--YDEQRQ--PLTVEHMYEDI---SQDHVKK---------TVTIENHPHLP--------------PPPMCSVHPCRHAEVMKKIIETVAEGGGELG------------------------------------------------------------------------VHMYLLIFLKFV-QAVIPTIEYDY-TRHFTM-- Nvec1000006208 Nematostella_vectensis MQDVFNKVKGTALGVAEYLTPV-LKES-KFQETGVITP-----EEFVAAGDHLVHHC--PTWQWSMGE------------ESRVKPYLPKDKQYLYTRN-VPCYKRCKQM-EH------QEENEAIVE----------------------PDE------DGGWVDTHHKV-DPSG-EKVTAGVQEQFSEM--------------KLDSDKVPLLHCHAL------------------------DDFLIRENDDDDEEDDEEAEDMEAFEE----------SGMLENDEATLAV------PVAPVSQSESSDGGTDAT-----SGEGILQTRTYDMYITYDKYYQTPRLWLYG--YNENRK--PLSVEEMYEDM---SQDHAKK---------TVTIEAHPHLP---------------MTMASVHPCRHADVMKKIIQTVADGGGELS------------------------------------------------------------------------VYMYLLIFLKFV-QAVIPTIEYDY-TRQFTM-- Bflo1000029400 Branchiostoma_floridae MQSMINSVKGTALGVAEFLTPV-LKES-KFKETGVLTP-----EEFVAAGDHLVHHC--PTWQWAAGD------------ESKCKPYLPKDKQFLITRN-VPCYKRCKQM-EY------QEEYEKVID---------------------EEDG------EGGWVDTHHNADPSSI-----TSVTETVSEM-------------TLEPKKTEGAAVVADD--------------------------------DDDEEDDDEEAEDMEAYEQ---------SGMLEAEDNATLDV------KAAVKSESPGEGAS---------LESGILQTRTYDLNITYDKYYQTPRLWLFG--YDENRK--PLTVEQMYEDI---SQDHAKK---------TVTMEAHPNLP--------------GPPMASIHPCKHADVMKKIIQTVADGGGELG------------------------------------------------------------------------VHMYLLIFLKFV-QAVIPTIEYDY-TRHFIM-- Ngru1000009601 Naegleria_gruberi KGSTSFRAFSLFKNFAEYFIDV-PKES-HFYERGVLTP-----EEFEKAGDLLVSKC--PTWSWSAGE------------PSKRKDYLPADKQFLITRN-VPCLKRCSELIEM-------AKDEEPVE-------------------------------DGEWIATHINH-----------TKEKEKEEI----------------GDITGGVDDLVIH-------------------------------SAEDDEEDDGEAIDIDNYSD--------------SELEDTIKD------KGALEAN----------------NGDSILQTRTYDFSITYDKYYQTPRVWLFG--YDERGA--KLESEKILEDI---HADYGNK---------TVTIEQHPHLN---------------TQWASIHPCRHAEVMKKMVDRLVGGGSGEK--------------------------------------------------------------------QFVRVDLYLFLFLKFI-SSVIPTIEYDF-TTQVD--- Crei1000012355 Chlamydomonas_reinhardtii MSNLRHTLHTLFKQTVETVTPP-LTKS-QFEEKRVLTP-----DEFVAAGDYLVHAC--PTWSWEGGD------------PKKRRTYFPPNKQFLVTRN-VPCLKRATELEGY------NPNSEFDVG---------------------GGEG------EDAWVATHSNP-AAAS------GSAGKGEVP----------------SIDGAGAGGSGGA--------------------------------GAAGGNKDDDIPDITDLEL------------NEADDEAAAPS-----GRPYLRAEEP---------------ADNIMRTRTYDLYITYDQYYQVPRFWLVG--HDESRK--PLLPQQVMEDV---SEEHARK---------TITVDPHPHLA--------------GLSAASIHPCRHADVMKKLVDNLLEAGREFK------------------------------------------------------------------------VEQYLVLFLKFI-ASVVPTIQYDY-TMSVGGE- Psoj1000017156 Phytophthora_sojae -------MQSLFHGVREYLTPV-LTES-SFEDKGLLTP-----EEFVKAGDLLVYKC--PTWRWESGE------------PSLRRSYLPADKQFLVTRN-VPCRRRVTSLDQS------YQTEEAVEG-------------------------------EDEWVAASSYA-NNEG------GAGADAVTD---------------LSDEMGDISLSDKP------KKQQSQAGGILGAIVDEHFVDGGAAPSASGAAAEPELRDLSSYEE---------EDNLVEDDEAALGP----AAGSYLVASEPDD------------ADDAILRTRTYDLSITYDKYYQTPRVWLFG--YDERNA--PLSGDQMFEDI---MQDYANR---------TVTMEPHPHRS--------------SLVHASIHPCQHGAVMKRIIANLKARKPGDK-------------------------------------------------------------ETEEQLANEIRSDQYLFLFLKFI-QSVIPTIDYDY-TIE----- Pram1000014227 Phytophthora_ramorum ----AEAMQSLFHGVREYLTPV-LTES-SFEDKGLLTP-----EEFVKAGDLLVYKC--PTWRWESGE------------ASLRRAYLPDDKQFLVTRN-VPCRRRVTSLEQS------YQTEEAVEG-------------------------------EDEWVAASSYA-TEGVNANAVTDLSDEMGDM-------------SVASPKKSRDGILGAI-------------------------------VDEHFVDGSSDVRDLGSYEE---------EDNLVEDDEAALGP----SASSYLVASEPDD------------ADDAILRTRTYDLSITYDKYYQTPRVWLFG--YDERNA--PLSGDQMFEDI---MQDYANR---------TVTMEPHPHRS--------------ALVHASIHPCQHGAVMKRIIANLKARKPGET-------------------------------------------------------------QTEEQVANEIRSDQYLFLFLKFI-QSVIPTIDYDY-TIE----- 145348199 Ostreococcus_lucimarinus_CCE9901 MHALRHAVHGAYKTTMEAVTPI-RSSS-AFASEGVLTP-----EEFVAAGDALTRAC--PTWTWTTGQ------------GARARTYLPREKQYLTTRR-VPCARRARDVEAY------AGAEEALTG------------------------E------DEGWIATGAGG--RSD------GAT----------------------MEAIPDLGAMTLE-------------------------------------ADAEDDAE-DEYED--------------EDDDAVVLP---TATAIAKEAAGANGER----------DDENIVKTRTYDVSITYDKYYQTPRVWLNG--YDENRL--VLKPSKTLEDI---SADHAQK---------TVTIDPHPHTG---------------VPSASIHPCKHASVMKKLVNSARAQSGEAP-----------------------------------------------------------------------SVDSYMFVFLKFI-ASVVPTIEYDY-TL------ 66818030 Dictyostelium_discoideum_AX4 LTSFQQAVHKAYVKTVEKVTPT-LSTS-KFLEEGVLTPEEVVFFNFVQAGDLLTDKC--QTWTWESGD------------PSRNVSYLPKEKQFLLTRN-VPCYNRVRTLENE----SKASKADEIQI---------------------EDDG------EDSWVAPQPVG--NQD------DIEDEKVDISSLKIDDKKPNTTTTTTAKPTNNNNNNND---------------------------------DEDEDEDGDIPDLDDFQD---------DNIIEEEDPAVLSK---NNKTTTTTTANNNNNNNSENKVE---DNDNILRTRTYDISITYDKYYQTPRVWLFG--YDENRK--PLKPEEIFEDI---SEDHAHK---------TVTIDSHPHLG---------------ISFAYIHPCRHAAVMKKLVDRQSENGKEPR------------------------------------------------------------------------VDQYLFLFLKFI-SVVIPTIEYDF-TLEFDT-- 28828597 Dictyostelium_discoideum LTSFQQAVHKAYVKTVEKVTPT-LSTS-KFLEEGVLTP-----EEFGQAGDLLTDKC--QTWTWESGD------------PSRNVSYLPKEKQFLLTRN-VPCYNRVRTLENE----SKASKADEIQI---------------------EDDG------EDSWVAPQPVG--NQD------DIEDEKVDISSLKIDDKKPNTTTTTTAKPTNNNNNNND---------------------------------DEDEDEDGDIPDLDDFQD---------DNIIEEEDPAVLSKNNKTTTTTTANNNNNNNSENKVE------DNDNILRTRTYDISITYDKYYQTPRVWLFG--YDENRK--PLKPEEIFEDI---SEDHAHK---------TVTIDSHPHLG---------------ISFAYIHPCRHAAVMKKLVDRQSENGKEPR------------------------------------------------------------------------VDQYLFLFLKFI-SVVIPTIEYDF-TLEFDT-- 125524794 Oryza_sativa_indica_cultivar_group -MQVKQKVYELYKGTVERVTGP-RTVS-AFLDKGVLSV-----PEFILAGDNLVSKC--PTWSWEAGD------------PSKRKPYLPPDKQFLVTRN-VPCLRRAVSLEEE----YDAAGAEVVLG-------------------------------DDEDASKQEEE--------------EDIPSM------------------DTLDIGKTEGI------------------------KSIPSYFSAGKKAEEEEDIPDMDTYED-------------SGNDSVATAQ------PSYFVAEEP--------------EDDNILRTRTYDVSITYDKYYQTPRVWLTG--YDESRM--PLKPELVFEDI---SQDHARK---------TVTIEDHPHLS--------------AGKHASVHPCKHAAVMKKIIDVLMSRGVEPE------------------------------------------------------------------------VDKYLFIFLKFM-ASVIPTIEYDY-TMDFDLGS 115435110 Oryza_sativa_japonica_cultivar_group -MQVKQKVYELYKGTVERVTGP-RTVS-AFLDKGVLSV-----PEFILAGDNLVSKC--PTWSWEAGD------------PSKRKPYLPPDKQFLVTRN-VPCLRRAVSLEEE----YDAAGAEVVLG---------------------DDED------GEGWLATHGVQ-ASKQ------EEEEDIPSM------------------DTLDIGKTEGI------------------------KSIPSYFSAGKKAEEEEDIPDMDTYED-------------SGNDSVATAQ------PSYFVAEEP--------------EDDNILRTRTYDVSITYDKYYQTPRVWLTG--YDESRM--PLKPELVFEDI---SQDHARK---------TVTIEDHPHLS--------------AGKHASVHPCKHAAVMKKIIDVLMSQGVEPE------------------------------------------------------------------------VDKYLFIFLKFM-ASVIPTIEYDY-TMDFDLGS 125569399 Oryza_sativa_japonica_cultivar_group -MQVKQKVYELYKGTVERVTGP-RTVS-AFLDKGVLSV-----PEFILAGDNLVSKC--PTWSWEAGD------------PSKRKPYLPPDKQFLVTRN-VPCLRRAVSLEEE----YDAAGAEVVLG-------------------------------DDEDASKQEEE-EDIP-------------------------------SMDTLDIGKTEGI------------------------KSIPSYFSAGKKAEEEEDIPDMDTYED-------------SGNDSVATAQ------PSYFVAEEP--------------EDDNILRTRTYDVSITYDKYYQTPRVWLTG--YDESRM--PLKPELVFEDI---SQDHARK---------TVTIEDHPHLS--------------AGKHASVHPCKHAAVMKKIIDVLMSQGVEPE------------------------------------------------------------------------VDKYLFIFLKFM-ASVIPTIEYDY-TMDFDLGS 38260612 Sisymbrium_irio -MVLSQKIHGAFKGAVERMTGP-RTVS-AFKEKGVLSV-----SEFVLAGDNLVSKC--PTWSWESGD------------PSKRKPYLPSDKQFLITRN-VPCLRRAASVAED----YEAAGGEVLVD----------------------DED------NDGWLATHGRP-KDRG------NEDENLPSM------------------DALEINERDTI----------------------------QPKPKYAGGEEEDDIPDMAEFDE-------------IDNDPATLQS-------NLLVAHQQ--------------DDDNILRTRTYDVSITYDKYYQTPRVWLTG--YDESRM--LLQPELVMEDV---SQDHARK---------TVTIEDHPHLP---------------GKHASVHPCRHGAVMKKIIDVLMSRGVEPE------------------------------------------------------------------------VDKYLFLFLKFM-ASVIPTIEYDY-TMDFDLGS 9758466 Arabidopsis_thaliana -MVLSQKLHEAFKGTVERITGP-RTIS-AFKEKGVLSV-----SEFVLAGDNLVSKC--PTWSWESGD------------ASKRKPYLPSDKQFLITRN-VPCLRRAASVAED----YEAAGGEVLVD----------------------DED------NDGWLATHGKP-KDKG------KEEDNLPSM------------------DALDINEKNTI----------------------------QSIPTYFGGEEDDDIPDMEEFDE---------ADNVVENDPATLQS-------TYLVAHEP--------------DDDNILRTRTYDLSITYDKYYQTPRVWLTG--YDESRM--LLQPELVMEDV---SQDHARK---------TVTIEDHPHLP---------------GKHASVHPCRHGAVMKKIIDVLMSRGVEPE------------------------------------------------------------------------VDKYLFLFLKFM-ASVIPTIEYDY-TMDFDLDL 89290045 Tetrahymena_thermophila_SB210 EVILSEYQQDQLVDEMEISFKT-PSPE-DFDKHGFLTP-----KQFLESGDQLTL----MGWKWE---------------KVDDNKKLNKRKKFNRIKKNV------------------FDDLEEKVN-------------------------------SEGFVETSQRQ-----------------------------------------------------------------------------------------SNAQDNQEEEE----------------------------------------------------------EMRIYNFSITYDTYYHVPRIWFSG--VDENQK--PLKKEQMFEDV---MPEYRDE---------TVTLEKHPHLG---------------YDQMTIHPCKHSQILKSFIDQAKENGRTIK------------------------------------------------------------------------PNQALIIFLKFV-GSVLPTLEIET-TTDLEI-- 145504745 Paramecium_tetraurelia LLIIKVQMFQGINNFVNSFATP--KVQ-DFYSKGWLTP-----EQFVEAGDQLTM----TGWQWKKA-------------QVKKGVDPPHPEKMYLIAN-ATSQTRIKEFLSF--------DFQNNQG-------------------------------QDGFL-------------------------------------------------------------------------------------------------CVDMSKKQQ----------------------------------------------------QALNEQETRVYTISITYDRKYHCPRLWLQG--VALNSGL-PLKHQEIYEDI---MSVYQNE---------TVTVEEHPYLH---------------YQQVTIHPCNHSTTMKAFLDKAKQNGAEIK------------------------------------------------------------------------PMQALFIFLKFM-QSVMPTVVYDT-TIDICLGV 124396640 Paramecium_tetraurelia -------MFQGINNFVNSFVTP--KPQ-DFYSKGWMTP-----EQFIEAGDQLTM----TGWQWKKA-------------QVKKGVDPPHPEKMYLIAN-ATSQTRIKEFLSF--------DFQNNQG-------------------------------QDGFL-------------------------------------------------------------------------------------------------CVDMSQKQQ----------------------------------------------------QALNEQETRVYTITITYDRKYHCPRLWLQG--VALNSGL-PLKHQEIYEDI---MSVYQNE---------TVTVEEHPYLH---------------YQQVTIHPCNHSTTMKAFLDKAKQNGADIK------------------------------------------------------------------------PMQALFIFLKFM-QSVMPTVVYDT-TIDICLGV 124419471 Paramecium_tetraurelia FSNLGNYANNLVQAVGAALIAP-PTKS-VFLTKGMLTP-----EEFINAGDRLISNG--GNWKWCKAIS-----------DQYKNKYLPNDKQFLIQEN-IISYKRIKDL-NR----------------------------------------------GGTFTEQQEGE--DVT------IIR------------------------------------------------------------------------SEEQPIQEITQSQD------------------------------------------------------------RYYTLYITYDLYYFTPRLYLSG--KVDDR---QLTYQEVKEDV---SGEYADK---------TVTEENFLELN---------------IKLPTIHPCKHADTLKFFVDQMRDNGCPEE---------------------------------------------------------------------KIHPDNSLTIFLKFM-NSVIPTIQFDF-VNTIEL-- 118353993 Tetrahymena_thermophila_SB210 YDNIKNKLNNVKNDIISVVYAP-PTES-RFFEEGKLTP-----QEFVTSGDALINMC--PQWKWMPASA-----------EKYKNKYLPAEKQYLLMEK-VPCDQRVQEL-------MDSIAVNEKED-------------------------------EEEYIINDQKK-----------NNDQNIIET------------------KIGQLSIKEHF-------------------------------------TEEKQGGEEKKEDN--------------DDDNVVVVE--------------------------------APIERRYYDLSICYDLVTYTPHLFLQG--VDEDNV--PLKQNQVFEDI---VSHYSNK---------TITFEVMPQTG---------------IVQASLHPCKHSQVIKHMVDNINQSGGSIK------------------------------------------------------------------------SHQCLFVFLKFL-QSVIPTIEYDV-AGDIIFDE consensus/100% ........................... ................................................................................................................................................................................................................................................................................................... ........................s...h....................... ................................................................................................................................................................................... consensus/95% ........................... .......h.........h...............W................................................................................................................................................................................................................................................................. ...........hph.l.as..apsP.hah.............l..p.h.p.h ...................ho............................lHPC.p...h..............................................................................................hh..h..................... consensus/90% ........................... .......hp.......phh...p..........W.h........................s..b..h..p..s.......................................................................................................................................................................................................................... .........b.hph.l.Ys..YpsP.hah.u...........L..p.hhpsl ........p..........ho....P.....................h.lHPCpps.hhp.h.p.....................................................................................hh..alphh...h.s............... consensus/85% ..........................p .bbpp..hp.......pFl..up.l......ssWpW.......................hs..bb.h..c..ssp........................................................................................................................................................................................................................ .........b.aph.l.Ys..YpsP.hah.u.....p.....L..p.hhpsl ....p...p.........slTbp.aP.....................h.lHPCpps.hhp.hhp.....................................................................................hl..aLphh..sh.s..b............ consensus/80% ........................p.s .hbpp..hps.....ppFl..up.ls.....ssWpW.....................sahs..+bbhh.+p.sss........................................................................................................................................................................................................................ .......p.b.aph.lsYsp.YpsP.lah.u..h..p....sL..pphhcsl ....s...+.........TlTbp.aP.h...................hslHPCcpuphMp.lhpp....................................................................................Ylh.aLphh..sh.s..b....s....... consensus/75% ........................p.s .Fbpps.hss.....p-Fl..uc.ls.....ssWpW..s..................sals..Kbhlh.+p.sss................................................................................................................................................................p....................................................... ...s...p.b.Y-h.IsYsp.YpsP.lah.G..a.pp....PLp.cphhEDl ....sa.p+.........TlTbcpaP.hs..................hslHPC+puphM+.lhpp....................................................................................YlhhaLphl..slhs.bb....s....... consensus/70% ..................h.s...p.o pFbppG.los.....pEFl.uGD.Ls.....ssWpW.su...............p.psaLP..Kbalhp+s.sss..b.........................................................................................................p.............................................p..p..sbp..................................................... ...ss..p.RpY-haIsYsp.YpsP+lal.G..asps....PLs.cphhEDl ....sa.p+.........TlTh-pHP.hs.................hhslHPC+puplM+.llcph..................................................................................bYLhhaLphl.ssVlsshb.c..sb...... 6. Ufc1 FINAL --------------EEEE-------HHHHHHHHHHHHHHHHHHHHH-------EEEEE--------EEEEEEEEEE-----EEEEEE------------EE---------EE-----EEHHHHHHHHHH----HHHHHHHHHH-------------------------HHHHHHHHHHH------ 65305321 Theileria_annulata MSSGLSHLSVEGIPLCKTNTGPFDNPEDWETRLNEEFAALIQYVEENKQNNTEWFTLD-CNDNGT-CWFGECWYVHNMKTYKFQLELEIPAAYPNAPFDIIIRSLEGKTAKMYRGGRICLDAHFLPLWQKNAPKYGIAHGLALG---------------------LAPWLACEIPHLVNIGVLDT--- 71032621 Theileria_parva_strain_Muguga MSSGLSHLSVEGIPLCKTNTGPFDNPEDWETRLNEEFAALIQYVEENKQNNTEWFTLD-CNDNGT-CWFGECWYVHNMKTYKFQLELEIPAGYPNAPFDIIIRSLEGKTAKMYRGGRICLDAHFLPLWQKNAPKYGIAHGLALGVSFKHFS-----------SVQLAPWLACEIPHLVNIGVLDN--- 124411671 Paramecium_tetraurelia ------MQAQPTQPSITTSAGPRD--PQWVDRLKEEYTALINYIKNNKSEDNDWVKLEPANKECT-NWKGKCWVVHNLIRYEFDFQFEIPPTYPLAPIEIEIPSLDGLTPKMYRGGKICIDIHFAPLWQKNAPKFGIVHALQLA---------------------LAPWLAAEIPVLIEEGKIKKD-- 124416518 Paramecium_tetraurelia ------MQTQSQQTNITISAGPRD--AQWIDRLKEEYAALINYIKNNKSEDNDWVKLEPANKECT-NWKGKCWVVHNLIRYEFDFQFEIPPTYPLAPIEIEIPSLDGLTPKMYRGGKICIDIHFAPLWQKNAPKFGIVHALQLA---------------------LAPWLAAEIPVLIEEGKIKKD-- 89284506 Tetrahymena_thermophila_SB210 --MEGIKSIVEKIPLCKVNAGPRD--EKWLDRVKEEYLILIKYIELNKQQDNDWIKIA-SDKEGK-VWKGKCWYIHNLVRYDFDLEFEIPATYPASPIELCLPELDGKTHKMYRGGKICLDIHFAPLWAKNSPKFGIAHALALGVISKYSNIFINLKKLFNINIQLGPWLAAEIPVLIDQGVIKKN-- V#31.m00087 Toxoplasma_gondii ----MSAHGVEEIPVCSVSAGPRS--PEWKERLKEEYISLIAYISQNKRSDKEWFKIE-SNPEGT-AWKGRCWYIHEMVKYEFQLLFDIPPTYPLTPIELRLPELDGKTSKMYRGGRICLDVHFAPLWQKNAPKYGIAHALALG---------------------LGPWLAAEVPYLVEEGTIVGV-- 68125492 Leishmania_major -MEPSVKESVSRIPLLKTKAGPRDG-DKWTARLKEEYASLITYVEHNKASDSHWFHLE-SNPQGT-RWYGTCWTYYKNEKYEFEMNFDIPVTYPQAPPEIALPELEGKTVKMYRGGKICMTTHFFPLWARNVPYFGISHVLALG---------------------LGPWLSIEVPAIVEEGYLKPASA 71744660 Trypanosoma_brucei_TREU927 -MDPAVRESVSRIPLLKTKAGPRDG-EQWTQRLKEEYTSLIQFVENNKASDNHWFKLE-SNEAGT-RWYGTCWTYYKNERYEFNMNFDLAVTYPQAPPEIALPELEGKTVKMYRGGKICMTTHFFPLWARNVPYFGISHALALG---------------------LGPWLSIEVPAMVEDGVLKPKKV 71650234 Trypanosoma_cruzi_strain_CL_Brener -MEPAVKESVSRIPLLKTKAGPRDN-EKWTQRLKEEYMSLIKYVENNKASDNHWFQLE-SNEAGT-RWYGTCWAYYKNEKYEFKVNFDLAVTYPQAPPEIALPELEGKTVKMYRGGKICMTTHFFPLWARNVPYFGISHALALG---------------------LGPWLSVEVPAMVEEGILKPTNK Ngru1000001149 Naegleria_gruberi -MDNSTKETLQKIPLLKEKVGPRDA-DQWEKRLKEEYAALIQYVKMTKENDSAWFSIQ-SNKSGT-KWFGKCWFVHNLLKYEFKVEFEIPVTYPTTAPEFKIPELIGKTAKMYHDGRVCQTLHFHPLWARNVPHFGIAHALALG---------------------LGPWLASEVPDLVSKNLIQAKDT Crei1000013700 Chlamydomonas_reinhardtii AWDNKTKDTVKKIPFLTEKAGPRDK-EKWQARLKQELHALITYIKMNKESDTDWFTIQ-PNADGT-HWSGKCWYVHELIKYEFDFQFDIPATYPDTAPEIEIPELDGKTVKMYRGGKICLTIHFKPLWSKNSPHFGVAHALCLG---------------------LAPWLAAEVPFLVESGAIKAKV- Pram1000015146 Phytophthora_ramorum --DEQTKRTVQQIPLLKTRAGPRDG-EQWTARLKEEYVALIQYVKANKEADNDWFTIA-SNKAGT-RWTGKCWSFYNGLRYEFELEFEIPASYPVANPELCIPELEGKTSKMYRGGKICLTIHFAPLWQKNVPRFGVAHALALG---------------------LAPWLAAEVPDLVERGVI----- 90970505 Dictyostelium_discoideum_AX4 -MDNHTKQTVQQIPLLTVKAGPRDG-DKWIDRLKEEYQALIKYVEINKKADNDWFNIE-SNPLGT-RWQGKCWYIYNFKKYEFDLEFDMPVTYPETAPEIAIPELDGKTEKMYRGGKICLTIHFKPLWSRNVPHFGIAHALALG---------------------LAPWLAAEVPHLVDNGIIKHKED 18396376 Arabidopsis_thaliana GWDPNTKSTLTRIPLLTTKAGPRDG-AAWTQRLKEEYKSLIAYTQMNKSNDNDWFRISASNPEGT-RWTGKCWYVHNLLKYEFDLQFDIPITYPATAPELELPEIDGKTQKMYRGGKICLTVHFKPLWAKNCPRFGIAHALCLG---------------------LAPWLAAEIPILVDSGAIKHKDD 17552592 Caenorhabditis_elegans -MDDATKSSLKAIPLCKTKASPRDG-DLWIERLKEEYEAIIAAVQNNKDCDRDWFQLE-SNERGT-KWFGKCWYFHNMVKYEFDVEFDIPITYPVTAPEIALPELDGKTAKMYRGGKICLSEHFKPLWARNTPKFGIAHAFALG---------------------LGPWMAVEIPDLIEKGLIQPKA- Mbre1000002981 Monosiga_brevicollis MVDAHTRKTLAAIPLLRVNAGPRSG-DLWRARHKEELMSLITYVKNNKENDNDWFRLE-SNPEGT-RWFGKCWIVVDMLKYEFDLEFDIPVAYPSTAPEIAIPELDGKTAKMYRGGKICLTDHFKPLWARNVPHFGIAHAMALG---------------------LGPWLAVEIPDLVSKGIVKHKET Nvec1000012015 Nematostella_vectensis MVDEATKKTLAAIPLLKTKAGPRDG-KDWVDRLKEEYTSLIKYVSNNKEADNDWFRLE-SNKEGTRKPLTIIVNYYELILCVF-FSPKIPITYPTTAPEIALPELDGKTAKMYRYVYW---------------------------------------------------------------------- ci0100146879 Ciona_intestinalis MVDAATKKTLSNIPLLTIKCGPRDK-EEWVKRLKEEYLSLITYVKNNKENDNDWFRLE-SNKDGT-KWFGKCWFIKDLKKYEFEIEFDIPITYPTTAPEIALPQLDGKTAKMYRGGKICLTDHFKPLWARNVPKFGIAHAMALG---------------------LGPWLAVEIPDLIDKGLVKHSDE 51011123 Danio_rerio MADEATRKAVSEIPLLKTNSGPRDK-ELWVQRLREEYLALIKYVENNKAADNDWFRLE-SNKEGT-RWFGKCWYIHELLKYEFDMEFDIPVTYPATAPEVAIPELDGKTAKMYRGGKICLTDHFKPLWARNVPKFGLAHLMALG---------------------LGPWLAVEIPDLIAKGLIQHKDQ 114560788 Pan_troglodytes MADEATRRVVSEIPVLKTNAGPRDR-ELWVQRLKEEYQSLIRYVENNKNADNDWFRLE-SNKEGT-RWFGKCWYIHDLLKYEFDIEFDIPITYPTTAPEIAVPELDGKTAKMYRGGKICLTDHFKPLWARNVPKFGLAHLMALG---------------------VSFVYLA---------------- 55588606 Pan_troglodytes MADEATRRVVSEIPVLKTNAGPRDR-ELWVQRLKEEYQSLIRYVENNKNADNDWFRLE-SNKEGT-RWFGKCWYIHDLLKYEFDIEFDIPITYPTTAPEIAVPELDGKTAKMYRGGKICLTDHFKPLWARNVPKFGLAHLMALG---------------------LGPWLAVEIPDLIQKGVIQHKQK 118138483 Homo_sapiens MADEATRRVVSEIPVLKTNAGPRDR-ELWVQRLKEEYQSLIRYVENNKNADNDWFRLE-SNKEGT-RWFGKCWYIHDLLKYEFDIEFDIPITYPTTAPEIAVPELDGKTAKMYRGGKICLTDHFKPLWARNVPKFGLAHLMALG---------------------LGPWLAVEIPDLIQKGVIQHKEK 114560786 Pan_troglodytes MADEATRRVVSEIPVLKTNAGPRDR-ELWVQRLKEEYQSLIRYVENNKNADNDWFRLE-SNKEGT-RWFGKCWYIHDLLKYEFDIEFDIPITYPTTAPEIAVPELDGKTAKMYSWVHGWQWKSLI--------------------------------------------------------------- 51172600 Rattus_norvegicus MADEATRRVVSEIPVLKTNAGPRDR-ELWVQRLKEEYQSLIRYVENNKNADNDWFRLE-SNKEGT-RWFGKCWYIHDFLKYEFDIEFEIPITYPTTAPEIAVPELDGKTAKMYRGGKICLTDHFKPLWARNVPKFGLAHLMALG---------------------LGPWLAVEVPDLIQKGVIQHKEK 13384768 Mus_musculus MADEATRRVVSEIPVLKTNAGPRDR-ELWVQRLKEEYQSLIRYVENNKNSDNDWFRLE-SNKEGT-RWFGKCWYIHDFLKYEFDIEFEIPITYPTTAPEIAVPELDGKTAKMYRGGKICLTDHFKPLWARNVPKFGLAHLMALG---------------------LGPWLAVEVPDLIQKGVIQHKEK Bflo1000025213 Branchiostoma_floridae MVDAATKKTLSNIPLLKTKAGPRDR-DLWVQRLKEEYQALIQYVENNKKEDNDWFRLE-SNKEGT-RWFGKCWHYQDLMKYEFEIEFDIPITFPTTAPEIAIPELEGKTAKMYRGGKICLTDHFKPLWGRNVPKFGIAHAMALG---------------------LGPWLAVEIPDLISKGLVQHKDD 115937394 Strongylocentrotus_purpuratus MVDAVTKKTLSNIPLLKTKAGPRDK-DLWPARLKEEYQSLIKYVGNNKEADNDWFRLE-SNKDGT-RWWGKCWHIQNLLKYEFELEFDIPVTYPATSPEIAIPELDGKTAKMYRGGKICLTDHFKPLWGKNVPKFGIAHAMALG---------------------LGPWLAVEIPDLIEKGIVVHKEG 72013071 Strongylocentrotus_purpuratus --------------------------------MNKQTKALLNYVGNNKEADNDWFRLE-SNKDGT-RWWGKCWHIQNLLKYEFELEFDIPVTYPATSPEIAIPELDGKTAKMYRGGKICLTDHFKPLWGKNVPKFGIAHAMALG---------------------LGPWLAVEIPDLIEKGIVVHKEG Nvec1000018661 Nematostella_vectensis MVDEATKKTLAAIPLLKTKAGPRDG-KDWVDRLKEEYTSLIKYVSNNKEADNDWFRLE-SNKEGT-RWFGKCWYIHNLLKYEFDVEFDIPITYPTTAPEIALPELDGKTAKMYRGGKICMTDHFKPLWGRNVPRFGIAHAMALG---------------------LGPWLAVEIPDLIEKGLIKHKDK 66561255 Apis_mellifera MVDESTKKTLSNIPLLQTKAGPRDK-ELWVQRLKEEYQALIQYVKNNKESDNDWFRLE-SNKEGT-RWFGKCWYIHNLLKYEFEIEFDIPVTYPITSPEIALPELDGKTAKMYRGGKICLTDHFKPLWARNVPKFGIAHAMALG---------------------LGPWLAVEIPDLIEKGAITHKDK 91084695 Tribolium_castaneum MVDESTRKTLSSIPLLKTKAGPRDK-ELWVQRLKEEYQSLIKYVQNNKDADNDWFRLE-SNKEGT-RWFGKCWFIHDLLKYEFDVEFDIPVMYPSTAPEIALPELDGKTAKMYRGGKICLSDHFKPLWARNVPKFGIAHAMALG---------------------LGPWLAVEIPDLIAKGVVTHKEK 58389907 Anopheles_gambiae_str_PEST MVDDGTRKALSGIPLLKTKAGPRDK-ELWVQRLKEEYQALIKYVQNNKASDMDWFRLE-SNKEGT-KWFGKCWYMYNLHKYEFDVEFDIPITYPTTSPEIALPELDGKTAKMYRGGKICLTDHFKPLWARNVPKFGIAHAMALG---------------------LAPWLAVEVPDLIEKGVISYQEK 19922352 Drosophila_melanogaster MVDDSTRKTLSNIPLLQIRAGPREK-DVWVQRLKEEYQALIKYVENNKQSGSDWFRLE-SNKEGT-KWFGKCWYMHNLLKYEFDVEFDIPVTYPTTAPEIALPELDGKTAKMYRGGKICLTDHFKPLWARNVPKFGIAHAMALG---------------------LAPWLAVEIPDLIEKGIITYKEK consensus/100% .................................pbb...ll.h...sK..s..Whpl..ss..sp....s.hh.h.p...h.F.h..chs..aP.ss.-h.l.pl.GbT.KMY.............................................................................. consensus/95% .............s.hp.psuPbp....W..RhpcEh.uLI.al..NK..spcWhpl..sN..sT..W.GpCW.h.p..pYcFphph-ls.sYP.ss.-l.l.pL-GbT.KMY+............................................................................. consensus/90% .........l..IPhhp.psGPR-....W..RL+EEa.uLIpYlp.NK..DscWFplp.uN..GT.pWbGpCW.h.p..+Y-FphpF-IPhsYP.ss.El.lPpL-GKTsKMYRGG+IChs.HF.PLW.+NsP.aGlsH.h.Lu.....................LuPWhu.ElP.hl..s.l........ consensus/85% ........sl..IPlhp.psGPRD....W..RLKEEa.uLIpYlp.NK.sDs-WFplp.uN.pGT.pWbGcCW.h.phb+YEFphpF-IPhoYP.ss.El.lPpL-GKTsKMYRGG+IChs.HF.PLW.+NsP.aGluHhhsLG.....................LuPWLuhElP.hl..G.l........ consensus/80% ......+pslp.IPllpspAGPRD..c.W.pRLKEEY.uLIpYVppNKpsDs-WFpl-.SNcpGT.+WbG+CW.haphb+YEFchpF-IPhTYP.ssPEIhlPEL-GKTsKMYRGGKIChs.HF.PLW.+NsP+FGluHhhALG.....................LuPWLAhElP.LlppGhl........ consensus/75% ..-..s+pslp.IPlLpspAGPRD..-bW.pRLKEEY.uLIpYVppNKpsDsDWF+LE.SNccGT.+WhGKCWahashhKYEF-lpF-IPlTYPsosPEIhlPELDGKTsKMYRGGKIClo.HFbPLWu+NsP+FGlAHhhALG.....................LuPWLAhElP.LlppGhl........ consensus/70% ..D..s+pslppIPlLpspAGPRD..-bW.pRLKEEY.uLIpYVcsNKpsDNDWF+LE.SNc-GT.+WaGKCWalashlKYEF-lEF-IPlTYPsTsPEIhlPELDGKTuKMYRGGKICLo.HFbPLWu+NsP+FGlAHuhALG.....................LuPWLAhElPsLlpcGhl........ 7. UbcI FINAL -------HHHHH--HHHHHHHH-H-------------EEE-----------EEEEEEE-EE----------HHH----HHHHHHH-------EEEEEEE---------------EEEEEEEE----E----EEE---EEEEHHHH--------------------------------HHHHHHHHHHHH------------------------------------------HHHHHHHH--------- Ngru1000013764 Naegleria_gruberi AGYVEYNLSAKS--KLLTELKF-INSLNGRN-----GWKVEL-----VDENIFVWTVH-LYNFP--TSSTIASG----LRSFQEKFNIGNG-DIVLEVRFAN-----YPEEPPHIRVVSPRI----QYLTGLVQFDGSLCVDFLTR-----------------------NNWAENRCLSMESIFSKIKSSLIE----------------NDACIDL---------RTFKPYDVKAALNSYVR----HT Crei1000012978 Chlamydomonas_reinhardtii -------------------MRS-LQRCPLATGPKPQISHIEPV----NDDNMLLWRLRVLPAFD--EDVAAGRQLNADLRRLGQMRGSGGQDYILMEVSFPQD----YPTNPFFLRVVSPRC----VMYTGHVTAGGSICIEALVA-------------------TGGPGGWQP--DYCVEAVLVLVLANMLTAEVAQVRTATGPGGISGPLRVDL--------SAGLAPYSDFEARAAYDRTVANHG 116058204 Ostreococcus_tauri MAESR---------VLYREFKA-LKRREGELFDNLTMCSEDL--------GVWRFELR-SHHFD--PGAIGTRELKNDLKKLNNRHNV---DHILVECSFRLNGSAPFPTCPPLIRVVRPRM----KWYTGHVTSGGSFCTEMLVN-------------------TKGLNGWRS--NYRIEAVIQALVISAVHCPAIVISTPYNGRQVSGPLRVDL---SCEYTDNVLREYSEYEAVSAFTRAEAHHR ci0100140471 Ciona_intestinalis LETRPKTVNGR---RLMKEFRF-LRKASEDSE---GAFEVFPYSEDGEGNDLSEWDIR-YYKFD--ESSHLWRS----MQDHGI-------DCVRFRISFPAD----FPFSPPFVRLVSPYI------ENGFVMNGGAICLEVLTA-----------------------QGWSS--AYTVEALLVQVAAALAH----------------GGAVVSK------HQKRKQHKLTKKRAEAEFQRIVKIHS Mbre1000002317 Monosiga_brevicollis MQK-----------RLMQELKL-LDKCTSIKD---GVFEVSL-----VNDNLFEWDVM-LHKFD--PDTLIAHD----LEMMRRSHGV---GSIWLRISFPQN----FPFVPPFVRVLAPVV------HGGFVLSGGAVCMELLTP-----------------------DGWSQ--AYRMESVILQTMSTIGK----------------GQARIVR---------QVRRPLTDEEAKRSYDHLVRVHK 115966376 Strongylocentrotus_purpuratus EGAEEGAEKGAE--AVKKALEDGGFMTNEKLT--GPPYAVEL-----VNDSLADWNIK-LFHVN--PKSSLSKD----MKKHNY-------EYILFNMTFPDN----FPFVPPFVRVVSPHV------EYGYVLDGGAICMELLTP-----------------------QGWSS--SYTIDAVIMQLGATLVA----------------GDGRIVQ------SSFRSDVPFNKQEAEVSFRAIVETHE 91082929 Tribolium_castaneum DGPSHKSVRAR---RLMKEYRD-LQRLQNSKT--DPVFTVEL-----VDDNLFEWHVK-VYKLD--AESELGND----MKELGI-------NYILLHVVFPEN----FPFAPPFMRVISPRI------EKGFVMEGGAICMELLTP-----------------------RGWAS--AYTVEAVIMQFAASVVK----------------GQGRIQR-------KTKGQKVFSKRTAEESFRSLVKTHD 48104639 Apis_mellifera MAMPDKRVRLR---RLMKELSE-IQRMQHRLE---STFTAEL-----VNDNLFEWHVR-LHKID--PESELAAD----MRELNI-------PYILLHVVFPEN----FPFAPPFMRVISPRI------EKGFVMEGGAICMELLTP-----------------------RGWAS--AYTIEAVITQFAASIVK----------------GQGRVAR-------KPKTNKEFNRRSAEESFRSLVKTHE 19920874 Drosophila_melanogaster VAAPDHTIRTR---RLMKEYRE-MERLQAKND---AVFTVEL-----VNDSLFEWHVR-LHVID--PDSPLARD----MAEMGV-------PAILLHLSFPDN----FPFAPPFMRVVEPHI------EKGYVMEGGAICMELLTP-----------------------RGWAS--AYTVEAVIMQFAASVVK----------------GQGRIAR-------KPKSTKEFTRRQAEESFRSLVKTHE 58386327 Anopheles_gambiae_str_PEST SKPLDTTIRSR---RLMKELKE-IERLQHSRT--DPCFTVEL-----INDNLYEWHAR-LFRID--PDSPLAED----LVELNI-------PFILLHLVFPEN----FPFAPPFMRVVEPRI------EKGFVMEGGAICMELLTP-----------------------RGWAS--AYTVEAILMQFAASLVK----------------GQGRVSR-------KPKSAKDFSRRSAEEAFRSLVKTHE Bflo1000000133 Branchiostoma_floridae ----------------MRELQE-VRKHKDG------VFEVDL-----VEDNLYEWNVK-LYKVD--PDSEFYQD----MQEVGT-------EFVLLNLTFPEN----FPFSPPFMRVLTPKI------ENGYVMDGGAICMELLTP-----------------------KGWSS--AYTVEAIILQFSASLVK----------------GRGRINR-------KTKKKDTYTRSHAEASFRNVVKTHE Nvec1000021943 Nematostella_vectensis ----------------MKEFQD-VSRKTER------IFSAEL-----VDDNLFEWNVK-LHTID--GDSLLYRD----MVETGS-------KFILLNITFPEN----FPFAPPFMRVLAPRI------EGGFVLDGGAICMELLTP-----------------------KGWSS--AYTVEAVVLQFSAAVVK----------------GKGRIDR---------TCKKAFSKKEAESAYKRLVKTHD 114598936 Pan_troglodytes RQQHCTQVLNR---RLMKELQD-IARLIDRFI------SVEL-----VDESLFDWNVK-LHQVD--KDSVLWQD----MKETNT-------EFILLNLTFPDN----FPFSPPFMRVLSPRL------ENGYVLDGGAICMELLTP-----------------------RGWSS--AYTVEAVMRQFAASLVK----------------GQGRICR------KAGKSKKSFSRKEAEATFKSLVKTHE 94395457 Mus_musculus RQQHCTQVRSR---RLMKELQD-IARLSDRFI------SVEL-----VNENLFDWNVK-LHQVD--KDSVLWQD----MKETNT-------EFILLNLTFPDN----FPFSPPFMRVLSPRL------ENGYVLDGGAICMELLTP-----------------------RGWSS--AYTVEAVMRQFAASLVK----------------GQGRICR------KAGKSKKSFSRKEAEATFKSLVKTHE 109504540 Rattus_norvegicus RQQHCTQVRSR---RLMKELQD-IARLSDRFI------SVEL-----VNENLFDWNVK-LHQVD--KDSVLWQD----MKETNT-------EFILLNLTFPDN----FPFSPPFMRVLSPRL------ENGYVLDGGAICMELLTP-----------------------RGWSS--AYTVEAVMRQFAASLVK----------------GQGRICR------KAGKSKKSFSRKEAEATFKSLVKTHE 94396224 Mus_musculus RQQHCTQVRSR---RLMKELQD-IARLSDRFI------SVEL-----VNENLFDWNVK-LHQVD--KDSVLWQD----MKETNT-------EFILLNLTFPDN----FPFSPPFMRVLSPRL------ENGYVLDGGAICMELLTP-----------------------RGWSS--AYTVEAVMRQFAASLVK----------------GQGRICR------KAGKSKKSFSRKEAEATFKSLVKTHE 88982452 Homo_sapiens RQQHCTQVRSR---RLMKELQD-IARLSDRFI------SVEL-----VDESLFDWNVK-LHQVD--KDSVLWQD----MKETNT-------EFILLNLTFPDN----FPFSPPFMRVLSPRL------ENGYVLDGGAICMELLTP-----------------------RGWSS--AYTVEAVMRQFAASLVK----------------GQGRICR------KAGKSKKSFSRKEAEATFKSLVKTHE 113416990 Homo_sapiens RQQHCTQVRSR---RLMKELQD-IARLSDRFI------SVEL-----VDESLFDWNVK-LHQVD--KDSVLWQD----MKETNT-------EFILLNLTFPDN----FPFSPPFMRVLSPRL------ENGYVLDGGAICMELLTP-----------------------RGWSS--AYTVEAVMRQFAASLVK----------------GQGRICR------KAGKSKKSFSRKEAEATFKSLVKTHE 68392303 Danio_rerio NRQHCTQVRTR---RLMKELQE-IRRLGDSFI------TVEL-----ADDNLFDWNVK-LHQVD--KDSALWQD----MKETNT-------EFILLNVTFPDN----FPFSPPFMRVLTPRL------ENGYVLDGGAICMELLTP-----------------------RGWSS--AYTVEAVMRQFAASLVK----------------GQGRICR------KAGKSKKAFSRKEAEATFKSLVKTHE 47229506 Tetraodon_nigroviridis QRPNCTQVRTR---RLMKELQE-IRRLGDNFI------TVEL-----VEDNLFEWNVK-LHQVD--KDSALWQD----MKETNT-------EFIVLNVTFPDN----FPFSPPFMRVLSPRL------ENGYVLDGGAICMELLTP-----------------------RGWSS--AYTVEAVMRQFAASLVK----------------GQGRICR------KPGKSKKAFNRKEAEATFKSLVKTHE Bflo1000009658 Branchiostoma_floridae NGEESSESEDD---GMDSDLNEDVDEED-HF-------EMDE-----------VTSEK-DKKKE--EDEDIETENLIVLERLKM---------NQRRDYLKEN----FPFDPPFVRIIAPII------NGGYVLGGGAICMELLTK-----------------------QGWSS--AYTIEAVIMQISATLVK----------------GKARINF--------NANKTPAVRNPHVCAFIFVIV--- 71984336 Caenorhabditis_elegans DGKVQGSITATD--RLMKEIRD-IHRSEHFKN---GIYTFELE----KEENLYQWWIK-LHKVD--EDSPLFED----MKKLKKDHNQ---DHLLFSFTFNEK----FPCDPPFVRVVAPHI------NQGFVLGGGAICMELLTK-----------------------QGWSS--AYSIESCILQIAATLVK----------------GRARISF-------DAKHTSTYSMARAQQSFKSLQQIHA 47216182 Tetraodon_nigroviridis RRHLLGGTRERQPVRMARQIED-VRNSSCEYF--ILFLPRSF-----RGKPAVTCTSL-TIRVD--PDSPLHTD----LQVLKEKEGM---DYILLNFSYKDN----FPFDPPFVRVVSPVL------CGGYVLGGGAICMELLTK-----------------------QGWSS--AYSIESVIMQINATLVK----------------GKARVQF--------GANKNQYSLARAQQAYKSLVQIHE Nvec1000009989 Nematostella_vectensis KGTVCGSVQATD--RLMKELRD-VYRSDSFKL---GNYSVYL-----NNDNLYDWSIK-IMRVD--PESVLHKD----MVQIEKQEGI---DHILLNMTFTDK----FPFDPPFVRVCYPVI------QAGYVLSGGAICMELLTP-----------------------QGWSS--AYTIEAVIVQISATLVK----------------GKARINF-------QDTKKVVYSLHRAQQSFKSLVQIHE ci0100153655 Ciona_intestinalis DGRISGSVQASD--RLLKELKA-IYRSESFKQ---GCYNVEL-----VNDSLYEWHVQ-ILKVD--PDSHLHAD----LKELKANGGQ---ASIIMGVSFRDN----FPFDPPFVRVVCPVL------TGGFVLGGGAICMELLTK-----------------------QGWSS--AYSIESLIMQIMATLVK----------------GKARIQF--------GASSSTYSLSRAQQSFQRLVQIHE 115923264 Strongylocentrotus_purpuratus KGSVSGSVGATD--RLMKELRD-IYRSDSFKK--KKMYSVDL-----VNDSLYDWNVK-IYTVD--SDSPLHAD----LCQLKEKEGK---DHILLNMTFKEH----YPYDPPFVRVVWPIL------TGGYVLGGGAICMELLTK-----------------------QGWSS--AYTIEAVILQIAATLVK----------------GKARIQF--------GAGKSQYSLVRAQQSFRSLVQIHE 48109570 Apis_mellifera KGSVCGSVQATD--RLMKELRD-IYRSDSFKK---GMYSIEL-----VNDSLYEWNVR-LMCVD--PDSPLHSD----LILLKEKEGK---DSILLNMLFKET----YPFEPPFVRVVHPMI------SGGYVLIGGAICMELLTK-----------------------QGWSS--AYTVEAVIMQISATLVK----------------GKARIQFQGPGSASKVCGQGQYSLARAQQSFKSLVQIHE 58396115 Anopheles_gambiae_str_PEST KGSVSGSVQATD--RLMKELRD-IYRSDSFKN---NMYSIEL-----VNDSIYEWNIR-LMSVD--PDSPLHND----LVLLKEREGK---DSILLNIIFKET----YPFEPPFVRVVHPII------SGGYVLVGGAICMELLTK-----------------------QGWSS--AYTVEAVIMQIAATLVK----------------GKARIQF----GPTKSLSQGQYSLARAQQSFKSLVQIHE 24639327 Drosophila_melanogaster KGSVSGSVQATD--RLMKELRD-IYRSDAFKK---NMYSIEL-----VNESIYEWNIR-LKSVD--PDSPLHSD----LQMLKEKEGK---DSILLNILFKET----YPFEPPFVRVVHPII------SGGYVLIGGAICMELLTK-----------------------QGWSS--AYTVEAVIMQIAATLVK----------------GKARIQF----GATKALTQGQYSLARAQQSFKSLVQIHE Bflo1000000060 Branchiostoma_floridae KGAVSGSVQATD--RLMKELKN-VFRSDSLKR---GIYSVEL-----VNDNLYDWNIK-LQGVD--PDSALYAD----LMVLKQKEGR---DFILLNMTFKEN----FPFDPPFVRIIAPII------NGGYVLGGGAICMELLTK-----------------------QGWSS--AYTIEAVIMQISATLVK----------------GKARINF-----NANKTPANQYSLARAQQSFKSLVQIHE 30425296 Mus_musculus SGATSGSVCASD--RLMKELRE-IYRSQSYKS---GTFSVEL-----INDSLYDWHVK-LRKVD--PDSCLYRD----LQRLKQKEGI---DYILLNFSFKDN----FPFDPPFVRVVLPVL------SDGYVLDGGALCMELLTN-----------------------QGWSS--AYSIESVILQINATLVK----------------GKARVRF---------GVDNHYTEQVARRVYKSMVLKHE 109465090 Rattus_norvegicus NGAVSGSVQATD--RLMKELRD-IYRSQSFKG---GNYAVEL-----VNDSLYDWNVK-LLKVD--QDSALHND----LQILKEKEGA---DFILLNFSFKDN----FPFDPPFVRVVSPVL------SGGYVLGGGAICMELLTK-----------------------QGWSS--AYSIESVIMQISATLVK----------------GKARVQF--------GANKSQYSLTRAQQSYKSLVQIHE 31541872 Mus_musculus NGAVSGSVQATD--RLMKELRD-IYRSQSFKG---GNYAVEL-----VNDSLYDWNVK-LLKVD--QDSALHND----LQILKEKEGA---DFILLNFSFKDN----FPFDPPFVRVVSPVL------SGGYVLGGGAICMELLTK-----------------------QGWSS--AYSIESVIMQISATLVK----------------GKARVQF--------GANKSQYSLTRAQQSYKSLVQIHE 31543906 Homo_sapiens NGAVSGSVQATD--RLMKELRD-IYRSQSFKG---GNYAVEL-----VNDSLYDWNVK-LLKVD--QDSALHND----LQILKEKEGA---DFILLNFSFKDN----FPFDPPFVRVVSPVL------SGGYVLGGGAICMELLTK-----------------------QGWSS--AYSIESVIMQISATLVK----------------GKARVQF--------GANKSQYSLTRAQQSYKSLVQIHE 114559985 Pan_troglodytes NGAVSGSVQATD--RLMKELRD-IYRSQSFKG---GNYAVEL-----VNDSLYDWNVK-LLKVD--QDSALHND----LQILKEKEGA---DFILLNFSFKDN----FPFDPPFVRVVSPVL------SGGYVLGGGAICMELLTK-----------------------QGWSS--AYSIESVIMQISATLVK----------------GKARVQF--------GANKSQYSLTRAQQSYKSLVQIHE 47220542 Tetraodon_nigroviridis NGAVSGSVQASD--RLMKELRE-IYRSQSYKT---GIYSVEL-----VSDSLYEWHVK-LRTVD--PDSPLHSD----LQVLKEKEGM---DYILLNFSYKDN----FPFDPPFVRVVSPVL------SGGYVLGGGALCMELLTKQVRSWFLLPAVVGRQVDCLSGAFQGWSS--AYSIESVIMQINATLVK----------------GKARVQF------GANKVKADSFHRRVPRQLLCLIVI-- 109510440 Rattus_norvegicus NGTVSGSVHASN--RLMKELRE-IYRSQSYKS---GTYSVEL-----INDSLYDWHVK-LRKVD--PDSPLYGD----LQLLKEKEGI---EYILLNFSFKDN----FPFDPPFVRVELPIL------SGGYVLGGGALCMELLTK-----------------------QGWSS--AYSIESVIMQINATLVK----------------GKARVRF--------GADKNQYNLETAQQSYDSVVQMHE 51830637 Mus_musculus NGTVSGSVHASN--RLMKELRD-IYRSQSYKS---GTYSVEL-----INDSLYDWHVK-LRKVD--PDSPLYGD----LQLLNEKEGI---DYILLNFSFKDN----FPFDPPFVRVELPIL------SGGYVLGGGALCMELLTK-----------------------QGWSS--AYSIESVIMQINATLVK----------------GKARVRF--------GADKNQYNLETAQQSYDSVVQMHE 94408620 Mus_musculus NGTVSGSVHASN--RLMKELRD-IYRSQSYKS---GTYSVEL-----INDSLYDWHVK-LRKVD--PDSPLYGD----LQLLNEKEGI---DYILLNFSFKDN----FPFDPPFVRVELPIL------SGGYVLGGGALCMELLTK-----------------------QGWSS--AYSIESVIMQINATLVK----------------GKARVRF--------GADKNQYNLETAQQSYDSVVQMHE 114658251 Pan_troglodytes NGAVSGSVQASD--RLMKELRD-IYRSQSYKT---GIYSVEL-----INDSLYDWHVK-LQKVD--PDSPLHSD----LQILKEKEGI---EYILLNFSFKDN----FPFDPPFVRVVLPVL------SGGYVLGGGALCMELLTK-----------------------QGWSS--AYSIESVIMQINATLVK----------------GKARVQF-------------------GANKVLLLRILHK 114658255 Pan_troglodytes NGAVSGSVQASD--RLMKELRD-IYRSQSYKT---GIYSVEL-----INDSLYDWHVK-LQKVD--PDSPLHSD----LQILKEKEGI---EYILLNFSFKDN----FPFDPPFVRVVLPVL------SGGYVLGGGALCMELLTK-----------------------QGWSS--AYSIESVIMQINATLVK----------------GKARVQF--------GANKNQYNLARAQQSYNSIVQIHE 29789401 Homo_sapiens NGAVSGSVQASD--RLMKELRD-IYRSQSYKT---GIYSVEL-----INDSLYDWHVK-LQKVD--PDSPLHSD----LQILKEKEGI---EYILLNFSFKDN----FPFDPPFVRVVLPVL------SGGYVLGGGALCMELLTK-----------------------QGWSS--AYSIESVIMQINATLVK----------------GKARVQF--------GANKNQYNLARAQQSYNSIVQIHE 114658247 Pan_troglodytes NGAVSGSVQASD--RLMKELRD-IYRSQSYKT---GIYSVEL-----INDSLYDWHVK-LQKVD--PDSPLHSD----LQILKEKEGI---EYILLNFSFKDN----FPFDPPFVRVVLPVL------SGGYVLGGGALCMELLTK-----------------------QGWSS--AYSIESVIMQINATLVK----------------GKARVQF--------GANKNQYNLARAQQSYNSIVQIHE 114658249 Pan_troglodytes NGAVSGSVQASD--RLMKELRD-IYRSQSYKT---GIYSVEL-----INDSLYDWHVK-LQKVD--PDSPLHSD----LQILKEKEGI---EYILLNFSFKDN----FPFDPPFVRVVLPVL------SGGYVLGGGALCMELLTK-----------------------QGWSS--AYSIESVIMQINATLVK----------------GKARVQF--------GANKNQYNLARAQQSYNSIVQIHE 30725841 Mus_musculus NGAVSGSVQASD--RLMKELRD-VYRSQSYKA---GIYSVEL-----INDSLYDWHVK-QHKVD--SDSPLHSD----LQILKEKEGI---EYILLNFSFKDN----FPFDPPFVRVVLPVL------SGGYVLGGGALCMELLTK-----------------------QGWSS--AYSIESVIMQINATLVK----------------GKARVQF--------GANKNQYNLARAQQSYNSIVQIHE 109484735 Rattus_norvegicus NGAVSGSVQASD--RLMKELRD-VYRSQSYKA---GIYSVEL-----INDSLYDWHVK-LHKVD--SDSPLHSD----LQILKEKEGI---EYILLNFSFKDN----FPFDPPFVRVVLPVL------SGGYVLGGGALCMELLTK-----------------------QGWSS--AYSIESVIMQINATLVK----------------GKARVQF--------GANKNQYNLARAQQSYNSIVQIHE 94387187 Mus_musculus NGAVSGSVQASD--RLMKELRD-VYRSQSYKA---GIYSVEL-----INDSLYDWHVK-LHKVD--SDSPLHSD----LQILKEKEGI---EYILLNFSFKDN----FPFDPPFVRVVLPVL------SGGYVLGGGALCMELLTK-----------------------QGWSS--AYSIESVIMQINATLVK----------------GKARVQF--------GANKNQYNLARAQQSYNSIVQIHE 121908958 Trichomonas_vaginalis_G3 EKYKPSNAGEL---RLIQDLKS-IKSMPAKDL----GFSAEP-----YQGHLSTWEIH-LFGFE--PKDSIYPD----IQKYKQLTGR---DYVQFMVHFPPD----YPIRPPFVRVVQPRF----KFHTGRVTIGGSICADILTM-----------------------NGWNP--SYDVSSCFSNIFAEIFS----------------QNPQIDF---------LNMTPYSKEEAYNAYIRVAREHG 121900284 Trichomonas_vaginalis_G3 NGSSNSSVDE----RIVRDFYT-ISKRHPKEL----GFTVLP-----YNNDIHVWEVR-LWSFD--KDTPIYKD----MQIFEAKTGR---NYIELRVSFPPN----YPIHPPFIRVVYPRC------TGERVLNGGAFCISALTL--------------------TKEDGWSP--IYDFEDLLSSILVEMRS------------QTVSDPLRINF---------ENDTPYSVQEAISTYRRLAGIHG 121908902 Trichomonas_vaginalis_G3 DYQVNMNSATDD--RIVRDYYT-ISRRKPSEL----GFSVKPY----MND-IRTWEVH-LFGFD--KKDPIYAD----IQKYKKQTGK---DYIELRVSFPPD----YPNRPPFLRVISPRC----VSHGGRVTLGGAICVSALTL--------------------TNQNGWSP--IYDFESLFMNIIAEMLN--------------CEPPLRIDF---------NNDTPYSTREAIDSFMRLTSDHG Ngru1000014952 Naegleria_gruberi TGASEGCVN-----CIMKEYMK-IKQANTEKF----GISANP-----VNDDLFHWEIR-FFNFDIKEDGKIAKD----LQEYKKKNGI---DYITLDLTFPLE----FPFKPPFIRVLKPRF----AFRTGHVTIGGSICMELLTS-----------------------SGWSS--VCSLESIFVQIRSEMVA----------------GEAQLDF---------SNTSPYSEHEAKEAFFRVAQRYG Bflo1000013501 Branchiostoma_floridae SYKTSGASSVAIH-RLIMDLKN-MKKTKGKF-----GVEGVP-----RGDNLFLWDVK-LTNID--PKCPLGKD----LQQYAKQHQEE--PVIKMEMKFPPD----YPMAPPFVRVLKPRF----KFLTGHVTIGGSICMEMLTR-----------------------SGWRP--TNDIESILVQIRAEILS---------------DNNARLDK---------NPNWEYTESEAKTAFHRMVNRYG Bflo1000003589 Branchiostoma_floridae SAPRNESVSFY---VVCPDYLKNMKKTKGKF-----GVEGVP-----RGDNLFLWDVK-LTNID--PKCPLGKD----LQQYAKQHQEE--PVIKMEMKFPPD----YPMAPPFVRVLKPRF----KFLTGHVTIGGSICMEMLTR-----------------------SGWRP--TNDIESILVQIRAEILS---------------DNNARLDK---------NPNWEYTESEAKTAFHRMVNRYG Ptri1000000857 Phaeodactylum_tricornutum RAEKETSGGKR---RLAQDLYR-IMNQDTNKA----GFSLKPS----KEDSMERWTIK-LFKFD--EDSNLAKD----MLVLGL-------EHIELEMSFPEQ----YPFEPPFVRVSRPRF----KRQTGFVMN-GALCMELLTN-----------------------EGWNP--INDIESVIVSVRSLLVV----------------GDGRLEA-----------ACNLADAKYRSLLDAAVESQS Tpseu1000003290 Thalassiosira_pseudonana --------------RLASDLYK-IMMADTDEA----GFSLEPC----DEDSMDKWCIK-LFGFD--CDSHLAKD----LMVVGM-------DHVELEMSFPDD----YPFEPPFVRVVRPRF----KRQTGFVMN-GALCMELLTK-----------------------DGWNP--INDIESVIVSIRSLMVV----------------GDGRLQA----AKPSAASIGSYSAAEASSAYDHLSKFHQ 70993182 Aspergillus_fumigatus_Af293 QYATAPATK-----VLQQHLQA-TLKVQARESLHDLGWYIDPD----FITTVYQWIVE-LHSFD--PKLPLAKD----LKQANM-------KSVVLELRFPLG----FPMSPPFVRVIRPRFLEFANGGGGHVTAGGALCMELLTN-----------------------SGWLP--TASIESVLLQVRMAITN-------------PEPRPARLAL--------NRSRSDYSVVEAVEAYKRACLAHG 42551042 Gibberella_zeae_PH_1 AASSPAALR-----ALNGQIKD-LQQIQSSTNIAMLGWYIDFE----KLDNLFHWIVE-LHSFD--MNLALAQD----MIRYGC-------SSIVLEVRFGAS----FPISPPFVRVIRPRFVPFAQGGGGHVTMGGSICSELLTN-----------------------SGWSP--ALSLEKVFLEVRMNLCE--------------KDPPARIEQSAGVRRAQNGSHMDYGMFEAVDAYRRAATAHG consensus/100% ...................h..........................................................h......................................................................................................................................................................... consensus/95% ..................phb..h.p.........................h..h.hp....h-...p..h..p....h..............l.h.h.a.......aP..PPahRl..P.h........G.V...GuhC.phLs.........................sW......phpshh......h......................l.........................h........ consensus/90% ...............l.p-hb..l.p.p...........hp........psh..Wpl+.h..hD...cs.l..D....h..............lbhph.F..p....aPh.PPFhRVh.P.h........G.V..GGulCh-hLT........................pGW.s...hslEulh.ph.s.hh.................s..pl.b..............hs...u...a..h...a. consensus/85% ...............lhp-hbp.l.c.ps.b.......phc........-slhpWpl+.l.phD...cs.l..D....hbb............IbhphpF..s....aPh.PPFhRVl.P.h......psGaV..GGulChEhLT........................pGWss..shslEulh.ph.upllp................s.uRlpb..........s...as..pA..sa.phs..H. consensus/80% p...p.p.p.....RLh+-lbp.l.+.ps.b......hsl-.......s-slapWpl+.L.phD...-osL.pD....hbbhp........p.IlLphsF.ps....aPhsPPFhRVl.P.h......psGaVb.GGAlCMELLT........................pGWus..sYolEulh.Qh.Asllp................GpuRlpb..........sp..as..pAb.sapplsp.H. consensus/75% p...ssplp.p...RLh+ELbc.lb+.psbb......hsl-h.....hs-sLacWpl+.L.plD...DSsL.pD....hbbhpp.......caIlLphoF.-s....FPFsPPFhRVl.P.l......psGaVhsGGAlCMELLT........................pGWuS..uYolEuVhbQh.AslVc................GpuRlpb..........spp.asb.cAbpsappllp.Hp consensus/70% p...ssslpsp...RLMKELb-.lbR.psbb......aslEL.....lsDsLa-WpV+.L.plD...DSsLapD....hbbhpp.......-aILLphoF.-s....FPFsPPFlRVl.P.l......psGaVhsGGAlCMELLT........................pGWSS..AYolEuVlhQlsAoLVK................GpuRlpb........s.spppasb.cAbpuapplVp.Hp 8. UBE2W FINAL ---HHHHHHHH---------EE-----------------------------EEEEEEEE-----------EEEEEEE-------------EEE------------EEE--------EEEEEEE------------HHHHHHHHHHHH------------HHHHH-------------EEE----- Ptri1000006839 Phaeodactylum_tricornutum NYRIQRELKAFL-SDPPSNLSV---KVG--------------------KNLRVWIVSIEGAKGT-VYEGEIFKLRISFPPQY--PTVPPSVYFL--------PPSIPVHEHVYTNGDICLSLLGK----DWRPTMTAQSIAVSILSILSSAQSKSLPMDNARHA-QN--KPGEYQKDWVYHDDNC Tpseu1000010929 Thalassiosira_pseudonana NYRIQRELKEFL-KSPPPGLSV--KISG--------------------KNVRLWVITLSMPENT-VYAGETYKLRVQFPNDY--PTSPPSVYFL--------PPT-PRHEHVYTNGDICLSLLGK----DWRPIMTAQSVAQAIFSILCGAQRKSLPMDNSRHA-GN--KPGQKQDDWVYHDDNC 23619379 Plasmodium_falciparum_3D7 NYRIQKELNNFL-KNPPINCTI--DVHP--------------------SNIRIWIVQYVGLENT-IYANEVYKIKIIFPDNY--PLKPPIVYFL--------QKP-PKHTHVYSNGDICLSVLGD----DYNPSLSISGLILSIISMLSSAKEKKLPIDNYTHA-DA--KPGSSQNNFLYHDDKC Ia#86.m00396 Toxoplasma_gondii NYRIQKELQAFL-SNPPPNCRV--YVHP--------------------SNIRVWLIEMTGMEGS-PYANEMYRLKVVIPPDY--PFSPPTCFFL--------QPA-PVHVHVYSNGDVCLNLLGS----DWRPSLSISAIAVAILSMLTSAKQKQLPTDNAAHM-DV--PAGHHGTQFLYHDDKV 50557414 Yarrowia_lipolytica_CLIB122 QRRIMKELERLK-TAPPNCEYHYGRSPTLLRLSYCTCTNPAHITLVEANDLETWQIDLEVNNDNPLYTGKTFRLQFIIGPNY--PVESPQVQFMPLP-----ARPIPIHPHIYSNGHICLDILGD----EWTPVQTVASVCISLQSMLGSNDRDERPPDDERYIKHA--PKNPKQTRFVYHDDTV 71002042 Aspergillus_fumigatus_Af293 GKRLSKELLKMK-EHLPPGISI---VKD--------------------DNLEEWQMDIKVLDDNPLYKDQTYRLKFTFGSKY--PIEPPEVQFIELPSTSDTPRPIPMHPHIYSNGIICLDLLGSA---GWSPVQTVESVCMSIQSMLTANNRNERPPGDAEFVSYN--KRRPRDIAFMYDDDNV 46124999 Gibberella_zeae_PH_1 DRRLVKELGKMHKDGMPPGITLI-ERND--------------------SVAGDWFVDIQVLDENPLYKDQIYRLKFHFPKMY--PIEPPEVTFNK-----QTDRPIPMHPHIYSNGIICLDLLGQQ---GWSPVQSVSSVCMSIQSMLTSNDKDERPPGDEDFVRGN--RQRPRDIEFLYHDNTV Crei1000002793 Chlamydomonas_reinhardtii QRRIQSELNEWM-RSPPEGCCL---ESC--------------------EPMTSWVVIMQGAGGVRLYSDEVFRVRIDFGEQY--PLDPPDVIFL--------APA-PIHPHIYSNGHICLDILYSGRNGGWSPALTMSKVVLSLRSMLASNTDKRKPPGDAEYCARVG-NRSPKLSNWVFEDDKV 46099348 Ustilago_maydis_521 TKRLQKELSEIKVKGAPEGCEV---IKA--------------------DDLQEWQFSIQVLGNS-LYQDQKFGLRFRFSDSY--PMESPEVVFMV-----TDGFQAPVHPHVYSNGHICASILGN----EWSPVLTISSVLLTLQSMLASCKQLQRPPDNDAYVKRA--PISPKDSRFVYDDDTV Ngru1000001737 Naegleria_gruberi AKRIQKEIQKLG-SGVGIDIFI--NQPI--------------------NNLETIYCTLHGARNT-IYSGEQYLLKFTFSNDY--PLQSPEVVFV--------PPHVPLHEHVYTNGHICLNILYD----GWTPVMNIQSVCMSIQSMLSSATQKKKPIDNDTYVMSC--SHSPKNTRWNFHDDKV 18419831 Arabidopsis_thaliana TNRLQKEFMEWQ-TNPPSGFKH---RVS--------------------DNLQRWIIEVHGVPGT-LYANETYQLQVEFPEHY--PMEAPQVIFQ--------HPA-PLHPHIYSNGHICLDVLYD----SWSPAMRLSSICLSILSMLSSSSVKQKPKDNDHYLKNCKHGRSPKETRWRFHDDKV 18410856 Arabidopsis_thaliana TNRLQKELVEWQ-MNPPTGFKH---KVT--------------------DNLQRWIIEVIGAPGT-LYANDTYQLQVDFPEHY--PMESPQVIFL--------HPA-PLHPHIYSNGHICLDILYD----SWSPAMTVSSICISILSMLSSSTEKQRPTDNDRYVKNCKNGRSPKETRWWFHDDKV 18401461 Arabidopsis_thaliana CNRLQKELSEWQ-LNPPTGFRH---KVT--------------------DNLQKWTIDVTGAPGT-LYANETYQLQVEFPEHY--PMEAPQVVFV--------SPA-PSHPHIYSNGHICLDILYD----SWSPAMTVNSVCISILSMLSSSPAKQRPADNDRYVKNCKNGRSPKETRWWFHDDKV 18422281 Arabidopsis_thaliana CNRLQKELSEWQ-VNPPTGFKH---RVT--------------------DNLQKWVIEVTGAPGT-LYANETYNLQVEFPQHY--PMEAPQVIFV--------PPA-PLHPHIYSNGHICLDILYD----SWSPAMTVSSVCISILSMLSSSPEKQRPTDNDRYVKNCKNGRSPKETRWWFHDDKV 124399351 Paramecium_tetraurelia SRRLSKELEQMQ-KSFANEFNI--KLPN--------------------NEISHWIVGFEGAKGT-LYEGEKFELQFKFPNSYVEPIESPEVVFL--------GKP-PEHEHIYSNGFICLSILYD----EWSAAHNVSSLCLSIQSMMSSATIKMKPPNDADFVKQAT-GRGPKSYKWTFHDTKC 89300310 Tetrahymena_thermophila_SB210 TKRLQKDLEAMQ-KNYKDQFHV--TLPN--------------------NDLKLWHVEFTAAQGT-VFQGEKFKLQFKFSPEY--------VIFI--------GKI-PDHEHIYSNGFICLSILYD----EWSAALTVSSVCLSILSMLSSATKKGRPFNDAEFCKRSQ-GRGPKAFLCRQNIKKQ 124394210 Paramecium_tetraurelia AKRLQKDLEQMQ-KSYVDQFNV--RMPN--------------------NDIKHWIVAFEGAKGT-LYQGEKFELQFKFSNEY--PIESPEVIFI--------GKP-PEHEHIYSNGFICLSILFD----EWSAALTVSSVCLSIQSMLSSATKKMKPPNDAEFVKRAA-GRGPKSFLWSYHDEKC 124397544 Paramecium_tetraurelia AKRLQKDLEQMQ-KSYTDQFNV--RMPN--------------------NDIKHWIVAFEGAKGT-LYQGEKFELQFKFSNEY--PIESPEVIFI--------GKP-PEHEHIYSNGFICLSILFD----EWSAALTVSSICLSIQSMLSSATKKMKPPNDAEFVKRAA-GRGPKSFLWSYHDEKC 124416285 Paramecium_tetraurelia AKRLQKDLEQMQ-KSYTDQFNV--RMPN--------------------NDIKHWIVAFEGAKGT-LYQGEKFELQFKFSNEYVEPIESPEVIFI--------GKP-PEHEHIYSNGFICLSILFD----EWSAALTVSSVCLSIQSMLSSATKKMKPPNDAEFVKRAA-GRGPKSFLWSYHDEKC VIIb#55.m04812 Toxoplasma_gondii SSARQRVLAVLP-PHLTDYSMM---DTD--------------------HDGFVWYIALEGAAGT-LYEKEVFLVRFRFSPKY--PIEAPEVTFV--------PPFLPVHPHVYSNGHICLSILYD----SWSPALGVSSCGMSLLSMVSSCRQKQKPADDDAYCKVWG-SKSPKNVKWVFHDDRI 23509349 Plasmodium_falciparum_3D7 KRRLEKERLELL-SQKENTIKL----LQ--------------------EHADKWIIQITGAENT-LYSNETFQMQFKFTEKY--PIESPEVIFL--------GQP-PIHPHIYSNGHICLSILYD----HWSPVLSVNSICLSIISMLSSCKKKRKPLDDILYCSTGP-RISPKNMKWMFHDDKV 66827055 Dictyostelium_discoideum_AX4 AKRLQKELLDLK-TNPPPCISI---TEG--------------------DNLDKWVIAVDGTEGS-IYQGEHFKLQFKFSSGY--PLDSPEVIFI--------GTP-PIHPHIYSNGHICLSILYD----NWSPALTVSSVCLSILSMLSGCTEKIRPTDDSKYVSRVL-NKSPKEVRWMFHDDTV 17510293 Caenorhabditis_elegans TRRLMKELAQLK-SEAPEGLLVDNTSTS--------------------NDLKQWKIGVVGAEGT-LYAGEVFMLQFTFGPQY--PFNSPEVMFV--------GETIPAHPHIYSNGHICLSILSD----DWTPALSVQSVCLSILSMLSSSKEKKHPIDDAIYVRTC--SKNPSKTRWWFHDDSV Nvec1000014187 Nematostella_vectensis KKRLQKELCELQ-KRPPSGMKINKDSVS--------------------SSLAVWVIELDGATGT-LYENEKFLLQFKFGARY--PFESPEVTFI--------GGHVPVHPHVYSNGHICLSILTD----DWSPALSVEAVCISIVSMLSSCHEKKRPPDNNFYVSTC--HKNPKKTRWWYHDDSV ci0100140113 Ciona_intestinalis -KRIQKEILTMR-KSPPPGIRLCEDSL---------------------RSAPEFMVELTGATGT-LYSEQIFKLLFKFGERY--PFESPQVTFV--------GNCIPVHPHVYSNGHICLSILTE----DWSPALSTEAVCLSVISMLSSCTEKKLPPDNAFYIRTC--SKNPKDTKWWFHGKT- 58387116 Anopheles_gambiae_str_PEST QRRLQKELMSLI-KEPPPGVSVDEESVS--------------------QNLTQWIINIDGVEGT-LYEGEHFQLLFKFNNKY--PFDSPEVTFI--------GSNIPIHPHVYSNGHICLSILTD----DWSPALSVQSVCLSISSMLSSCREKRRPPDNGIYVKTC--NKNPKKTKWWYHDDSV 28573347 Drosophila_melanogaster ERRLHKELMSLI-KEPPPGVTIDTESVQ--------------------QNLSEWKINIKGFEGT-LYEGEDFQLLFKFNNKY--PFDSPEVTFI--------GTNIPVHPHVYSNGHICLSILTE----DWSPALSVQSVCLSIASMLSSCREKKRPPDNTIYVKTC--NKNPKKTKWWYHDDSV 115663109 Strongylocentrotus_purpuratus QKRLHKELMQIQ-KDPPPGIKINEEKAA--------------------TVLNTWHVDLDGAPNS-IYAGEKFQLQFKFSNKY--PFDSPEVVFI--------GSNIPVHPHVYSNGHICLSILTE----DWSPALSVQSVCLSIISMLSSCTEKKRPPDNSFYVKTC--NKSPKKTKWWYHDNN- 91079008 Tribolium_castaneum ERRLQKELMSLM-KEPPPGVEVDFDLAE--------------------QNLLHWIINMEGAAGT-LYEGERFQLQFKFSNKY--PFDSPEVTFV--------GNNIPVHPHIYSNGHICLSILTD----DWSPALSVQSVCLSIVSMLSSCKEKQRPPDNAFYVKTC--NKNPKKTKWWYHDDSV 66549055 Apis_mellifera KRRLQKELTSLI-REPPPGVHVDEDLTS--------------------QNLTQWIVHMEGAKGT-LYEGEQFQLQFRFSSKY--PFDSPEVTFI--------GGNIPIHPHIYSNGHICLSILTE----DWSPALSVQSICLSIVSMLSSCKEKKRPPDNSFYVKTC--SKNPKKTKWWYHDDNV Bflo1000025168 Branchiostoma_floridae QKRLQKELLALQ-KDPPPGVRVDEASVT--------------------KSLSTWQVDMDGAPGT-LFEGEKFQLLFKFGPRY--PFDSPQVVFT--------GPNIPVHPHVYSNGHICLSILTE----DWSPALSVQSVCLSIISMLSSCKEKKRPPDNNFYVRTC--SKNPKKTKWWYHDDSV Bflo1000032579 Branchiostoma_floridae QKRLQKELLALQ-KDPPPGVRVDEASVT--------------------KSLSTWQVDMDGAPGT-LFEGEKFQLLFKFGPRY--PFDSPQVVFT--------GPNIPVHPHVYSNGHICLSILTE----DWSPALSVQSVCLSIISMLSSCKEKLKTYD--FYLGRG--VRTSLYSDVF------ 45387655 Danio_rerio AQRLHKELLALQ-NDPPPGMTLNEKSVQ--------------------NTITQWIVDMEGASGT-VYEGEKFQLLFKFSSRY--PFDSPQVMFT--------GDNIPVHPHVYSNGHICLSILTE----DWSPALSVQSVCLSIISMLSSCTEKRRPSDNLFYVRTC--NKNPKNTKWWHHDDTC 68356096 Danio_rerio ------------------------------------------MREVCRTQSLSGLYDMEGAQGT-VYEGEKFQLLFKFSSRY--PFESPQVMFT--------GENIPVHPHVYSNGHICLSILTE----DWSPALSVQSVCLSIISMLSSCKEKRRPPDNSFYVKTC--NKNPKKTKWWYHDDTC 47223533 Tetraodon_nigroviridis QKRLQKELLALQ-NDPPPGMTLNEKSVQ--------------------NTITQWIVDMEGAPGT-LYEGEKFQLLFKFSSRY--PFDSPQVIFT--------GENIPIHPHVYSNGHICLSILTE----DWSPALSVQSVCLSIISMLSSCKEKRRPPDNSFYVRTC--NKNPKKTKWWYHG--- 82795417 Mus_musculus ---------------------------------------------------------MEGAPGT-LYEGEKFQLLFKFSSRY--PFDSPQVMFT--------GENIPIHPHVYSNGHICLSILTE----DWSPALSVQSVCLSIISMLSSCKEKTTTR--------------------------- 47933385 Homo_sapiens QKRLQKELLALQ-NDPPPGMTLSEKSVQ--------------------NSITQWIVDMEGAPGT-LYEGEKFQLLFKFSSRY--PFDSPQVMFT--------GENIPVHPHVYSNGHICLSILTE----DWSPALSVQSVCLSIISMLSSCKEKRRPPDNSFYVRTC--NKNPKKTKWWYHDDTC 75766089 Homo_sapiens QKRLQKELLALQ-NDPPPGMTLNEKSVQ--------------------NSITQWIVDMEGAPGT-LYEGEKFQLLFKFSSRY--PFDSPQVMFT--------GENIPVHPHVYSNGHICLSILTE----DWSPALSVQSVCLSIISMLS------------------------------------ 82795563 Mus_musculus QKRLQKELLALQ-NDPPPGMTLNEKSVQ--------------------NSITQWIVDMEGAPGT-LYEGEKFQLLFKFSSRY--PFDSPQVMFT--------GENIPIHPHVYSNGHICLSILTE----DWSPALSVQSVCLSIISMLSSCKEKRRPPDNSFYVRTC--NKNPKKTKWWYHGRNT 114620514 Pan_troglodytes QKRLQKELLALQ-NDPPPGMTLNEKSVQ--------------------NSITQWIVDMEGAPGT-LYEGEKFQLLFKFSSRY--PFDSPQVMFT--------GENIPVHPHVYSNGHICLSILTE----DWSPALSVQSVCLSIISMLSSCKEKRRPPDNSFYVRTC--NKNPKKTKWWYHDDTC 94364832 Mus_musculus QKRLQKELLALQ-NDPPPGMTLNEKSVQ--------------------NSITQWIVDMEGAPGT-LYEGEKFQLLFKFSSRY--PFDSPQVMFT--------GENIPIHPHVYSNGHICLSILTE----DWSPALSVQSVCLSIISMLSSCKEKRRPPDNSFYVRTC--NKNPKKTKWWYHDDTC 109475992 Rattus_norvegicus DKRLQKELLALQ-NDPPPGMTLNEKSVQ--------------------NSITQWIVDMEGAPGT-LYEGEKFQLLFKFSSRY--PFDSPQVMFT--------GENIPVHPHVYSNGHICLSILTE----DWSPALSVQSVCLSIISMLSSCKEKRRPPDNSFYVRTC--NKNPKKTKWWYHDDTC 47933381 Homo_sapiens PKRLQKELLALQ-NDPPPGMTLNEKSVQ--------------------NSITQWIVDMEGAPGT-LYEGEKFQLLFKFSSRY--PFDSPQVMFT--------GENIPVHPHVYSNGHICLSILTE----DWSPALSVQSVCLSIISMLSSCKEKRRPPDNSFYVRTC--NKNPKKTKWWYHDDTC 114620512 Pan_troglodytes PKRLQKELLALQ-NDPPPGMTLNEKSVQ--------------------NSITQWIVDMEGAPGT-LYEGEKFQLLFKFSSRY--PFDSPQVMFT--------GENIPVHPHVYSNGHICLSILTE----DWSPALSVQSVCLSIISMLSSCKEKRRPPDNSFYVRTC--NKNPKKTKWWYHDDTC consensus/100% .........................................................h.....s..a..p.a.hbh.hs..Y........s.F.............P.H.HlYoNG.lChslL.p.....aps......h..sl.Shhsus.................................. consensus/95% .....................................................h.h.h.s..ss.la.sp.ablbh.Fs.pY..P.psPpV.F.............P.H.HlYoNG.ICLslL.p.....Wpssbshpulhhol.SMLsusp.c.bP.ss..ah..................... consensus/90% ..Rl.+-h..h........h............................p.h..W.h.hpsh.so.lY.sc.abLbhpFs.pY..PhpsPpV.F.........s...P.H.HlYSNG.ICLslL.p....pWossholpulhhSl.SMLoSsppKbbP.ss..ah..s....s.p...a.ac.... consensus/85% ..RlpK-l..hb.p..s..hp...........................psh..W.l.hpGh.so.lY.sE.abLbhpFs.pY..PhcsPpV.F.........s.s.P.H.HlYSNGaICLslL.c....pWoPsholpSlhlSIbSMLoSsppKb+P.ss..ah..s....sP+p..a.aHDp.. consensus/80% ..RlpKEL..hb.pp.ssshpl..........................pslp.W.l.hpGh.sT.lY.sEpabLbhcFsspY..Ph-uPpV.F.........s.s.PhHsHlYSNGaICLsIL.-....sWSPuholpSlClSIbSMLSSspcKb+P.sss.ahpss...bsPKp.pa.aHDc.. consensus/75% ..RlpKEL..hb.pp.Psshpl..........................pslppWblphpGh.GT.LYpsEpFbLbFcFsscY..Ph-uPpVhF.........s.s.PhHPHlYSNGaICLSIL.-....cWSPAhoVpSVClSIbSMLSSspcKb+PsDssbalpss..s+sPKp.cWhaHD-p. consensus/70% .+RLQKEL..hb.pssPsshpl..p.sp....................pslppWhlshpGA.GT.LYpsEpFbLbFcFsscY..Ph-SPpVhF.........u.s.PlHPHlYSNGHICLSIL.-....-WSPALoVpSVCLSIhSMLSSspcKb+PsDssbYl+ss..s+sPKpscWhaHDDp. 9. BRUCE-like FINAL -HHHHHHHHHHH-----------EEEEE-----------------EEEEEEE-----------EEEEEE------------EEEEE----EEEEE-------EEEE----EE----------------------------------------EEEEEEEEEHHH--H----------------------HHHHHHHHHHHHHHHHHHH---------------HHHHHHHHHHHHH---------------------------------------------------------HHHHHHHHHHHHHHHHHHHHHHH-- 85118516 Neurospora_crassa_OR74A SVFRITKELSDLQKDSD----LSLAVACRDVDV-----------RNVKAMIIGPHETPYEFGFFEFAFVF-NKDYPRRSPQVQATTTNDGRTRFNPNIYANGKVCL----TWRGE--------------------------RGEQWSAA-QGLESILLSIQSLM--SMNPYENEPGFEDAKEAQDQKNQKDYIQKVIRGFAPCRSEYLGINPDGTFAPTSVSGEIASLGSGESDMDI---------------------LDEESSVPF------------------------EPFKDLCKRRFLWYYDSYLAAVEKGKS 15227320 Arabidopsis_thaliana WFKKVDQDWKILQNNLP----DGIFVRAYEDRM-----------DLLRAVIVGAFGTPYQDGLFFFDFHL-PSDYPSVPPSAYYHSGG---WRLNPNLYEEGKVCLSLLNTWTGR--------------------------GNEVWDPKSSSILQVLVSLQGLVL-NSKPYFNEAGYDKQVGTAEGEKNSLGYNENTF--------------------------LLNCK-TMMYL-----------------------MRKPPK----------------------------DFEELIKDHFRKRGYYILKACDAYMK 30685832 Arabidopsis_thaliana WVKKVQQEWSNLEANLP----NTIYVRVCEERM-----------DLLRAALVGAPGTPYHDGLFFFDIML-PPQYPHEPPMVHYHSGG---MRLNPNLYESGRVCLSLLNTWSGS--------------------------GTEVWNAGSSSILQLLLSFQALVL-NEKPYFNEAGYDKQLGRAEGEKNSVSYNENAF--------------------------LITCK-SMISM-----------------------LRKPPK----------------------------HFEMLVKDHFTHRAQHVLAACKAYME 70999189 Aspergillus_fumigatus_Af293 RMSRIRKEYEILETSLP----PGIFVRTWESRI-----------DILRVLIIGPQGTPYEYAPFVIDFQF-PEDYPNKPPASFFHSWTNGQGKVNPNLYEDGRICLSILGTWPTK-------------------------SPEESWSPVKSTALQILVSIMGLVL-VKNPFYNEAGYDALAVGDNRRVESSQYTEKAF--------------------------LMTRNFIKHA------------------------LHHPVA----------------------------GLEDVLTWNYIKSEQKDDASSRPNLL 24649371 Drosophila_melanogaster YQRAVQREYLMLKSSLP----NGVVVRAYEDRM-----------DLMSVMMVGPKRTPYQNALFFFDFQF-GRDYPKSPPVCHYISYCTD--RLNPNLYEGGRVCVSLLGTWMGR--------------------------DNEVWSPS-STMLQVLVSIQGLIL-VDEPYYNEAGYEKQRGTQLGNENSRVYNEMA---------------------------IIKIAQSTVKQ-----------------------LTNPPL----------------------------IFRNELIEHFKEFGTELYARMRAWSE 58383761 Anopheles_gambiae_str_PEST FYKAVQREHWLLRTALP----PGVWVRTFEDRL-----------DLLSVMIEGPKKTPYEDGLFLFDIQL-GLDYPRAPPLCHYISYCTD--RLNPNLYEDGKVCVSLLGTWSGR--------------------------GTEVWGPT-STLLQVIVSIQGLIL-VAEPYFNEAGYEKLRGSQQGKENSRMYNEMV---------------------------LLKLVQSMTKL-----------------------MSNPPE----------------------------VFREQILTHFHACGQRMYSRLKTWME 110760609 Apis_mellifera FFRTVSKELKLLKSSLP----PGIWVKGFEDRI-----------DLYSVMFRGPEKTPYEDGLFLFDFQL-SADYPAAPPLCHYISYCND--RLNPNLYEDGKVCVSLLGTWSGR--------------------------GTEVWTSS-STLLQVIVSIQGLIL-VPEPYFNEAGFDKQKGSQQGKENSRMYNEMV---------------------------VLKLVQAQTKL-----------------------LQHPPP----------------------------VFKDIIIEHFKRHAKKLLQRLELWME 91084477 Tribolium_castaneum FLVALQKEYKLLKDSLP----AGVWVRTYDNRM-----------DLLSVMIRGPAKTPYEDGLFLFDIQL-SPDYPKNPPGVHYISYITE--PLNPNLYVEGKVCVSLLGTWMGR--------------------------GSEMWGPN-STLLQLIVSIQGLIL-VAQPYFNEAGYERQTHTQQGCENSRTYNEFVI--------------------------LKLVQ-SMTEL-----------------------LNAAPK----------------------------VFQNEVLAHFQAKGEAMCERLMKYCD 58268232 Cryptococcus_neoformans_var_neoformans_JEC21 YHSRIQKEHRALQSSLP----ENILVRTYEDRL-----------DLMRVLIIGPEGTPYTDAPFVFDVYLNPTKFPNEPPIVHFHSHTNGHGRCNPNLYEEGKVCLSILGTWSGD--------------------------ESESWNPSKSSLLQVFVSISGLVL-VRCPYHCEPAFAKLEGTREGKINSRLYSEKAY--------------------------VLSRTFVRTA------------------------LERPPT----------------------------GLESEI-RYFYLTRGRLRSVIDHAQR 71981623 Caenorhabditis_elegans RTKRIAKELASIANALPLNASNSIYVCYDEGRV-----------DIIKVLISGPDDTPYANGLFEFDIFF-PTGYPFSPPKCAFLTTGSGNVRFNPNLYNDGKICLSILGTWEGR--------------------------PEEKWNP-YCSLMQVLVSIQ--------------------------------------------------------------------------------------------------------------------------------------EVIEKHLWLKREAILKQAQAWID 124399658 Paramecium_tetraurelia KIQKFLNQISTIEQNIPIQATNSIFIRHDNARM-----------DCMRIVIFGGSGTPYAHGAFLYDLYF-GNDYPVRPPKIKLATPRHDKVGFNPNLYNFRRVWLDLLGTW------------------------------DDSWNVDYSTILEKLFSVKSLVM-SENFMINKP--ESQMETLNVGQANRGYCNFIK--------------------------INNIRYAMIEQ-----------------------LQNPPR----------------------------GFEEVIKKGFYLRKELIMKEIELWIE 124400960 Paramecium_tetraurelia KMQKLVSEISTIEENLPLEATNSIFLRYDTDRM-----------DCMRTIIFGASGTPYAHGAFLYDMFF-GDDYPQRPPKMKLATTGHGKVRFNPNLYNCGKVCLSLLGTW------------------------------GDNWIANFSTILQILVSVQSMVM-SEYVMFNEPGWESQMGTPNGEQANRGYCNFIK--------------------------IQNIRYAMVEQ-----------------------LQNPPR----------------------------GFEAVIKKSFYLRKELIMKEIELWVE 15219165 Arabidopsis_thaliana WVKDIQKEWKILDKNLP----ETIFVRACESRI-----------DLLRAVIIGAEGTPYHDGLFFFDIQF-PDTYPSVPPKVHYHSGG---LRINPNLYKCGKVCLSLISTWTGK--------------------------KREKWLPKESTMLQLLVSIQALIL-NEKPYYNEPGYEKSMGTPLGESYSKDYSENVF------------------------------VFSLKTM---------------------------------------------------------HFEEFVRSHFFVRSHDIVKACNAYKD 68437089 Danio_rerio FFSTVRKEMALLATSLP----DGIMVKTFEDRM-----------DLFSALIKGPHRTPYEDGLFLFDIQL-PNIYPAVPPLFRYLSQCSG--RLNPNLYDNGKVCVSLLGTWIGK--------------------------GTERWTSK-SSLLQVLISIQGLIL-VNEPYYNEAGFDSDRGLQEGYENSRCYNEMAL--------------------------IKLVQ-SMTQL-----------------------LQQPVE----------------------------VFQQEVYEHFACSGWRLVHRLESWLE 47214958 Tetraodon_nigroviridis FFSTVRKEMALLATSLP----EGIMVKTFEDRMVGVLTSGSLVLDLFSALIKGPTRTPYEDGLFLFDIQL-PNIYPAVPPLFHYLSQCSG--RLNPNLYDNGKVCVSLLGTWIGKVGVVQQHTQGSYCHWPSTILVEQVVEGTERWTSK-SSLLQVLISIQGLIL-VNEPYYNEAGFDSDRGLQEGYENSRCYNEMAL--------------------------IKMVQ-SMTQL-----------------------LQNPVE----------------------------VFRQEIQEHFALNGWRLVHRLEAWLD 50234896 Mus_musculus FFSTVRKEMALLATSLP----DGIMVKTFEDRM-----------DLFSALIKGPTRTPYEDGLYLFDIQL-PNIYPAVPPHFCYLSQCSG--RLNPNLYDNGKVCVSLLGTWIGK--------------------------GTERWTSK-SSLLQVLISIQGLIL-VNEPYYNEAGFDSDRGLQEGYENSRCYNEMAL--------------------------IRVVQ-SMTQL-----------------------VRRPPE----------------------------VFEQEIRQHFSVGGWRLVNRIESWLE 109492276 Rattus_norvegicus FFSTVRKEMALLATSLP----DGIMVKTFEDRM-----------DLFSALIKGPTRTPYEDGLYLFDIQL-PNIYPAVPPHFCYLSQCSG--RLNPNLYDNGKVCVSLLGTWIGK--------------------------GTERWTSK-SSLLQVLISIQGLIL-VNEPYYNEAGFDSDRGLQEGYENSRCYNEMAL--------------------------IRVVQ-SMTQL-----------------------VRRPPE----------------------------VFEQEIRQHFSVGGWRLVNRIESWLE 109489413 Rattus_norvegicus FFSTVRKEMALLATSLP----DGIMVKTFEDRM-----------DLFSALIKGPTRTPYEDGLYLFDIQL-PNIYPAVPPHFCYLSQCSG--RLNPNLYDNGKVCVSLLGTWIGK--------------------------GTERWTSK-SSLLQVLISIQGLIL-VNEPYYNEAGFDSDRGLQEGYENSRCYNEMAL--------------------------IRVVQ-SMTQL-----------------------VRRPPE----------------------------VFEQEIRQHFSVGGWRLVNRIESWLE 114670605 Pan_troglodytes FFSTVRKEMALLATSLP----EGIMVKTFEDRM-----------DLFSALIKGPTRTPYEDGLYLFDIQL-PNIYPAVPPHFCYLSQCSG--RLNPNLYDNGKVCVSLLGTWIGK--------------------------GTERWTSK-SSLLQVLISIQGLIL-VNEPYYNEAGFDSDRGLQEGYENSRCYNEMAL--------------------------IRVVQ-SMTQL-----------------------VRRPPE----------------------------VFE-----------QEIRSACMPFVL 33636750 Homo_sapiens FFSTVRKEMALLATSLP----EGIMVKTFEDRM-----------DLFSALIKGPTRTPYEDGLYLFDIQL-PNIYPAVPPHFCYLSQCSG--RLNPNLYDNGKVCVSLLGTWIGK--------------------------GTERWTS-KSSLLQVLISIQGLIL-VNEPYYNEAGFDSDRGLQEGYENSRCYNEMAL--------------------------IRVVQ-SMTQL-----------------------VRRPPE----------------------------VFEQEIRQHFSTGGWRLVNRIESWLE 124409420 Paramecium_tetraurelia KMNRIIKELSDIGENLPLQLTNSIFLRYDKDRM-----------DAMQAMIFGSSGTPYAHGAFLYNFLF-CDDYPSRPPKCLLETTGHGKVRFNPNLYNCGKVCLSLLGTW------------------------------GDNWKANESTLWQILVSIQAMVM-SEYVYFNEPGWESSMGTVDGEKNNRGYCNIVK--------------------------LANIRYAMIEQ-----------------------LQKPPA----------------------------GFEEVIQKSFYLRKDVIIKEVQQWID 89289979 Tetrahymena_thermophila_SB210 QAKRIQNEIRILVKNLPIYSTNSIFVRCNELDT-----------NHFQCIIMGSKDTPYAYGAFAYDLHC-DYSFPNQPPKMRLITTGGGSVRFNPNLYKDGFICLSLLGTWNGQ--------------------------KTEKWNESTSNILQILISIQSLVM-NDQVYYNEPSVGKQ--NQENEMRNRAYQNIIK--------------------------ISNIRHAMIDQ-----------------------LKNPDP----------------------------EFRDIIKMHFYLNKIEILNQCDKWII 58261412 Cryptococcus_neoformans_var_neoformans_JEC21 RSLAIAKELAILTTSLPVAWHSSVFLRVDEARV-----------DVLKAMIIGPEGTPYENGCLLFDIFL-PLEYNQRCPLVKYMTTNGGKWRYNPNLYADGKVCLSLLGTWQGP--------------------------G---WIAGQSTLLQVLISIQSLIL-CEEPYLNEPGWANMSGS----AQSKAYNANIR--------------------------RMVLADAMANN-----------------------IKSPPH----------------------------PFENEIKTHFRLKAKAIRQQIEKWRE 15229481 Arabidopsis_thaliana WAKRIQDEWRILEKDLP----EMIFVRAYESRM-----------DLLRAVIIGAQGTPYHDGLFFFDIFF-PDTYPSTPPIVHYHSGG---LRINPNLYNCGKVCLSLLGTWSGN--------------------------QREKWIPNTSTMLQVLVSIQGLIL-NQKPYFNEPGYERSAGSAHGESTSKAYSENTF--------------------------ILSLK-TMVYT-----------------------MRRPPK----------------------------YFEDFAYGHFFSCAHDVLKACNAYRN 124412336 Paramecium_tetraurelia KIVRLAQEFADMSTSLPIEHTNAIFVRADKERV-----------DVMKALVMGAKGTPYAHGAFLFDIYA-DDSYPNAPPKMNLSTTGNGKVRFNPNLYSCGKVCLSLLGTWRGN--------------------------ASENWDPKISTLLQVLVSTQAIIM-SEEVYFNEPGFEQEANTDEGEKKNEGYSNIVR--------------------------YCNIKYAMIDQ-----------------------IRDPPK----------------------------GFETIIRRHFYLKKQEILEECNKWVE 124406526 Paramecium_tetraurelia KIVRLAQEFADMSTSLPIEHTNAIFVRADKERV-----------DVMKALVMGAKGTPYGHGAFLFDIYA-DDSYPNAPPKMNLSTTGNGKVRFNPNLYSCGKVCLSLLGTWRGN--------------------------ASENWDPKISTLLQVLVSTQAIIM-SEEVYFNEPGSEQEANTDEGEKRNEGYSNIVR--------------------------YCNIKYAMIDQ-----------------------IRDPPK----------------------------GFETIIKRHFYLKKQEILEECNKWVE 89299207 Tetrahymena_thermophila_SB210 KMQRLIKEIADLAHSLPMTPQSSIFVRYDTSRM-----------DVMKSLIFGAEDTPYAHGAFVFDMFF-DDSYPQNSPKVNLATTGSAKIRFNPNLYHCGKVCLSLLGTWRGS--------------------------SNENWNPNISTILQLLVSIQAIVM-SEYVYFNEPGYEGHAGTAEGEKLNRGYQNIVK--------------------------IGNIRYAMIEQ-----------------------INNPLP----------------------------EFKEVIMMSFFLKKDMILKECEKWLD 124401200 Paramecium_tetraurelia KLIRLAQEFADMSNSLPVEHTNSIFVRADKDRV-----------DVIKALIMGASGTPYAHGAYLFDIYF-EDQYPNTPPKMNLSTTGSGKIRFNPNLYACGKVCLSLLGTWRGH--------------------------ASENWDPKLSTILQVLVSTQAIIM-SEDVYFNEPGFEGEAGQQDGEMKNEAYSNIVR--------------------------YGNIQYAMIEN-----------------------IKNPPT----------------------------GFETIIRRHFYLKKGEIMEEVRKWVQ 89288400 Tetrahymena_thermophila_SB210 KMVRLAQEIADMANSLPIDHTSSIFVRCDSKRV-----------DVMKCIIMGSSGTPYAHGAFLYDIFF-EDSYPNTSPRVNLQTTGNNKVRFNPNLYNCGK------GTWKGQ--------------------------ASENWDPKISTLLQVLVSLSAIIM-NEDIYFNEPGYEGHSGTEDGDRKNEAYSNIVR--------------------------FSNIKYAMIQT-----------------------IRNPPE----------------------------GFEDVIKRHFYLKKDEILKECEGWIE 89303321 Tetrahymena_thermophila_SB210 KMVRLAQELADLSTALPIDHTNSIFVRCDTDRV-----------DVMKCMVMGSKGTPYAHGAFIFDVYF-SDEYPNQPPKCNLETTGAGKVRFNPNLYACGKVCLSLLGTWRGN--------------------------ASENWDPKISTLLQILVSLQAIIM-SEEVYFNEPGFEGEAGSEEGERKNEAYSNIVR--------------------------YCNIKYAMIDQ-----------------------IKNPPE----------------------------GFQAVIMRHFYLKKQEILEECQKWIK 89285001 Tetrahymena_thermophila_SB210 KVQRLKNELKQLAKSLPIQSSNSIFLVYDEQRL-----------DVMRALIFGSEDTPYAHGAYFFDIFI-PDDYPQNPPKVSIVSTKDNSIRFNPNLYADGLVCISLLGTWEGD--------------------------QSENWDPTNSNIYQILISIQSLIM-NEDVYFNEPYYQEWKGSEQGDKCNRAYSNIVK--------------------------YYNIQHAIGDM-----------------------IENPPY----------------------------EFQQVIYTHFLLKREAIKATAQAWIE 115964468 Strongylocentrotus_purpuratus CLLRIKRDIMNIYTDPP----LGMCIVPEEDI------------TRVHALITGPFDTPYEGGFFHFFVKF-PPDYPIRPPRIKLMTTGDGSVRFNPNLYRSGKVCLSILGTWRGP--------------------------A---WSPA-QSLSSVLMSIQSLM--NEKPYHNEPGFEQ------------------------------------------------------------------------------------------------------------------------------------------- 31542455 Mus_musculus VYFGLSGISCPFIREPP----PGMFVVPDTVDM-----------TKIHALITGPFDTPYEGGFFLFVFRC-PPDYPIHPPRVKLMTTGNNTVRFNPNFYRNGKVCLSILGTWTGH--------------------------A---WSPA-QSISSVLISIQSLM--TENRYHNEPGFEQERHPGE----SKNYNECIR--------------------------HETIRVAVCDM-----------------------MEGKCPCPE-------------------------PLRGVMEKSFLEYYDFYEVACKDRLH 114666526 Pan_troglodytes CLLRIKRDIMSIYKEPP----PGMFVVPDTVDM-----------TKIHALITGPFDTPYEGGFFLFVFRC-PPDYPIHPPRVKLMTTGNNTVRFNPNFYRNGKVCLSILGTWTGP--------------------------A---WSPA-QSISSVLISIQSLM--TENPYHNEPGFEQERHPGD----SKNYNECIR--------------------------HETIRVAVCDM-----------------------MEGKCPCPE-------------------------PLRGVMEKSFLEYYDFYEVACKDRLH 12751495 Homo_sapiens ---------MSIYKEPP----PGMFVVPDTVDM-----------TKIHALITGPFDTPYEGGFFLFVFRC-PPDYPIHPPRVKLMTTGNNTVRFNPNFYRNGKVCLSILGTWTGP--------------------------A---WSPA-QSISSVLISIQSLM--TENPYHNEPGFEQERHPGD----SKNYNECIR--------------------------HETIRVAVCDM-----------------------MEGKCPCPE-------------------------PLRGVMEKSFLEYYDFYEVACKDRLH 83320076 Rattus_norvegicus ---------MSIYKEPP----PGMFVVPDTVDM-----------TKIHALITGPFDTPYEGGFFLFVFRC-PPDYPIHPPRVKLMTTGNNTVRFNPNFYRNGKVCLSILGTWTGP--------------------------A---WSPA-QSISSVLISIQSLM--TENPYHNEPGFEQERHPGD----SKNYNECIR--------------------------HETIRVAVCDM-----------------------MEGKCPCPE-------------------------PLRGVMEKSFLEYYDFYEVACKDRLH 47217004 Tetraodon_nigroviridis CILRIKRDIMSIYKEPP----PGMFVVPDPQDM-----------TKIHALITGPFDTPYEGGFFLFLFRC-PPDYPIHPPRVKLITTGHNTVRFNPNFYRNGKVCLSILGTWTGP--------------------------A---WSPA-QSISSVLISIQSLM--TENPYHNEPGFEQERHPGD----SKNYNECIR--------------------------HETMRVAVCDM-----------------------LEGKIACPE-------------------------ALWSVMEKSFLEYYDFYEGVCKERLY 50539720 Danio_rerio ---------MSIYKEPP----PGMFVVPDPHDM-----------TKIHALITGPFDTPYEGGFFLFLFRC-PPDYPIHPPRVKLITTGHNTVRFNPNFYRNGKVCLSILGTWTGP--------------------------A---WSPA-QSISSVLISIQSLM--TENPYHNEPGFEQERHPGD----SKNYNECIR--------------------------HETMRVAVCDM-----------------------LEGKVSCPE-------------------------ALWSVMEKSFLEYYDFYEGVCKERLH 60472388 Dictyostelium_discoideum_AX4 RTIRIAQEQGSMMKSLPLSYESSIFVRVDEKNI-----------DAMQCLITGPKDTPYSGGCFLFNCTF-PQEYPSSPPHVTILTTGGGSVRFNPNLYNNGKVCLSLLGTWAGS--------------------------SGETWNPNTSTLLQVFISIQSLIL-VPEPFFNEPGFESQIGTKSGIASSSSYNHNLI--------------------------PSTIQYAMIEM-----------------------IQNPPP----------------------------PFKDVILAHFFYQKDTIKKQCDAWVN 28830048 Dictyostelium_discoideum RTIRIAQEQGSMMKSLPLSYESSIFVRVDEKNI-----------DAMQCLITGPKDTPYSGGCFLFNCTF-PQEYPSSPPHVTILTTGGGSVRFNPNLYNNGKVCLSLLGTWAGS--------------------------SGETWNPNTSTLLQVFISIQSLIL-VPEPFFNEPGFESQIGTKSGIASSSSYNHNLI--------------------------PSTIQYAMIEM-----------------------IQNPPP----------------------------PFKDVILAHFFYQKDTIKKQCDAWVN 91086823 Tribolium_castaneum RVKRIAQEAVTLSTSLPLSYSSSVFVRYDTSRL-----------DVMKVLMTGPADTPYANGCFEFDVFF-PPDYPLSPMMINLETTGHHTIRFNPNLYNDGKVCLSVLNTWHGR--------------------------PEEKWNAQTSNFLQVLVSIQSLIL-VPEPYFNEPGYERSRGTPAGTQSSREYNANVS--------------------------QATVRWAMLEQ-----------------------ILNPCL----------------------------CFKDVIYTHFYLKRHEILAQVEGWIK 45550729 Drosophila_melanogaster RVKRLAQEAVTLSTSLPLSFSSSVFVRCDTDRL-----------DIMKVLITGPADTPYANGCFEFDVFF-PPDYPNQPMLINLETTGRHSVRFNPNLYNDGKVCLSVLNTWHGR--------------------------PEEKWNAQTSSFLQVLVSIQSLIL-VPEPYFNEPGFERSRGSPSGTNSSREYNSNIY--------------------------QACVRWAMLEQ-----------------------IRSPSQ----------------------------CFKDVIHKHFWLKREEICAQIEGWIE 110756922 Apis_mellifera RVKRLAQEAVTLSTALPLSYSSSVFVRCDADRL-----------DVMKVLITGPAETPYANGCFEFDVYF-PPDYPNSPMLINLETTGRHTVRFNPNLYNDGKVCLSVLNTWHGR--------------------------PEEKWNAHTSSFLQVLVSIQSLIL-VSEPYFNEPGYERSRGTTSGAQSSQEYNANIC--------------------------QATAKWAMLDQ-----------------------IRNPCP----------------------------CFKEVIHTHFWIKRHEIVAQLERWIR 58382934 Anopheles_gambiae_str_PEST RVKRLAQETVTLSTSLPLSYSSSVFVRCDTDRL-----------DIMKVLITGPAETPYANGCFEFDVYF-PPDYPNSPMMINLETTGRNTVRFNPNLYNDGKVCLSVLNTWHGR--------------------------PEEKWNAHTSSFLQVLVSIQSLIL-VPEPYFNEPGFERSRGTPTGTHSSREYNSNIY--------------------------QACVRYAMLEQ-----------------------LRHPCP----------------------------CFQDVIHAHFWLKRNEICNQIEEWIA 61744456 Homo_sapiens RARRLAQEAVTLSTSLPLSSSSSVFVRCDEERL-----------DIMKVLITGPADTPYANGCFEFDVYF-PQDYPSSPPLVNLETTGGHSVRFNPNLYNDGKVCLSILNTWHGR--------------------------PEEKWNPQTSSFLQVLVSVQSLIL-VAEPYFNEPGYERSRGTPSGTQSSREYDGNIR--------------------------QATVKWAMLEQ-----------------------IRNPSP----------------------------CFKEVIHKHFYLKRVEIMAQCEEWIA 109478992 Rattus_norvegicus RARRLAQEAVTLSTSLPLSSSSSVFVRCDEERL-----------DIMKVLITGPADTPYANGCFEFDVYF-PQDYPSSPPLVNLETTGGHSVRFNPNLYNDGKVCLSILNTWHGR--------------------------PEEKWNPQTSSFLQVLVSVQSLIL-VAEPYFNEPGYERSRGTPSGTQSSREYDGNIR--------------------------QATVKWAMLEQ-----------------------IRNPSP----------------------------CFKEVIHKHFYLKRVEVMAQCEEWIA 10048468 Mus_musculus RARRLAQEAVTLSTSLPLSSSSSVFVRCDEERL-----------DIMKVLITGPADTPYANGCFEFDVYF-PQDYPSSPPLVNLETTGGHSVRFNPNLYNDGKVCLSILNTWHGR--------------------------PEEKWNPQTSSFLQVLVSVQSLIL-VAEPYFNEPGYERSRGTPSGTQSSREYDGNIR--------------------------QATVKWAMLEQ-----------------------IRNPSP----------------------------CFKEVIHKHFYLKRIELMAQCEEWIA 47213168 Tetraodon_nigroviridis RSRRLAQEAVTLSTSLPLSSSSSVFVRCDEERL-----------DIMKVLITGPADTPYANGCFEFDVYF-PQDYPNSPPLVNLETTGGHSVRFNPNLYNDGKVCLSILNTWHGR--------------------------PEEKWNPQTSSFLQVLVSVQSLIL-VAEPYFNEPGYERSRGTPSGTQSSREYDGNIR--------------------------QASVKWAMLEHCATLRHASKRYGCPRADAFPG--LRPPAASARG------------------------CPSRVIHKHFYLKRTEIMCQCEEWIA 47213169 Tetraodon_nigroviridis RSRRLAQEAVTLSTSLPLSSSSSVFVRCDEERL-----------DIMKVLITGPADTPYANGCFEFDVYF-PQDYPNSPPLVNLETTGGHSVRFNPNLYNDGKVCLSILNTWHGR--------------------------PEEKWNPQTSSFLQVLVSVQSLIL-VAEPYFNEPGYERSRGTPSGTQSSREYDGNIR--------------------------QASVKWAMLEQ-----------------------LRNPSP----------------------------CFKEVIHKHFYLKRTEIMCQCEEWIA 68387366 Danio_rerio RARRLAQEAVTLSTSLPLSSSSSVFVRCDEERL-----------DIMKVLITGPADTPYANGCFEFDVYF-PQDYPNSPPLVNLETTGGHSVRFNPNLYNDGKVCLSILNTWHGR--------------------------PEEKWNPQTSSFLQVLVSIQSLIL-VAEPYFNEPGYERSRGTPSGTQSSREYDGNIR--------------------------QATVKWAMLEQ-----------------------MRNPSP----------------------------CFKEVIHRHFYLKRAEIMAQCESWIC consensus/100% ...........h....s......hhl..................p.hp.hh.Gs..TPY..u.h.h.h.h....as..s....h.o.......hNPNhY...b.......TW.................................W....psh.p.hhS...... .......................................................................................... .......... ........................... consensus/95% ...........h.p..P.....shhl.....ph...........s.hpshh.Gs..TPY..uha.a.h.h.s..YP..s..h.h.o......+hNPNhY.pG+lClsllsTW.................................W.s..psh.plhlS.puhh. s...h.NEs..............p..hp.............................................................. .......... ......h...h......h...h..... consensus/90% ....l.p-...h.pp.P.....uhhl...p.ch...........s.hpshl.Gs..TPY..Ghabashbh.s..YP..PP.hph.o.s....RhNPNhY.sG+VClSlLsTW.G...............................Wss..poh.plllSlQulh. s.psaaNEsGa-...........sp.Ys..h..............................h..s......................... hp..s..... ..hb..h..pF......h...hp.b.. consensus/85% ....l.p-...l.ps.P.....ulhV.s.p.ch...........s.hpshI.Gs..TPY..GhabFshbh.s..YP..PP.hph.o.s....RhNPNLYpsGKVClSlLsTW.G...............................Wss..pohbplLlSlQulh. s.psYaNEsGa-p.....p....sp.Ys..hb.............................h..uh.p...................... hp..s..... ..hb..h..pF......h...hp.bh. consensus/80% b..pl.pEh..l.psLP....sulhV+s.ppch...........DhhpslI.Gs..TPY..GhFbFshbh.s.pYP..PP.hph.ops....RhNPNLYpsGKVClSlLsTW.Gp..........................s.-.Wss..SohhQlLlSlQuLlh s.psYaNEsGa-pp....p....sc.Ys..hb............................slp.uh.pb..................... hp.ss..... .shcp.l.ppFb.....h...hp.ah. consensus/75% bh.pl.pEh..l.psLP....sulaV+s.ppRh...........DhhpslI.Gs..TPY.pGhFbFDhbh.s.pYP..PPbhph.ops....RhNPNLYpsGKVClSlLsTW.Gp..........................s.E.Wss..SohLQVLlSlQuLlh s.psYaNEsGa-pp.ss.ps..psc.Ysp.hb............................slp.uh.pb..................... lppPs..... .sFcp.l.ppFbb...bl...hc.alp consensus/70% bh.pl.pEh..l.psLP....sulaV+spcpRh...........Dlh+slI.GP.cTPY.sGhFhFDhbh.PssYP.pPPbhphbossssphRhNPNLYpsGKVCLSlLGTWpGp..........................spEpWssp.SolLQVLlSIQuLlh s.pPYaNEPGa-ppbGs.pG.ppS+.Ysp.lb..........................b.slp.uM.pb..................... lppPs..... .sFcpllbpHFhhb.pblh.phc.alp 10.Ubc6p FINAL ----HHHHHHHHHHHHHHH-------------------EEEE----------------EEEEE------------EEEEEEE-----------EEEE--EEE-----EEEEEEE----------HHHHHHHHHHHH-- 45201468 Ashbya_gossypii_ATCC_10895 ---ASRQAYKRLSKEYKMMT-ENPPPYIVA--APKEDNILVW----------------HYVITG-PPET-PYEDGQYHGTLVFPNDYPFNPPAIRMLTPNGRFRENTRLCLSMSDYHPDTWNPSWSVATILTGLLSFM 50309721 Kluyveromyces_lactis_NRRL_Y_1140 ---ASIQANKRLTKEYKNIV-NNPPPFIIA--APHEDNILEW----------------HYVITG-PPST-PYENGQYHGTLTFPSDYPFNPPAIRMITPNGRFKENTRLCLSMSDYHPEAWNPAWSVVTILNGLLSFM 6320947 Saccharomyces_cerevisiae ---ATKQAHKRLTKEYKLMV-ENPPPYILA--RPNEDNILEW----------------HYIITG-PADT-PYKGGQYHGTLTFPSDYPYKPPAIRMITPNGRFKPNTRLCLSMSDYHPDTWNPGWSVSTILNGLLSFM 50290111 Candida_glabrata_CBS_138 ---ATKQAQKRLTKEYKMMV-ENPPPFIIA--RPNEENILEW----------------HYVISG-PPDT-PYDGGQYHGTLTFPSDYPYKPPAIRMITPNGRFKENTRLCLSMSDYHPDTWNPGWSVATILNGLLSFM 32563946 Caenorhabditis_elegans -SGVSTVALQRLKKDYQRLL-KEPVPFMKA--APLETNILEW----------------RYIIIG-APKT-PYEGGIYMGKLLFPKDFPFKPPAILMLTPNGRFQTNTRLCLSISDYHPDTWNPAWTVSTIITGLMSFM 21358599 Drosophila_melanogaster GGRKQPTAVSRMKQDYMRLK-RDPLPYITA--EPLPNNILEW----------------HYCVKG-PEDS-PYYGGYYHGTLLFPREFPFKPPSIYMLTPNGRFKTNTRLCLSISDFHPDTWNPTWCVGTILTGLLSFM 58388043 Anopheles_gambiae_str_PEST MTNQKPTATCRLKQDYMRLK-RDPVPYITA--EPLPSNILEW----------------HYVIKG-PEDS-PYYGGYYHGTLLFTKEFPFKPPSIYMTTPNGRFKTNKRLCLSISDFHPDTWNPAWSVATILTGLLSFM ci0100138685 Ciona_intestinalis RSKVPITATQRLKQDYMRLK-KDPIPYITA--EPLPSNILEW----------------HYLVTG-PDDT-PYTGGFYHGKLVFPREFPFRPPAIYMITPSGRFKCNTRLCLSISDFHPDTWNPAWSVGTILTGLLSFM 48132665 Apis_mellifera -NRITNSATARLKQDYLRLK-KDPIPYVVA--EPVPSNILEW----------------HYVVKG-PEKT-PYEGGFYHGKLIFPVEFPFQPPSIYMTTPNGRFKVNTRLCLSISDFHPDTWNPAWSVSTILTGLLSFM 91082969 Tribolium_castaneum SNRKTNSATSRLKQDYLRLK-RDPVPYITA--EPLPSNILEW----------------HYVVCG-PENT-PYEGGFYHGKLVFPREFPFKPPSIYMITPNGRFRTNKKLCLSISDFHPDTWNPAWSVSTILTGLLSFM 115969448 Strongylocentrotus_purpuratus SKRVNTTASARLKQDYMRLK-KDPVPYVTA--EPLPSNILEW----------------HYVVRG-PKET-PYEGGLYHGKLVFPREFPFKPPSIYMTTPNGRFKTNTRLCLSISDFHPDTWNPAWSVSTILTGLLSFM Bflo1000025151 Branchiostoma_floridae QRRAPTTATQRLKQDYLRLM-KDPVPYITA--APLPSNILEW----------------HYVVKG-PENS-PYEGGQYHGKLVFPREFPFKPPSIYMMTPNGRFKCNTRLCLSISDFHPDTWNPAWSVSTILTGLLSFM 37577126 Homo_sapiens SKRAPTTATQRLKQDYLRIK-KDPVPYICA--EPLPSNILEWFKRFSWLSLLSSWDYRHYVVRG-PEMT-PYEGGYYHGKLIFPREFPFKPPSIYMITPNGRFKCNTRLCLSITDFHPDTWNPAWSVSTILTGLLSFM 37577124 Homo_sapiens SKRAPTTATQRLKQDYLRIK-KDPVPYICA--EPLPSNILEW----------------HYVVRG-PEMT-PYEGGYYHGKLIFPREFPFKPPSIYMITPNGRFKCNTRLCLSITDFHPDTWNPAWSVSTILTGLLSFM 114550572 Pan_troglodytes NKRAPTTATQRLKQDYLRIK-KDPVPYICA--EPLPSNILEW----------------HYVVRG-PEMT-PYEGGYYHGKLIFPREFPFKPPSIYMITPNGRFKCNTRLCLSITDFHPDTWNPAWSVSTILTGLLSFM 85662413 Mus_musculus NKRAPTTATQRLKQDYLRIK-KDPVPYICA--EPLPSNILEW----------------HYVVRG-PEMT-PYEGGYYHGKLIFPREFPFKPPSIYMITPNGRFKCNTRLCLSITDFHPDTWNPAWSVSTILTGLLSFM 56090425 Rattus_norvegicus NKRAPTTATQRLKQDYLRIK-KDPVPYICA--EPLPSNILEW----------------HYVVRG-PEMT-PYEGGYYHGKLIFPREFPFKPPSIYMITPNGRFKCNTRLCLSITDFHPDTWNPAWSVSTILTGLLSFM 68442601 Danio_rerio NKRAPTTATQRLKQDYLRIK-KDPVPYICA--EPLPSNILEW----------------HYLVRG-PEKT-PYEGGYYHGKLIFPREFPFKPPSIYMITPNGRFKCNTRLCLSITDFHPDTWNPAWSVSTILTGLLSFM 68442599 Danio_rerio NKRAPTTATQRLKQDYLRIK-KDPVPYICA--EPLPSNILEW----------------HYLVRG-PEKT-PYEGGYYHGKLIFPREFPFKPPSIYMITPNGRFKCNTRLCLSITDFHPDTWNPAWSVSTILTGLLSFM 47212316 Tetraodon_nigroviridis NKRAPTTATQRLKQDYLRIK-KDPVPYICA--EPLPSNILEW----------------HYVVRG-PEKT-PYEGGYYHGKLIFPREFPFKPPSIYMITPNGRFKCNTRLCLSITDFHPDTWNPAWSVSTILTGLLSFM Tpseu1000010010 Thalassiosira_pseudonana NTMASPLATKRLQRELKALM-KNPLTSPKIIAQPNEANILEW----------------HYVLEGDADPNSPYNGGIYHGKLIFPKEYPYKPPGVLMLTPNGRFKPNRRLCLSMSDFHPESWNPMWSISTILTGLYSFM Ptri1000001394 Phaeodactylum_tricornutum ---ATDICTRRLTKELRALQ-KDPIRDPKITVAPNESNLLEM----------------HYVIEG-SKDT-EYEGGVYHGKLIFPREYPLKPPGVMMITPSGRFQPGRRLCLSMSDFHPESWNPMWSVSTILTGLYSFM Pram1000009035 Phytophthora_ramorum ---ASAMATKRLRKEYLSMQ-RKPVDYIQA--VPVETNILEW----------------HYVITG-TKGT-PYEGGYYHGKLKFPPEYPMKPPAVMMITPNGRFKTNQRLCLSMSDFHPETWNPMWSVSSILTGLYSFM Psoj1000000332 Phytophthora_sojae -------ATKRLRKEYLAMQ-RKPVDYIQA--VPVESNILEW----------------HYVITG-TKGT-PYEGGFYHGKLKFPPEYPMKPPSVMMITPNGRFKTNQRLCLSMSDFHPETWNPMWSVSSILTGLYSFM consensus/100% .............................h...P...Nlh.h................+..l.G.s..s..h.sG.a..pl.Fs.-aPhpPP.l.h......hp.s..................h.lsp......... consensus/95% .......s..Rhpbp...h..........h...P..pNIhEW................+..l.G.s..o.sY.sG.aa.pl.Fs.-aPhpPP.l.h.T....ap.N..lCLs..p.....WsP.hslspll.ulhShh consensus/90% .......u..Rlpb-h..h....P...h.h...P..sNIhEW................+..l.G.s..o.sYpGG.aa.pl.Fs.-aPh+PP.l.hbT....acsN..lCLs..p.....WsP.holuplL.ulhShh consensus/85% .......uspRlpb-h..h..psP...h.h...P..sNIhEW................+.sl.G.s..o.sY-GG.aa.pl.Fs.-aPaKPP.l.hbT....a+sN..lCLs..p.....WsPhholuplL.ulhShh consensus/80% .....spuspRlpb-h..lp.csP...hpA...P..sNIhEW................+.sl.G.P..o.sYEGGhaa.cl.Fsp-aPFKPP.l.hbT....a+sN..lCLs..p.....WsPuholuplL.ulhShh consensus/75% .....spuspRlpb-h..lp.csP...hpA...P..sNIhEW................+.sl.G.P..o.sYEGGhaa.cl.Fsp-aPFKPP.l.hbT....a+sN..lCLs..p.....WsPuholSplL.ulhShh consensus/70% .p...ssAspRLpb-hhblp.csPs..lsA...Pb.sNILEW................HYllbG.P..o.PYEGGhYHGcL.FPp-aPFKPPul.hbTs...F+sN.+lCLS..sD.P-sWNPAWoVSoILsGLhSFh