The DOMON domains are involved in heme and sugar recognition

Lakshminarayan M. Iyer, Vivek Anantharaman and L. Aravind*

SUPPLEMENTARY material S4- Figure 2


 

MULTIPLE SEQUENCE ALIGNMENT OF THE DM13 DOMAIN

 

Secondary Structure                            ------------------EEEEEEE------EEEEEE-------------EEEEEE------------EEEE-----EEE

WH5701_02499_Syn_87301508                  35  GMF------QKAEAP-VSGSFMIKTEAGKQ-VLVLSSDFKTND---SAPDLKVAFSPSAKPLAMSKPPKYELKAGSYTVL

MED121_19319_Msp_87121730                  29  GELEADFS-SWGSDA-VEGDWRISERGGKQ-YIQLLDNFEAK----EGPDVKIFLSKQDAASVNG-----NNATDDAVFI

SAV0708_Saur_15923698                      54  GTFS-----SKNGET-VEGKAEIKN------GKLMLTNYKSS----KGPDLYVYLTKNG-----------DIKNGKEIAM

Tery_2225_Tery_113475866                  104  GKFVT----VTRGEN-TKGRAEIIRVGGKT-QVRLGENFSTM----VGPAVKLVLHKQSRPESYT--------GENYVPL

SAR0761_Saur_49482964                      54  GTFS-----SKNGET-VEGKAEIKN------GKLMLTNYKSS----KGPDLYVYLTKNG-----------DIKNGKEIAM

Dpse\GA17202_Dpse_54637920                149  PRPQKISAL-RGVHG-VSSENIVIVDA----QTLLVPNFSYDG---EAPDAKFWVGRGQRPTSEG-----LRIPDENGKE

LOC579039_Spur_115695230                   93  GALGFS----PLVHR-TSAAAVVVLDP----KKIRFEDLNYDG---FGPDAYFWVGPGDTPSYDE----DYRIPDETGSL

Npun02007944_Npun_53686863                 49  GTF------VSGEQK-TQGKVNITTKDGKS-FLELDESFKTSE---SGPDLVVILHRSDNVINSTKPPSYPLKNGDYFII

VV2_0813_Vvul_27367221                     60  GEFSRDRADSDFLHW-GEGKLVIDAQ-----SIAFLGELA------PGPDYKLYLSPTFIETEAD----FLANKEQLVRV

PTD2_04781_Ptun_88861382                   53  AKFERHLQDSDFLHF-GEGEVFISQH-----AIAFVGTLA------PGPDYQLYLSPEFVETELD----FNRLKAQMVQV

SKA34_10313_Psp_89075712                   53  GVFKKDLAGSDFLHW-GEGKVSISTT-----KITLMGKLA------PGPDYKLYLSPRFVETEAE----FNQEKADMVLV

VAS14_13564_Vang_90578082                  53  AEFKRDLAGSDFLHW-GEGKVSISPT-----KITLMGKLA------PGPDYKLYLSSKFVETETE----FNQEKTNMVLV

SwooDRAFT_1484_Swoo_118072177              53  GQFSRERADSDLLHW-GEGEVSISAD-----SIAFIGSLA------PGPDYRLYLSPKFVDTESG----FNRLKPSMVQV

GP2143_05890_Mgam_119476097                53  GVFSRDRTDSDTLHW-GEGDVAIGVS-----SIAFVGSLA------PGPDYKLYLSPEYVETESA----FIRLKPSMVQV

AHA_0124_Ahyd_117618213                    53  GTFRRDLQDSDRLHW-GEGQVSIGPD-----AISLMGKLA------PGPDYQLYLSPVFVETEAD----FARLKEKMVKI

Bpro_4776_Psp_91790601                     51  GQFRRDLKDSDPLHW-GEGTVSVSPQ-----SIALSGKLA------PGPDYKLYLSPEFVETEAD----FARLKARMTRV

Rgel02001255_Rgel_47574394                 50  AQFRRELKDSDALHW-GEGMVFVGPR-----SISLLGRLA------PGPDYKLYLSPEFVETEAD----FQRLKPQMLRV

_Apre_3056881                              85  GGF------VTQEHE-TRGTATVLRNGAA--RVLRLEGFSTS----DGPDLHIWLSDATAGGEWG-----KYDDGRYLPL

Bcep18194_B2454_Bsp_78063301               52  SRFERDLPGSDALHW-ADGKLTITDS-----SIAFEGDIA------PGPDYKIYLVPEFVDTRAS----FLAIKARAARV

BURPS1710b_A0229_Bpse_76818639             51  GRFERSLTRGDPLHW-TDGKLSVTRA-----ALAFEGRVA------PGPDYKLYLVPGFVASKEA----FLAAKPHARRV

BMAA1071_Bmal_53716672                     51  GRFERSLTRGDPLHW-TDGKLSVTRA-----ALAFEGRVA------PGPDYKLYLVPGFVASKEA----FLAAKPHARRV

BTH_II1180_Btha_83716796                   51  GRFERGLTGGDPLHW-TDGKLSVTRA-----ALAFEGRVA------PGPDYKIYLVPEFVASKQA----FLDVKSRARRV

Dpse\GA17202_Dpse_54637920                 34  GTKIGAL--TRLHHG-VSGDVYAVDS-----RTIFIKKFSYDG---EAPAAYFYVGNTARPSNEG----ATRLRDERGGT

VVA1277_Vvul_37676937                      67  GEFSRDRADSDFLHW-GEGKLVIDAQ-----SIAFSGELA------PGPDYKLYLSPTFIETEAD----FLANKERLVRV

MED92_12219_Osp_89095454                   51  GTFTKDLPGSDFLHW-GEGKLSLSQN-----KIVFEGELA------PGPDYKLYLSPVFVDDEAS----FEASKAKMVQL

y4083_Ypes_22127952                        53  GEFSRNQKGSDAVHW-AEGKLYVSEH-----EIAFEGEIA------PGPDYKIYLTKTQADDKES----FLKIKDDAKLI

CPS_0108_Cpsy_71281863                    208  GFF------STFAHN-VSGKATIIDD-----CTIEISQFSYDG---GGPDVYFYGAIDHEYTSVD------AFPMGQKLN

BCE_G9241_pBC218_0022_Bcer_47565260        59  GQF------QNGTHE-TTGTAKIHQFADGK-RVLRLSDFATS----NGPDVRVVLVPTNHLKNNE-----DVKNYKYIEL

BcerKBAB4DRAFT_5400_Bwei_89204057          61  GQF------QNGVHE-TTGTATIHQLADGK-RILRLSNFSTS----NGPDVRVVLVPTDRLKNNE-----DVKNHQYIEL

h16_B2146_Reut_116696082                   56  GTFRKDLAGSDALHW-ASGQLHVSPQ-----ALAFEGSVA------PGPDYRIYLAPEFVDNKEA----FLRIKDRSLQV

StropDRAFT_2984_Stro_113943736             94  GEF------ITHEHN-TSGTARIIRTAD---GTLRLELVGLATS--NGPDLRVWLTDQPVRTGEAGW--HVFDDGHWVEL

V12B01_16186_Vspl_84385226                 50  GEFTKDRQDSDFLHW-GEGIVSVSES-----AIAFEGELA------PGPDYKVYLSPKFIETEQA----FNDSKSELLRV

MED222_03785_Vsp_86145746                  50  GEFTKDRQDSDFLHW-GEGIVSVSES-----AIAFEGELA------PGPDYKVYLSPKFIETEQA----FNDSKSELLRV

DIP1688_Cdip_38234259                      79  GQF------ISHEHE-TSGTATLIVDADGK-KKLVLTDLATS----NGPDVHVWLSKAPVIPGKDGW--FVAKDHEHFDV

ELI_07270_Elit_85374281                     9  GQFRG----ADSSHP-ASGGVEIVKLKGGGLGVKLKGDFKVR----GGPDLRVWLSDASNPRNGR-----AVRSKDYVDL

CENSYa_0388_Csym_118194106                174  GSFA-----GLAGHA-ADGDAKILDVSGT--RFLRFENFEVT----NGPDLRVYLAPDGSVSQGI-------------EL

FP2506_10116_Fpel_114705286                30  GRFT-----GAGGHV-ASGTVTVVTENGRT-RVVFGNDFSLD----SAPDAYVAFGSARSYAEGT-------------AF

SPO3881_Sipo_56698692                      31  GHFE-----GRSNHV-TSGSVKLVKDGERY-VVELGDDFSLD----GGPDPRVAFGKDGKYDPDT-------------KL

pFBAOT6.58_Apun_51492571                   51  GRLARDLKGSDLLHW-GEGEIRVSRD-----RIAHIGRLA------PGPDYKLYLAPRFVDTKEA----FLLIKDRSVRV

RS9917_09446_Syn_87123823                  48  GRF------LAAEHP-AAGQVHLERRNGNA-TLVFSQDFRTTD---QAPDLFVVVSPMAMPLENSPAPAYPLTPGTFHVV

alr1534_Ana_17229026                       41  GNF------QAGEHP-TQGQVSVVKEAGKN-YLEFDRNFKTD----QGPDLYVILYRFEKPPISG------IKEKDYVSI

Shewana3_4340_Ssp_117676277                53  GEFKKDQKGSDALHW-ANGQLFVTDN-----EIVFKGDVA------PGPDYKIYLTKKQATDKES----FLDIKKEAILI

LV102a_Kpne_38639590                       57  GDFKRNQKGSDVLHW-AEGELYVTDN-----EIAFKGEVA------PGPDYKIYLTKKQAVDKSS----FLEMKKEAVLI

YPO4064_Ypes_16124178                      45  GEFSRNQKGSDAVHW-AEGKLYVSEH-----EIAFEGEIA------PGPDYKIYLTKTQADDKES----FLKIKDDAKLI

YpesA_01002441_Ypes_77634206               49  GEFSRNQKGSDAVHW-AEGKLYVSEH-----EIAFEGEIA------PGPDYKIYLTKTQADDKES----FLKIKDDAKLI

L8106_15844_Lsp_119494581                  51  GSFVT----VEQDHP-TEGTARIVNENGKR-YLEFDSAFTTA----QGPDVNIIFHQKNSVPVNL-------KEGEYITL

HaurDRAFT_3840_Haur_113938194             124  GSF------RGIDHK-SGGIATLYQQPDGS-NLLRLSDFFVE----AGPDMYIFVAKAAEINQPS------DLQAGYLEL

CwatDRAFT_1309_Cwat_67924807               35  GSFI-----GQGGHT-VSGEFQIIKKGEIH-YLVLQDNFKFDGA--PDPKIGFSLNDEF--------------SEDTLFS

SAML0205_Samb_91199778                     79  GGL------ISHEHA-TSGTVKLVRLTDGS-HVVRLENLDTS----NGPDLHVWLTDAPVKEGTAGW--HVFDDGKYVSL

SCO0153_Scoe_21218710                      85  GEL------VSHEHS-TSGTAQLVRLTDGS-HVVRLENLDTS----NGPDVHVWLTDAPVKEGKAGW--HLFDDGEYVDL

_Smob_28864193                             87  GTL------ISHEHT-TSGTVRVLRLPDGS-RTLRLEGLDTS----NGPDVKVLLSDAPVKPGRAGW--HVFDDGAHVSL

Sde_3314_Sdeg_90022954                    232  AEF------NTFAYG-VAGTVTVVND-----CTLLFTRFSYTGG--GLPDVYFYTGPNGNFNQGS-----GFGPNLYG--

gll1172_Gvio_37520741                     132  ETPPLSLVSVDAKTT-GTGRIVVENGKS---YVQLSEDFTIG----EAPAIRMTLYKDAAPPKKT------YDPKKYVDL

MGP2080_04650_Mgam_119504874               66  TEFNRQRKDSDALHW-GEGQLRIYAD-----AIIFEGNLA------PGPDYRLYLAPKFVETEAD----FLAVKEQSVEI

Dpse\GA11661_Dpse_54638383                 35  KPRVLPEFK-RLAHGLRSSNISVLDA-----KTFYIPNLHYDG---AGPDAYFWVGNGSEPNIMG-----IKVPNEVGSL

Jann_4076_Jsp_89056567                     31  AIASGQFQD-TGPRYQGSGSAVIAQTSDGQ-TVLQFGDFSVT----PGPDLEVWLVEADAPTSAA-----AVLASSYISL

AgaP_ENSANGG00000002671_Agam_118777513      4  GREIGHL--TNFGHG-IKGQVFAVDE-----STLFVKGFAYDG---NAPDAFFWVGNSPRPSPEGF---IIPYPEEYSGR

CG32922_Dmel_33589346                      34  GTKIGAL--TRLHHG-VSGDVYAVDS-----RTIFIKKFNYDG---EAPAAYFYVGNTARPSNEG----AARLRDERGGT

CG14681_Dmel_78706736                      34  GTKIGAL--TRLHHG-VSGDVYAVDS-----RTIFIKKFNYDG---EAPAAYFYVGNTARPSNEGA----ARLRDERGGT

LOC657143_Tcas_91093736                    56  GKFIGKL--SELHHG-VSGEVYAVDG-----RTLYLKDFTYDG---QGPAAYFYASTSRNANSAG-----FRLRDENGSP

CG14681_Dmel_78706738                      34  GTKIGAL--TRLHHG-VSGDVYAVDS-----RTIFIKKFNYDG---EAPAAYFYVGNTARPSNEGA----ARLRDERGGT

LOC579354_Spur_115712036                   28  GKLVGTFT-DKSTHD-IAGTVYAEDE-----TTLRIIGFRYDG---AGPDAFIWAGESGVPSDDG-----FIIPDEEGRT

LOC759839_Spur_115740301                   10  GKLVGTFT-DKSTHD-IAGTVYAEDE-----TTLRIIGFRYDG---AGPDAFIWAGESGVPSDDG-----FIIPDEEGRT

LOC575280_Spur_115894476                   29  GEFIQQGDTPATHHN-VNGTVYAVDD-----DTLQIIGFTFDG---KAPDTFFFIGTSGTPSGNS------YVIPKAIGL

AgaP_ENSANGG00000007940_Agam_58384351      33  GTKIGDL--SELHHG-VSGSVYAVDA-----RTLFLKNFNYDG---EGPAAYFYVGNTRAPSNKG----AHRLRDERGRA

LOC756575_Spur_115749019                   12  GKLVGTFT-DKSSHD-IAGTVYAEDD-----TTLRIIGFRYDG---AGPVAYFWVGPGGTPNNNGD----YAIPDETGST

LOC661990_Tcas_91089939                    57  GRLIGPL--QEFAHG-IKGTVYAVDE-----STIFIKGFSYDG---TGPDAFFWIGNSPRPSPEGI---IIPYPENYAGR

LOC766813_Spur_115616851                   17  GEFIQQGDTPATHHN-VSGTVYAVDD-----DTLQIIGFTFDG---KAPDTFFFIGTTGTPSGNG------YVIPKAIGL

AgaP_ENSANGG00000012956_Agam_58375727       4  GKYIGKF--NSYHHQ-ASGDVYAVDE-----YTFLLTGFNYDG---NGIDTFFWSGASNRPGPQG-----FIVPDEYGKT

AgaP_ENSANGG00000003130_Agam_118784555      4  GKYIGKF--NSYHHQ-ASGDVYAVDE-----YTFLLTGFNYDG---NGIDTFFWSGASNRPGPQG-----FIVPDEYGKT

knk_Dmel_21356127                          43  GKYLGKL--NSYHHQ-VSGDVYAVNE-----YTFLIVGFNYDG---NGADTFFWSGASNRPGPQG-----FIVPDEYGKT

W01C8.5_Cele_17570215                     148  SVKLSTEFS-GKRYQLRSGPLYVIDR-----RTIKVYGFTFEGN--KAPKTYFYAGRGASVSYSSGVKVAIRGKDEKEIS

CBG14733_Cbri_39598096                    146  SVALNSEFT-GKRYKLRSGPLYVIDR-----RTIKVYGFTFEGN--KAPKTYFYVGRGASVSYSLGVKVAIRGKDEKEIS

CBG07437_Cbri_39596518                    230  FESEPDMGL-FGEYGIISDPIEVIDS-----RTLRIPRFSYKAS--QTPDGYFFAGAGKDIDTKTGEKAVIVGKDLSVNT

H06A10.1_Cele_17568459                    148  FESEADMGL-FGEYGIISDPIEVIDS-----RTLKIPKFSYKAS--QTPDGYFFAGAGSEIDQKSGKKAAILRSDQTLNY

LOC661990_Tcas_91089939                   176  PRVLPEFK--RLAHGLRSDNISILDA-----KTFYIPNLHYDG---AGPDAYFWVGNGSEPSPFG-----IKVPNEMGSL

AgaP_ENSANGG00000002671_Agam_118777513    123  PRVLPEFK--RLAHGLRSSNISILDA-----KTFYIPNLHYDG---AGPDAYFWVGNGSEPNIMG-----TKVPNELGSL

CG12492_Dmel_24649429                      35  KPRVLPEFK-RLAHGLRSSNISVLDA-----KTFYIPNLHYDG---AGPDAYFWVGNGSEPNIMG-----IKVPNESGSL

AgaP_ENSANGG00000007940_Agam_58384351     149  PTKIAGL---SGVHDVSSDNIVIVDA-----QTLLVPNFSYDG---EAPDAKFWVGRGPAPTSQG-----IRIPDENGKE

CG32922_Dmel_33589346                     149  PRPQKISAL-RGVHGVSSDNIVIVDA-----QTLLVPNFSYDG---EAPDAKFWVGRGQRPTSDG-----LRIPDENGKE

CG14681_Dmel_78706738                     149  PRPQKISAL-RGVHGVSSDNIVIVDA-----QTLLVPNFSYDG---EAPDAKFWVGRGQRPTSDG-----LRIPDENGKE

LOC575280_Spur_115894476                  154  GRLGLS----PVVHGVTAEDVIILNS-----KLIRFVGLNYDG---RGPDAYFWTDTVTNPTTSG-----MRVPQDGERT

LOC766813_Spur_115616851                  142  GRLGLS----PVVHGVTAEDVIILNS-----KLIRFVGLNYDG---RGPDAYFWTDIVTNPTTNG-----MRVPQDGERT

LOC763595_Spur_115784194                   21  GQL-------SGSYGLSATSVVIVND-----QKLRFEGLQYDG---SCSGARFWAGTGSTPSSSG-----HFVRNEHNSD

LOC578503_Spur_115753700                   54  GFGPALGFS-PRVHGTSATAVVVLDP-----KKIRFENLNYDG---LGPDAYFWVGPGDTPNNDGD----YKIPDETGSL

LOC657143_Tcas_91093736                   169  PKPQKIEPL-KGVHAISSDPIVIVDA-----QTLLVPNLSYDG---EAPDAKFWVGRGPKPSPQG-----IRVPDENGKM

LOC759839_Spur_115740301                  130  GALGFA----PRVHNTYASAVIILNA-----KQIRVENLVYDG---QGPRAYFWVGPGGTPNNDGD----YEIPDETGST

LOC579354_Spur_115712036                  148  GTLGFT----PRVHNTYASAVIILNA-----KQIRVENLVYDG---QGPRAFFWVGPEGTPNNNGD----YAIPDETGST

knk_Dmel_21356127                         162  GTFS------KRSHNVSSSSVEILDS-----KTIRIKDFTYDG---RGKRTFFWTGVGPQPSSRG-----SKLPDERGYL

AT5G54830_Atha_15239759                    32  SLIGHESEF-KMLQHQLRGVFTVVDD-----CSFRVSRFDML----SGSEVHWWGAMSSDFDNMT--------NDGFVIS

AgaP_ENSANGG00000012956_Agam_58375727     123  GSLTAI----AGGPMVSSDTIDILDS-----KTIRITDFSYDGKAGGQGAVHFWVGVGPSPSSKG-----SKVPDEMGYL

AgaP_ENSANGG00000003130_Agam_118784555    123  GSLTAI----AGGPMVSSDTIDILDS-----KTIRITDFSYDGKAGGQGAVHFWVGVGPSPSSKG-----SKVPDEMGYL

W01C8.5_Cele_17570215                      31  GVYVGDL---SSPETDISGQVFIVNA-----TTLQIFNLTFSP---SQQDLYFWLDTKDVPTREG-----IKAHTFEYGI

Consensus/80%                                  u.h.......p..a..spuph.h.p........hb..ph........uPDhbhhhs.....s..s..........p....

 

Secondary Structure                      EEE---EE----EEEEE-------------EEEEEEEE------EE---

WH5701_02499_Syn_87301508                APL---KSSKGAQRYVIPASI---DLSKQKSVLIWCQKFNATMAWAPIQ   146\Prokaryotic DM13

MED121_19319_Msp_87121730                SLV---STFEGGSEYEIPAGI---NLNSFKSLIFHCEAYSKLWGTSQIR   139| homologs

SAV0708_Saur_15923698                    VD-----YDKEKQTFDL-KNV---DLSKYDEVTIYCKKAHVIFGGAKLK   146|

Tery_2225_Tery_113475866                 GDL---KSFTGEQVYNIPEGI---SWQDFPNVVVWCERFDVTFGSAKLN   208|

SAR0761_Saur_49482964                    VD-----YDKEKQTFDL-KNV---DLSKYDEVTIYCKKAHVIFGGAKLK   146|

Dpse\GA17202_Dpse_54637920               NPLR--RYERKTIVLTLPEDL---TIFDIGHFGVWCEAFTVDFGHVRLP   258|

LOC579039_Spur_115695230                 QVIR--SYSGATVTVTLIDNI---TIADIGHIGLWCVRFRQDFGHVDIP   200|

Npun02007944_Npun_53686863               APL---QKYSGAQTYSIPQNI---NLADYKSAAIWCRKFNATFGAASLS   160|

VV2_0813_Vvul_27367221                   GD----VKTFDRFILPIPSDV---DVANYTTAIVWCETFSQFITSAKYR   165|

PTD2_04781_Ptun_88861382                 GA----VKTFNNFLVALPAHI---DPTHYNTVIVWCDSFNQFITSAQYQ   158|

SKA34_10313_Psp_89075712                 GN----VKTFDNFVVNVPQNI---DIAKYDTVIVWCESFGQFITSAKYQ   158|

VAS14_13564_Vang_90578082                GN----VKTFDNFVVNVPQNI---DIAKYDTVIVWCESFGQFITSAKYK   158|

SwooDRAFT_1484_Swoo_118072177            GS----VNTFDNFIVTIPDDV---DPSHYNTLVVWCESFGQFITSGTYE   158|

GP2143_05890_Mgam_119476097              GD----VKTFDNFLVTMPAGV---DVSQYNAVIIWCETFGQFITSARYQ   158|

AHA_0124_Ahyd_117618213                  GP----VKTFDNFIVPLPADV---NPADYDSVIIWCETFGQFITAARYR   158|

Bpro_4776_Psp_91790601                   GD----VKTFENFVVRVPESV---DISRYNTVIVWCESFSQFITAARYR   156|

Rgel02001255_Rgel_47574394               GD----VKTFENFLVPVPQGV---DPSRFNTVIVWCEAFGQFITAAKYR   155|

_Apre_3056881                            GPM---KATDGNQNYPIPEDA---DLSGLRSVVVWCDRFNVAFGSAPLD   189|

Bcep18194_B2454_Bsp_78063301             AD----LKTFGNFVVPLPADV---NPAQYTSVVIWCERFSKFISAARYR   157|

BURPS1710b_A0229_Bpse_76818639           GE----LKTFGDFVAPLAADV---DIDAYTTVVIWCERFSQFIGAAQYR   156|

BMAA1071_Bmal_53716672                   GE----LKTFGDFVAPLAADV---DIDAYTTVVIWCERFSQFIGAAQYR   156|

BTH_II1180_Btha_83716796                 GE----LKTFGDFVVPLAEDV---DVDAYTTVVIWCERFSQFIGAARYR   156|

Dpse\GA17202_Dpse_54637920               ASLT-RRYRNKDITLSLPEGK---TLRDIKWFSVWCDEFAVNFGDVAIP   143|

VVA1277_Vvul_37676937                    GD----VKTFDRFILPIPSDV---DVANYTTAIVWCETFSQFITSAKYR   172|

MED92_12219_Osp_89095454                 SD----VKTFNGFVVNVPEGI---DVGQYTTAVVWCETFAEFISAAKYQ   156|

y4083_Ypes_22127952                      GD----LKNFGNFKKTLPAGV---NPDDYTTVQIWCETFSQFIGSASYK   158|

CPS_0108_Cpsy_71281863                   GK----IYNNARIFIKLPQNK---SLDDLNGLSVWCTEFEANFGQVEFT   308|

BCE_G9241_pBC218_0022_Bcer_47565260      GN---LKGNKGDQNYEIPEGV---DVSEYGSVSVWCKRFNENFGAVYFN   164|

BcerKBAB4DRAFT_5400_Bwei_89204057        GK---LKGNKGNQNYEIPEGI---DVSTYGSVSVWCKRFNENFGAAYFK   166|

h16_B2146_Reut_116696082                 GE----LKTFGNFTVALPAGV---DPARYGAVVIWCERFGQFIGAAGYR   161|

StropDRAFT_2984_Stro_113943736           GR---LKGNRGDQAYDIPAEV---DLDRLTSVSIWCKRFAVSFGAAPLA   202|

V12B01_16186_Vspl_84385226               GD----VKTFDRFMVELPEGT---DLNRFNTVVIWCETFGQFITSAKIK   155|

MED222_03785_Vsp_86145746                GD----VKTFDRFMVNLPEGT---DLNRFNTVVIWCETFGQFITSAKIK   155|

DIP1688_Cdip_38234259                    AP---IKGNIGNQVYDLPDDV---NFDEWTSVVLWCDDFNVSFGAAELS   187|

ELI_07270_Elit_85374281                  GR---LKSSSGEQIYRIPASA---DVADANSVVIWCRAFGVFFGSATLK   117|

CENSYa_0388_Csym_118194106               GK---LKGSRGSQNYNI-DGI---DTDVYNTVVIYCQPFRVYFGEAQLF   270|

FP2506_10116_Fpel_114705286              AE---LKSLSGGQTYSAPASI---DGSDYDAVWLWCRRFSVPLGVARLR   128|

SPO3881_Sipo_56698692                    GA---LLSLTGKQRYAVPPTW---DVSAYNEVYIWCDVAGVPLGVAKIN   129|

pFBAOT6.58_Apun_51492571                 GD----VKTFNGFIVEVPAGI---DVRDYNTVVIWCEAFDQFISAAEYQ   156|

RS9917_09446_Syn_87123823                AP---LKSTRGSQRYGLPAYL---KTSEQRSVLIWCRQYNATMSWAQLE   159|

alr1534_Ana_17229026                     AQ---LQKITGNQRYALPNNV---NLQEFKSVAIWCRKFNATFGYAIL-   144|

Shewana3_4340_Ssp_117676277              GE----LKNFGNFKKTIPASV---NVNEFTTVQIWCEHFSKFIGSAKYQ   158|

LV102a_Kpne_38639590                     GE----LKNFGNFRKSIPDSV---NVNEFTTVQIWCERFSKFIGSAEYR   162|

YPO4064_Ypes_16124178                    GD----LKNFGNFKKTLPAGV---NPDDYTTVQIWCETFSQFIGSASYK   150|

YpesA_01002441_Ypes_77634206             GD----LKNFGNFKKTLPAGV---NPDDYTTVQIWCETFSQFIGSASYK   154|

L8106_15844_Lsp_119494581                TS---LQSFEGSQRYLLPDNL---DLSQYKSVGIWCRKFNVTFGYASL-   155|

HaurDRAFT_3840_Haur_113938194            GK---LKGSEGNQNYSLPADF---DPALYANVVIWCEKYQVLMAVAPIQ   228|

CwatDRAFT_1309_Cwat_67924807             GL----NLDQGKQIYRLPFDF---NPDKYNEVTLWCDKFNADLAEAKY-   132|

SAML0205_Samb_91199778                   GK---LKGNKGSQNYRVPGDV---DPTRYTSVSIWCDRFDVSFGAAELA   187|

SCO0153_Scoe_21218710                    GK---LKGNKGSQNYDVPADV---DPSRYTSVSVWCDRFNVSFGAAELA   193|

_Smob_28864193                           GS---LKGNKGDQNYALPRDL---DLDRYTSVSIWCDRFDVSFGASALM   195|

Sde_3314_Sdeg_90022954                   -----QTFETTTHIQTISSS----ALNELDAISVWCVQAGVDFGSGVFN   330|

gll1172_Gvio_37520741                    G---ALVSRKGDQRYAIPTNL---NVKDYGSIAIWCKKFDVTLAYAKID   240|

Jann_4076_Jsp_89056567                   GP---LQSATGGQTYGIPSDI---DASAYGSVVIWCEDFSVLFSVAPLR   142|

MGP2080_04650_Mgam_119504874             GA----VRNFDGFVLSHSPAI---NAAQYTTAVVWCETFGQFITSGQYR   171/

Dpse\GA11661_Dpse_54638383               EP--LRGYQGEDIEIQLPGSM---TVYDIDWLAVWCVEYRHNFGHVYIP   144

AgaP_ENSANGG00000002671_Agam_118777513   EPPVLQQHTNTDIILKLPMGK---RIRDIRWLSVWCRRFTVDFGEIFIP   115

CG32922_Dmel_33589346                    ASL-TRRYRNKDVTLSLPEGK---TLRDIKWFSVWCDEFAVNFSDVSIP   143

CG14681_Dmel_78706736                    ASL-TRRYRNKDVTLSLPEGK---TLRDIKWFSVWCDEFAVNFGDVSIP   143

LOC657143_Tcas_91093736                  EV--IRRYRKEGVTLTLPEGK---TLNNIKIFYVWCEEFEVNFGDVKIP   163

CG14681_Dmel_78706738                    ASL-TRRYRNKDVTLSLPEGK---TLRDIKWFSVWCDEFAVNFGDVSIP   143

LOC579354_Spur_115712036                 VK--LEAYDNVDIRVTLPAGK---TVSSLAWISVWCREFGANFGDLTVP   136

LOC759839_Spur_115740301                 VK--LEAYDNVNISVTLPAGK---TVSSLAWISVWCRQFAANFGDLTVP   118

LOC575280_Spur_115894476                 GSEKLGAYSNEDLTIHLDTSEGPGSLSDYLWISVWCKQAGANFADVTIP   142

AgaP_ENSANGG00000007940_Agam_58384351    GV--LRRYRNEDITLSLPEGK---TLRDIRWFSVWCDDFSVNFGDVQIR   141

LOC756575_Spur_115749019                 GV--ISAYTGETITLTFPEGT---DIFDIGHFGLWCVQANQDFGHVDIP   121

LOC661990_Tcas_91089939                  EPPVLRAYNNTNIILRLPMGK---RIRDIRWLSVWCRRFTVDFGEVFIP   168

LOC766813_Spur_115616851                 GSEKLGAYSNEDLTIHLDTSEGPGSLSDYLWISVWCKQAGANFADVTIP   130

AgaP_ENSANGG00000012956_Agam_58375727    NI--LERYFNKDFTLRLPDNK---KITEVKWLAVYDLNSQNNFADVYIP   111

AgaP_ENSANGG00000003130_Agam_118784555   NI--LERYFNKDFTLRLPDNK---KITEVKWLAVYDLNSQNNFADVYIP   111

knk_Dmel_21356127                        NI--LDRYHNKDFTLTLPDRK---KITEIKWLAVYDLSSQNNFGDVYIP   150

W01C8.5_Cele_17570215                    EIS-ENYRGGKDIILELPENY---DIFHIDWISVYCYKYRVNFGSVLVP   264

CBG14733_Cbri_39598096                   EIS-ENYRGGKDIILELPENY---DIFHIDWISVYCYKYRVNFGSVLVP   262

CBG07437_Cbri_39596518                   CPM-LKDITDETMIVRLDPSQ---TIYDIEWISVFCYKYSHDFGHLDIG   346

H06A10.1_Cele_17568459                   CPM-LKDITDQDIIIRLDQSQ---TIYDIEWISVFCYKYSHDFGHLDMG   264

LOC661990_Tcas_91089939                  DP--LRGYQGEDIEIQLPGSI---TVYDIDWLAVWCVQYRHNFGHVIIP   284

AgaP_ENSANGG00000002671_Agam_118777513   DP--LRGYQGEDIEIQLPGNL---TVYDIDWLAVWCVEYRHNFGHVYIP   231

CG12492_Dmel_24649429                    EP--LRGYQGEDIEIQLPESL---TVYDIDWLAVWCVEYRHNFGHVYIP   144

AgaP_ENSANGG00000007940_Agam_58384351    TP--LRRYDKKTIVLTLPGDL---TVFDIGHFGVWCEAFTVDFGHVRIP   256

CG32922_Dmel_33589346                    NP--LRRYERKTIVLTLPEDL---TIFDIGHFGVWCEAFTVDFGHVRLP   258

CG14681_Dmel_78706738                    NP--LRRYERKTIVLTLPEDL---TIFDIGHFGVWCEAFTVDFGHVRLP   258

LOC575280_Spur_115894476                 ASKLNQPLSGVTRTVRLPG-----DVFELRSIGLWCVLARQNFGHVVIP   260

LOC766813_Spur_115616851                 ASKLNQRLSGVTRTVRLPG-----DVFDKRSIGLWCVLARQNFGHVVIP   248

LOC763595_Spur_115784194                 DP-LVAYSGDSDIELILPDGE---SVFEIGYIGVWCQ--GADVGHVDIP   123

LOC578503_Spur_115753700                 QI--IKSYSGVNVTVTLLDNI---TVADIGHIGLWCVLFTEDFGHVDIP   164

LOC657143_Tcas_91093736                  EP--LRRYDRKTVVLTLPGDL---TVFDIGHFGIWCEAFTVDFGHVQIP   278

LOC759839_Spur_115740301                 GV--INPYNGETITLTFPEGT---DIFDIGHFGLWCIRFTQDFGHVDIP   237

LOC579354_Spur_115712036                 GV--ISAYNGETITLTFPEGT---DIFDIGHFGLWCIAFTQDFGHVDIP   255

knk_Dmel_21356127                        DP--IRQYNKETIELELPGDK---TIFDIDWISVYDVADNENYGHVLFN   266

AT5G54830_Atha_15239759                  DQKLNQTFKNSSFIVRLLGNV---TWDKLGVVSVWDLPTASDFGHVLLS   139

AgaP_ENSANGG00000012956_Agam_58375727    DP--IRAYDRETITLELPGDM---TIFDIDWFSVYDVEARRDYGSILIS   232

AgaP_ENSANGG00000003130_Agam_118784555   DP--IRAYDRETITLELPGDM---TIFDIDWFSVYDVEARRDYGSILIS   232

W01C8.5_Cele_17570215                    TPLGSFPEDNDRVVVHVPEKH---RIDEFKSFSIYSFKTDKSMASVVFP   140

Consensus/80%                            s.......p..sb.h.hP.s....sh.ph..hshhC.ph...hu.s.h…...

 

Supplementary Fig. 2. Multiple alignment of the DM13 domain.  Proteins are represented by their gene names, species abbreviations and gis. The coloring reflects the consensus at 80% conservation.  The consensus abbreviations and corresponding coloring scheme is as follows: a, aromatic (FYWH) and h, hydrophobic residues (ACFILMVWY), shaded yellow; s, small residues (AGSVCDN), colored green; u, tiny residues (GAS) shaded green; b, big residues (EFHIKLMQRWY), shaded grey; and p, polar residues (CDEHKNQRST) colored blue. The nearly absolutely conserved cysteine is shaded red. Species abbreviations are as follows:  Prokaryotes: Ahyd : Aeromonas hydrophila; Ana : Nostoc sp.; Apre : Actinosynnema pretiosum; Apun : Aeromonas punctata; Bcer : Bacillus cereus; Bmal : Burkholderia mallei; Bpse : Burkholderia pseudomallei; Bsp. : Burkholderia sp.; Btha : Burkholderia thailandensis; Bwei : Bacillus weihenstephanensis; Cdip : Corynebacterium diphtheriae; Cpsy : Colwellia psychrerythraea; Csym : Cenarchaeum symbiosum; Cwat : Crocosphaera watsonii; Elit : Erythrobacter litoralis; Fpel : Fulvimarina pelagi; Gvio : Gloeobacter violaceus; Haur : Herpetosiphon aurantiacus; Jsp. : Jannaschia sp.; Kpne : Klebsiella pneumoniae; Lsp. : Lyngbya sp.; Mgam : marine gamma; Msp. : Marinomonas sp.; Npun : Nostoc punctiforme; Osp. : Oceanospirillum sp.; Psp. : Photobacterium sp.; Psp. : Polaromonas sp.; Ptun : Pseudoalteromonas tunicata; Reut : Ralstonia eutropha; Rgel : Rubrivivax gelatinosus; Samb : Streptomyces ambofaciens; Saur : Staphylococcus aureus; Scoe : Streptomyces coelicolor; Sdeg : Saccharophagus degradans; Sipo : Silicibacter pomeroyi; Smob : Streptomyces mobaraensis; Ssp. : Shewanella sp.; Stro : Salinispora tropica; Swoo : Shewanella woodyi; Syn : Synechococcus sp.; Tery : Trichodesmium erythraeum; Vang : Vibrio angustum; Vsp. : Vibrio sp.; Vspl : Vibrio splendidus; Vvul : Vibrio vulnificus; Ypes : Yersinia pestis. Eukaryotes: :Agam : Anopheles gambiae; Atha : Arabidopsis thaliana; Cbri : Caenorhabditis briggsae; Cele : Caenorhabditis elegans; Dmel : Drosophila melanogaster; Dpse : Drosophila pseudoobscura; Spur : Strongylocentrotus purpuratus; Tcas : Tribolium castaneum.