Molecular genetics of ecological diversification: Duplication and rapid evolution of toxin genes of the venomous gastropod Conus

Journal List > Proc Natl Acad Sci U S A > v.96(12); Jun 8, 1999

Proc Natl Acad Sci U S A. 1999 June 8; 96(12): 6820–6823.

PMCID: PMC21999

Evolution

Molecular genetics of ecological diversification: Duplication and rapid evolution of toxin genes of the venomous gastropod Conus

Thomas F. Duda, Jr.^* and Stephen R. Palumbi

Department of Organismic and Evolutionary Biology, Biological Laboratories, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138

^*To whom reprint requests should be addressed. e-mail: tduda/at/oeb.harvard.edu.

Communicated by George N. Somero, Stanford University, Pacific Grove, CA

Received November 13, 1998; Accepted April 20, 1999.

This article has been cited by other articles in PMC.

Abstract

Predatory snails in the marine gastropod genus Conus stun prey by injecting a complex mixture of peptide neurotoxins. These conotoxins are associated with trophic diversification and block a diverse array of ion channels and neuronal receptors in prey species, but the evolutionary genesis of this functional diversity is unknown. Here we show that conotoxins with little amino acid similarity are in fact products of recently diverged loci that are rapidly evolving by strong positive selection in the vermivorous cone, Conus abbreviatus, and that the rate of conotoxin evolution is higher than that of most other known proteins. Gene duplication and diversifying selection result in the formation of functionally variable conotoxins that are linked to ecological diversification and evolutionary success of this genus.

Efforts to understand the molecular evolution of genes fundamental to complex adaptations have focused on key morphological or developmental features [e.g., Hox genes (1–2) and MADS-box genes (3–4)]. Genes that contribute to ecological diversification and the nature of evolutionary forces acting during this process are much more poorly known, partly because genes directly involved in ecological attributes are hard to identify.

Among the 500 species of the genus Conus, some hunt fishes, whereas others consume gastropods, polychaetes, or hemichordates (5). Venom is injected to stun prey and contains a wide variety of neurotoxins (“conotoxins”) that block a diversity of ion channels and neuronal receptors (6–16) in prey species. Different conotoxins are maximally effective on different prey species (6–8, 17–18); the ability to prey on these taxa among Conus species is thus linked to conotoxin evolution. To illuminate the genetic basis of ecological diversification in cone snails, we have begun to investigate the molecular evolution of conotoxins.

Members of the δ and ω conotoxin classes (“four-loop” conotoxins) have been sequenced (10, 19–20), are known to target a variety of sodium and calcium channels, respectively (10–16, 21), and have a “-C-C-CC-C-C-” cysteine “backbone” (8, 21–22) (C, cysteine; dashes refer to one to seven intercatenated amino acids of various types). These peptides are between 25 and 35 aa in length but are translated as precursor peptides of between 70 and 80 aa in which the N-terminal part of the propeptide (the “prepro” region) has been suggested to possess a signaling or processing function and that is cleaved from the toxin region during processing (19).

We examined the expression of four-loop conotoxin genes among two distantly related (unpublished data) vermivorous cone snails, Conus abbreviatus and Conus lividus to elucidate the evolution of these genes in Conus. These taxa have distinct diets; C. abbreviatus feeds primarily on eunicid and nereid polychaetes, whereas C. lividus feeds on a diversity of prey, including hemichordates and capitellid, nereid, spionid, and terebellid polychaetes (5, 23–25) Results show that the toxins from these species, particularly C. abbreviatus, comprise a rapidly duplicating multigene family that is diversifying by strong positive selection.

MATERIALS AND METHODS

Specimens of C. abbreviatus, Conus ebraeus, and C. lividus were collected from various sites around Oahu, Hawaii and kept in tanks until processing. C. abbreviatus and C. lividus are fairly distantly related, probably diverging during the Miocene, whereas C. abbreviatus and C. ebraeus are more closely related, probably diverging during the last 1.7–4.5 million years (unpublished data).

We extracted mRNA and synthesized cDNA from a portion of the venom duct of these species with variations on methods of Jakobsen et al. (26) and Lee and Vacquier (27). We amplified putative four-loop conotoxins from the cDNA of C. abbreviatus and C. ebraeus with a 5′ primer (CATCGTCAAGATGAAACTGACGTG) designed within the N terminus of the prepro region of four-loop conotoxins from published sequences (19–20) and an oligo[dT] for 40 cycles under the following conditions: 94°C for 30 seconds, 45°C for 30 seconds, 1-minute ramp to 68°C, and 68°C for 1 min. Amplification products were ligated into t-tailed pBluescript II KS(−) (28) that were then transformed into competent Escherichia coli. After growing the colonies overnight, we screened the white/positive colonies with either M13 (CATTTTGCTGCCGGTCA) or T3 (ATTAACCCTCACTAAAGGGAAC) vector primers and the 5′ conotoxin primer. Amplification products of expected sizes (200–400 bp) were sequenced. A primer (CACAGGTATGGATGACTCAGG) was then designed within the 3′ untranslated region from four-loop conotoxin sequences obtained from C. abbreviatus and C. ebraeus.

We used both conotoxin primers to amplify putative four-loop conotoxins from two individuals of C. abbreviatus and C. lividus for 40 cycles under the following conditions: 94°C for 30 seconds, 50°C for 30 seconds, and 72°C for 30 seconds. The amplification products were cloned as above, and white/positive colonies were screened with amplifications with vector primers. We sequenced at least 40 screen amplifications of expected insert size from each individual.

Sequences were aligned manually first by species and then compiled and aligned together. The conserved arrangement of the cysteine backbone aided in alignments of the diverse sequences recovered. Molecular phylograms were constructed with neighbor-joining from Kimura two-parameter distances among the sequences with mega (29). Identical sequences were represented only once. Bootstrap values were also estimated with mega from 100 replicates. Sequence groups were denoted based on existence of distinct clades and similarity of predicted amino acid sequences.

The corrected proportions of nonsynonymous substitutions per nonsynonymous site (Dn) and synonymous substitutions per synonymous site (Ds) were estimated from the sequences representative of the diversity of the predicted amino acid sequences within each sequence group. Dn and Ds were estimated from the beginning of the prepro region to the last codon before the stop codon with method 1 of Ina (ref. 30; program by T.F.D.). Significance of differences between Dn and Ds was estimated by using a one-tailed t test with infinite degrees of freedom (29). We used Hochberg’s (31) Bonferroni technique to correct significance for multiple tests. Dn and Ds were also estimated with method 1 of Ina (30) over sliding/overlapping windows of 14 codons by using a “slide” of 7 codons for all sequence comparisons (program by T.F.D.).

With sequence data from a calmodulin locus and the fossil and biogeographic record of Conus, we have estimated the rate of synonymous substitutions within the nuclear genome of this genus to be 0.63–1.8% per million years (unpublished data). We used this rate and the amount of synonymous divergence between the most similar sequences from C. abbreviatus and C. lividus to estimate the times of divergence of conotoxin genes. Rates of nonsynonymous substitutions among the toxin regions of these genes were then calculated from the times of divergence and Dn values among sequences.

RESULTS AND DISCUSSION

We report 180 sequences of four-loop conotoxins recovered from cDNA preparations (GenBank accession nos. AF089901–AF090080) that increase the known nucleotide sequence diversity of four-loop conotoxins by 30-fold and that assemble into 11 divergent groups in these species (Fig. 1). Considerably more diverse sequences were obtained from C. abbreviatus than from C. lividus. This may be because of the design of the 3′ conotoxin primer or because C. lividus has less diverse conotoxin gene families. Because of the possible bias in acquiring conotoxins from these taxa and the extreme divergence of sequences from C. lividus, we focused our analyses primarily on the evolution of four-loop conotoxins from C. abbreviatus.

Figure 1

Neighbor-joining tree reconstructed from Kimura two-parameter distances computed from comparisons of entire conotoxin sequences (including 3′ untranslated regions). Bootstrap values >50% are indicated on branches. Roman numerals (more ...)

The nine sequence groups of C. abbreviatus, in which seven include sequences from both individuals, must represent at least five loci in this species. Because of the levels of divergence among these groups (>7% nucleotide divergence) and because each individual has sequences belonging to at least seven groups, it is likely that each represents a distinct locus.

Predicted amino acid sequences of the sequences are highly variable (Table 1). Even among the closely related monophyletic set encompassing groups A3–A8, nearly every amino acid in the toxin region (excluding the structure-defining cysteine residues) has been substituted at least once. Most of the amino acid variation is nonconservative; that is, substitutions involve amino acids of different chemical classes. Moreover, there is considerable length variation among the toxin peptides, which range from 25 to 36 aa. Despite this variation, phylogenetic reconstruction of the untranslated and coding regions shows the distinct signal of gene duplication and subsequent divergence (Fig. 1). Without the evidence of similarity provided by the more slowly evolving prepro region, the recent divergence of these radically different peptides and their evolutionary relationships would never have been apparent.

Table 1

Predicted amino acid sequences representative of the diversity of conotoxins from each sequence

group

Sequences from groups A1–A8 of C. abbreviatus cluster together monophyletically and are most closely related to sequences from group L2 from C. lividus; sequences from group A9 in C. abbreviatus are more distantly related to this cluster. These relationships show that some conotoxin loci in C. abbreviatus have persisted since the separation of C. abbreviatus and C. lividus in the Miocene (unpublished data), whereas others—perhaps the bulk of four-loop conotoxins—have duplicated and diversified subsequently.

Conotoxin sequence divergence is driven by strong selection, especially the recently duplicated loci A1–A8. Dn is significantly greater than Ds in the majority of between-group pairwise comparisons (43 of 59; Table 2). All comparisons with P values ≤ 0.001 (n = 9) remain significant after correcting for multiple tests. The average Dn and Ds values across overlapping windows for all sequence comparisons between representative sequences from groups A1 toA8 (Fig. 2) show that the amino acid variation and signature of diversifying selection (Dn > Ds) is mostly restricted to the toxin part of these sequences.

Table 2

Dn:Ds ratios, significance levels, and Dn and Ds values among conotoxin

sequences

Figure 2

Sliding window analysis of average Dn and Ds estimates for all toxin sequence comparisons. Codons 1–42 primarily include the prepro region; codons 43–63 only contain toxin sequences terminating before the stop codon; because the presumed (more ...)

Moreover, the average Dn among the toxin regions of sequence groups A1–A8 (0.47, SE = 0.20) is significantly greater than that among the prepro regions (0.053, SE = 0.04). Interestingly, the average Ds among toxin regions (0.18, SE = 0.10) is also significantly greater than the average Ds among prepro regions (0.024, SE = 0.020) (see Fig. 2). This pattern is also observed in abalone lysin genes under diversifying selection (32) and may reflect the shift of nucleotide positions from synonymous to nonsynonymous sites during protein evolution. Alternatively, if most changes among conotoxin loci involve nonsynonymous substitutions, Ina’s (30) method might overestimate Ds while underestimating Dn in regions of low sequence identity because it equally weights all possible pathways for multiple step changes of a codon. In either case, toxin regions appear to be accumulating both synonymous and nonsynonymous substitutions at a greater rate than prepro regions.

Sequences from C. abbreviatus in the A1–A8 cluster and those from C. lividus in the L2 cluster show a divergence at synonymous sites of ≈12.0% in the prepro and 3′ untranslated regions. This is similar to the 12.4% divergence of calmodulin introns from these species (unpublished data) and suggests that the A1–A8 and L2 clusters diverged when C. abbreviatus and C. lividus lineages diverged. Prior rate calibrations for calmodulin show that sequences diverge at synonymous sites at a rate of 0.63–1.8% per million years in Conus—about average for eukaryotic genes (33). If we use these rates to calculate the average divergence time of the A1–A8 and L2 clusters, the result is about 6.7–19 million years. Likewise, the average time of divergence among the A1–A8 clusters is about 4.9–14 million years. From these times and the average Dn, we estimate that the rates of nonsynonymous substitutions among the toxin regions of the A1–A8 clusters average 1.7–4.8% per million years. The lower rate of 1.7% per million years (= 17 nonsynonymous substitutions per site per 10⁹ years) is five times greater than the highest nonsynonymous rate reported by Li (33) for mammals (interferon γ nonsynonymous rate = 3.1 substitutions per site per 10⁹ years) and nearly three times greater than the highest nonsynonymous rates reported for Drosophila. Conotoxins are thus diversifying at an extraordinarily high rate.

Significant differences between Dn and Ds among the nucleotide sequences of these peptides are signals of diversifying selection and adaptive evolution (34). These signals are mainly due to the divergence of sequences within the toxin and not the prepro region, implying that diversifying selection has involved toxin sequences and not the entire propeptide sequences. The considerable length variation among mature conotoxin peptides also suggests adaptive evolution for conotoxin size. Although we do not know whether these conotoxins have functional differences and it is unclear how conotoxin length affects functionality, results from other studies have demonstrated that four-loop conotoxins that differ in amino acid sequences have different specificities for particular cell channels (6–16). Thus, the rapid evolution we document here among conotoxin loci probably leads to substantial variation in venom effectiveness against particular prey types.

Our results show that conotoxin diversity is associated with an ongoing process of locus duplication and rapid divergence. Because conotoxins are intricately related to a species’ ability to paralyze its prey, the rapid adaptive evolution of these loci suggests that conotoxins are under strong selection in response to changes in the availability of or accessibility to particular prey species over time or because of a type of “arms race” between conotoxins and the cell channels and receptors of prey. Coevolution of predator and prey may generate evolutionary forces similar to those seen in host-pathogen evolution (34–38) and provide the means by which ecologically relevant genetic loci may rapidly diversify.

Acknowledgments

We thank G. C. Fiedler, D. Strang, P. S. Armstrong, and K. A. del Carmen for aid in collections of specimens. We also thank F. Cipriano, M. Hare, D. Hartl, S. Lavery, and two anonymous reviewers for input and reviews of earlier drafts of this manuscript. This work was supported by the Research Council of the University of Hawaii and Organismic and Evolutionary Biology Department of Harvard University and grants from the Conchologists of America, Hawaiian Malacological Society, Lerner-Gray Fund for Marine Research, Sigma Xi, the Western Society of Malacologists, and grants to S.R.P. from the National Science Foundation.

Footnotes

The sequences reported in this paper have been deposited in the GenBank database (accession nos. AF089901–AF090080).

References

Sharman, A C; Holland, P W H. Neth J Zool. 1996;46:46–67.

Bailey, W J; Kim, J; Wagner, G P; Ruddle, F H. Mol Biol Evol. 1997;14:843–853. [PubMed]

Purugganan, M D; Rounsley, S D; Schmidt, R J; Yanofsky, M F. Genetics. 1995;140:345–356. [PubMed]

Theissen, G; Kim, J T; Saedler, H. J Mol Evol. 1996;43:484–516. [PubMed]

Kohn, A J. Ecol Monogr. 1959;29:47–90.

Olivera, B M; Gray, W R; Zeikus, R; McIntosh, J M; Varga, J; Rivier, J; de Santos, V; Cruz, L J. Science. 1985;230:1338–1343. [PubMed]

Olivera, B M; Rivier, J; Clark, C; Ramilo, C A; Corpuz, G P; Abogadie, F C; Mena, E E; Woodward, S R; Hillyard, D R; Cruz, L J. Science. 1990;249:257–263. [PubMed]

Olivera, B M; Rivier, J; Scott, J K; Hillyard, D R; Cruz, L J. J Biol Chem. 1991;266:22067–22070. [PubMed]

Cruz, L J; Gray, W R; Yoshikami, D; Olivera, B. J Toxicol Toxin Rev. 1985;4:107–132.

10.

Monje, V D; Haack, J A; Naisbitt, S R; Miljanich, G; Ramachandran, J; Nasdasdi, L; Olivera, B M; Hillyard, D R; Gray, W R. Neuropharmacology. 1993;32:1141–1149. [PubMed]

11.

Fainzilber, M; Vanderschors, R; Lodder, J C; Li, K W; Geraerts, W P M; Kits, K S. Biochemistry. 1995;34:5364–5371. [PubMed]

12.

Fainzilber, M; Kofman, O; Zlotkin, E; Gordon, D. J Biol Chem. 1994;269:2574–2580. [PubMed]

13.

Kristipati, R; Nadasdi, L; Tarczyhornoch, K; Lau, K; Miljanich, G P; Ramachandran, J; Bell, J R. Mol Cell Neurosci. 1994;5:219–228. [PubMed]

14.

Hasson, A; Shon, K J; Olivera, B M; Spira, M E. J Neurophysiol. 1995;73:1295–1302. [PubMed]

15.

Kits, K S; Lodder, J C; Vanderschors, R C; Li, K W; Geraerts, W P M; Fainzilber, M. J Neurochem. 1996;67:2155–2163. [PubMed]

16.

Gandia, L; Lara, B; Imperial, J S; Villarroya, M; Albillos, A; Maroto, R; Garcia, A G; Olivera, B M. Eur J Physiol. 1997;435:55–64. [PubMed]

17.

Endean, R; Rudkin, C. Toxicon. 1963;1:49–64.

18.

Endean, R; Rudkin, C. Toxicon. 1965;2:225–249. [PubMed]

19.

Woodward, S R; Cruz, L J; Olivera, B M; Hillyard, D R. EMBO J. 1990;9:1015–1020. [PubMed]

20.

Colledge, C J; Hunsperger, J P; Imperial, J S; Hillyard, D R. Toxicon. 1992;30:1111–1116. [PubMed]

21.

Cruz, L J; Ramilo, C A; Corpuz, G P; Olivera, B M. Biol Bull. 1992;183:159–164.

22.

Hillyard, D R; Olivera, B M; Woodward, S; Corpuz, G P; Gray, W R; Ramilo, C A; Cruz, L J. Biochemistry. 1989;28:358–361. [PubMed]

23.

Kohn, A J. Pac Sci. 1980;34:359–369.

24.

Kohn, A J; Nybakken, J W. Mar Biol. 1975;29:211–234.

25.

Reichelt, R E; Kohn, A J. Proc Fifth Int Coral Reef Congr. 1985;5:191–196.

26.

Jakobsen, K S; Breivold, E; Hornes, E. Nucleic Acids Res. 1990;18:3669. [PubMed]

27.

Lee, Y H; Vacquier, V D. Anal Biochem. 1992;206:206–207. [PubMed]

28.

Marchuk, D; Drumm, M; Saulino, A; Collins, F S. Nucleic Acids Res. 1991;19:1154. [PubMed]

29.

Kumar, S; Tamura, K; Nei, M. mega: Molecular Evolutionary Genetics Analysis. University Park, PA: Pennsylvania State Univ.; 1993. , Version 1.01.

30.

Ina, Y. J Mol Evol. 1995;40:190–226. [PubMed]

31.

Hochberg, Y. Biometrika. 1988;75:800–802.

32.

Metz, E M; Robles-Sikisaka, R; Vacquier, V D. Proc Natl Acad Sci USA. 1998;95:10676–10681. [PubMed]

33.

Li, W-H. Molecular Evolution. Sunderland, MA: Sinauer; 1997.

34.

Hughes, A L; Nei, M. Nature (London). 1988;335:167–170. [PubMed]

35.

Hughes, A L. Mol Biol Evol. 1992;9:381–393. [PubMed]

36.

Hill, R E; Hastie, N D. Nature (London). 1987;326:96–99. [PubMed]

37.

Riley, M A. Mol Biol Evol. 1993;10:1048–1059. [PubMed]

38.

Zhang, J; Rosenberg, H F; Nei, M. Proc Natl Acad Sci USA. 1998;95:3708–3713. [PubMed]

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of
National Academy of Sciences