Genome Wide Analyses Reveal Little Evidence for Adaptive Evolution in Many Plant Species
Genome Wide Analyses Reveal Little Evidence for Adaptive
Evolution in Many Plant Species
Toni I. Gossmann,1 Bao-Hua Song,2 Aaron J. Windsor,2 Thomas Mitchell-Olds,2 Christopher J. Dixon,3
Maxim V. Kapralov,3 Dmitry A. Filatov,3 and Adam Eyre-Walker∗,1
1
Centre for the Study of Evolution, School of Life Sciences, University of Sussex, Brighton, United Kingdom
Institute for Genome Sciences and Policy, Department of Biology, Duke University
3
Department of Plant Sciences, University of Oxford, Oxford, United Kingdom
2
*Corresponding author: E-mail: .
Associate editor: Naoki Takebayashi
The relative contribution of advantageous and neutral mutations to the evolutionary process is a central problem in evolutionary biology. Current estimates suggest that whereas Drosophila, mice, and bacteria have undergone extensive adaptive
evolution, hominids show little or no evidence of adaptive evolution in protein-coding sequences. This may be a consequence
of differences in effective population size. To study the matter further, we have investigated whether plants show evidence of
adaptive evolution using an extension of the McDonald–Kreitman test that explicitly models slightly deleterious mutations
by estimating the distribution of fitness effects of new mutations. We apply this method to data from nine pairs of species.
Altogether more than 2,400 loci with an average length of ≈280 nucleotides were analyzed. We observe very similar results
in all species; we find little evidence of adaptive amino acid substitution in any comparison except sunflowers. This may be
because many plant species have modest effective population sizes.
Key words: adaptive evolution, distribution of fitness effects, plants, effective population size, McDonald-Kreitman test.
Introduction
The contribution of adaptive evolution relative to genetic
drift is a fundamental problem in molecular evolution
(Kimura 1983; Gillespie 1991). Several methods to estimate the fraction of adaptive substitutions, α, have been
developed based on the McDonald–Kreitman (MK) test
(McDonald and Kreitman 1991) that contrast polymorphism and divergence between selectively and neutrally
evolving sites (Charlesworth 1994; Fay et al. 2001; Smith
and Eyre-Walker 2002; Bierne and Eyre-Walker 2004; Welch
2006; Boyko et al. 2008; Eyre-Walker and Keightley 2009).
These methods have been applied to a variety of species.
Estimates in Drosophila (Smith and Eyre-Walker 2002;
Bierne and Eyre-Walker 2004; Welch 2006; Bachtrog 2008)
and rodents (Halligan et al. 2010) suggest that ≈50% of all
amino acid substitutions have been fixed as a consequence
of adaptive evolution and for microorganisms estimates
may be even higher (Charlesworth and Eyre-Walker 2006;
Liti et al. 2009). However, although analyses of DNA sequence diversity show signs of some adaptive evolution
(Fay et al. 2001; Zhang and Li 2005), overall the level of
adaptive evolution in hominids appears to be very low
(Chimpanzee Sequencing and Analysis Consortium 2005;
Zhang and Li 2005; Boyko et al. 2008; Eyre-Walker and
Keightley 2009). The contrast between hominids and other
animals has led to the suggestion that effective population
size may be an important determinant of the rate of adaptive evolution because hominids typically have low effective
population sizes in contrast to rodents, insects, and bacteria
(Eyre-Walker et al. 2002; Fraser et al. 2007; Halligan et al.
2010). However, some caution should be exercised because
the level of adaptive evolution is typically measured as the
proportion of substitutions that are adaptive and this depends both on the numbers of substitutions that are effectively neutral and the number that are advantageous.
It has been shown that the number of effectively neutral
mutations is negatively correlated to the effective population size in many species (Woolfit and Bromham 2003,
2005; Popadin et al. 2007; Moran et al. 2008; Piganeau and
Eyre-Walker 2009). Hence, the correlation between proportion of substitutions that are adaptive and Ne may
be a consequence of the correlation between the proportion of effectively neutral mutations and Ne and may
not reflect any change in the absolute rate of adaptive
evolution.
The rate of adaptive evolution has also been studied in plants. On a genome-wide scale, previous studies
in Arabidopsis thaliana have shown little evidence for
adaptive evolution (Bustamante et al. 2002; Barrier et al.
2003; Schmid et al. 2005). This was attributed to the high
frequency of inbreeding in A. thaliana and the reduction
in effective population size that this caused. However, the
outcrossing species A. lyrata (Barnaud et al. 2008; Foxe
et al. 2008), the partially outcrossing cultivated tropical
grass Sorghum bicolor (Hamblin et al. 2006), and the mainly
outcrossing Zea species (Bijlsma et al. 1986; Ross-Ibarra
et al. 2009) also show little evidence of positive selection. Instead, all these species show evidence of slightly deleterious
© The Author 2010. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please
e-mail:
Mol. Biol. Evol. 27(8):1822–1832. 2010
doi:10.1093/molbev/msq079
Advance Access publication March 18, 2010
1822
Research article
Abstract
Little Evidence for Adaptive Evolution in Plant Species · doi:10.1093/molbev/msq079
Materials and Methods
Sequence Data
Data were retrieved from Genbank (http://www.ncbi
.nlm.nih.gov/Genbank) for Oryza spp. (GenBank IDs:
EF000002–EF01059, Caicedo et al. 2007), Populus tremula (EU752500–EU753117, Ingvarsson 2008b), A. lyrata
(BV683158–BV686427, EF502173–EF502282, EF502359–
EF502483, and EF502558–EF502973, Ross-Ibarra et al.
2008; EU592234–EU592323, Foxe et al. 2008), Zea mays
(BV123534–BV144210 and BV446558–BV447590, Wright
et al. 2005), S. bicolor (DQ427111–DQ430705, Hamblin
et al. 2006), Boechera stricta (FJ573482–FJ577247,
Song et al. 2009; GQ907358–GQ910665), Schiedea globosa
(GU830974–GU831538), and Helianthus petiolaris and
H. annuus (Strasburg et al. 2009; supplementary table S1,
Supplementary Material online). Polymorphic data for A.
thaliana were downloaded from http://walnut.usc.edu
/2010 and for P. balsamifera from http://www.popgen
.uaf.edu/data (Olson et al. 2010).
The annotated protein-coding genome of A. thaliana
was obtained from TAIR (ftp://ftp.arabidopsis.org/
home/tair/Genes/TAIR8 genome release). The annotated P. trichocarpa and S. bicolor genomes were obtained
from JGI (http://genome.jpi-psf.org/Poptr1 1, Tuskan et al.
2006 and http://genome.jgi-psf.org/Sorbi1, Paterson et al.
2009). Predicted coding sequences of Z. mays were obtained
from http://magi.plantgenomics.iastate.edu/downloadall.
html (Fu et al. 2005) and http://ftp.maizesequence.org/
release-3b.50/sequences/. The rice genome was downloaded from ftp://ftp.plantbiology.msu.edu/pub/data/
Eukaryotic Projects/o sativa/annotation dbs/pseudomole
cules/version 6.0/all.dir/. We analyzed the polymorphism
d (...truncated)