Adaptation and speciation: what can F_st tell us?

doi:10.1016/j.tree.2005.05.017

Trends in Ecology & Evolution

Volume 20, Issue 8, August 2005, Pages 435-440

A useful way of summarizing genetic variability among different populations is through estimates of the inbreeding coefficient, F_st. Several recent studies have tried to use the distribution of estimates of F_st from individual genetic loci to detect the effects of natural selection. However, the promise of this approach has yet to be fully realized owing to the pervasive dogma that this distribution is highly dependent on demographic history. Here, I review recent theoretical results that indicate that the distribution of estimates of F_st is generally expected to be robust to the vagaries of demographic history. I suggest that analyses based on it provide a useful first step for identifying candidate genes that might be under selection, and explore the ways in which this information can be used in ecological and evolutionary studies.

Introduction

It is becoming ever cheaper and faster to survey samples genetically from model and non-model organisms for a large number of loci across their genomes, an advance that derives largely from the activities of the biomedical community [1]. Here, I reappraise an old idea [2] for the analysis of such data. It was first proposed when multi-locus surveys were scarce and difficult to obtain, was later discredited [3,4] and abandoned, and now, fuelled by the plentiful supply of these data, occasionally peeps apologetically out of research articles, smothered in caveats [5].

The ready availability of different classes of gene frequency information has rekindled an interest in natural selection and the development of a variety of methods for inferring the presence and mode of selection. Three main approaches can be identified [6]: (i) detailed modelling of selection at individual loci or sequences; (ii) multi-locus comparisons, of which the Lewontin–Krakauer method (see Glossary) [2], discussed here, is the oldest; and (iii) comparison of patterns of substitution among synonymous and non-synonymous sites. The advantages and drawbacks of these different approaches are detailed by Nielsen [6]. Much of the research has been driven by the biomedical community with the aim of identifying and characterizing biochemical function and the phenotypic effect of natural variation throughout the human genome, often based on comparative analysis [7]. Judgements about the efficacies of different methods implicitly tend to have these, ultimately medical, goals in mind.

In evolutionary biology there are, by contrast, several interesting hypotheses that can be tested by characterizing the number, position and fitness effects of genomic regions that show apparent adaptive divergence in allele frequency, without the need to delve into the physiological details. I argue that, for most organisms, the easiest way to achieve this is by using the Lewontin–Krakauer method, which appears, at least in recent versions, to be generally robust to the vagaries of demographic history. I invoke recent theoretical results that suggest why this is not so surprising and discuss possible sampling strategies that might maximize the power of the approach. I also outline areas of application, particularly the study of adaptive divergence in the face of gene flow and modes of speciation.

Inbreeding coefficients and the identification of loci subject to selection

The study of selection, particularly local adaptation, at the genetic level has a long history (usefully reviewed in [8]). Examples include the study of local crypsis in response to bird predation in the snail Cepaea nemoralis [9] and the peppered moth Biston betularia [10]. The genetics of adaptation was largely eclipsed during the 1980s and 1990s by an interest in the possibility of recovering the historical demography of populations through an analysis of genetic variation, in particular, of

Alternative methods

The more recent studies have not invalidated the original criticisms of the Lewontin–Krakauer test [3,4,17–19], but, instead, suggest that they are often not applicable to the real world. The criticisms can be re-expressed as violations of the separation-of-timescales approximation. Potentially problematic are high mutation rate loci, such as microsatellites, in which mutations occur in the scattering phase [32]. Another problem arises when the gene frequency in the collecting phase is not

Design of surveys

How should surveys be designed to maximize the chances of picking up loci that are subject to selection? The approximation given by Lewontin and Krakauer (Box 1) can be used to obtain some idea of the expected variability in estimates of F_st among loci. This shows that there is substantial variability and skew in estimates of F_st when biallelic markers are surveyed in only a pair of populations, and hence there is potentially little power to detect outlier loci unless selection is strong
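As a rough illustration of this variability, the following Python sketch assumes the standard statement of the Lewontin–Krakauer approximation (Box 1 is not reproduced here): under neutrality, (n − 1)F_st/mean(F_st) is treated as approximately chi-square distributed with n − 1 degrees of freedom for n populations, which gives a variance of about 2·mean(F_st)²/(n − 1). The function names, the mean F_st of 0.05 and the numbers of populations are illustrative assumptions, not values taken from the article.

```python
import random
import statistics


def lk_variance(mean_fst, n_pops):
    """Approximate neutral variance of per-locus F_st among loci.

    Assumes the chi-square form of the Lewontin-Krakauer approximation:
    Var(F_st) ~ 2 * mean_fst**2 / (n_pops - 1).
    """
    return 2.0 * mean_fst ** 2 / (n_pops - 1)


def simulate_lk_fst(mean_fst, n_pops, n_loci, seed=0):
    """Draw per-locus F_st values from the assumed neutral approximation."""
    rng = random.Random(seed)
    df = n_pops - 1
    # A chi-square variate with df degrees of freedom is Gamma(shape=df/2, scale=2).
    return [mean_fst * rng.gammavariate(df / 2.0, 2.0) / df for _ in range(n_loci)]


def skewness(values):
    """Sample skewness (third standardized moment)."""
    m = statistics.fmean(values)
    s = statistics.pstdev(values)
    return sum(((v - m) / s) ** 3 for v in values) / len(values)


def summarize(mean_fst, n_pops, n_loci=20000, seed=0):
    """Mean, variance, skewness and 99th percentile of simulated F_st values."""
    draws = simulate_lk_fst(mean_fst, n_pops, n_loci, seed)
    percentiles = statistics.quantiles(draws, n=100)  # 99 cut points
    return {
        "mean": statistics.fmean(draws),
        "variance": statistics.pvariance(draws),
        "skewness": skewness(draws),
        "q99": percentiles[98],
    }


# Hypothetical survey designs: a single pair of populations versus eleven.
pair = summarize(mean_fst=0.05, n_pops=2)
many = summarize(mean_fst=0.05, n_pops=11)

for label, result in (("2 populations", pair), ("11 populations", many)):
    print(label, {key: round(value, 4) for key, value in result.items()})
```

Under these assumptions, a mean F_st of 0.05 gives an approximate variance of 0.005 for two populations, ten times the value for eleven populations, together with a much longer upper tail. A locus must therefore show a far more extreme F_st to stand out in a single population pair. The sketch ignores the upper bound of 1 on F_st and any dependence on allele frequency, so it should be read only as a guide to the qualitative effect of the number of populations.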

Example applications

There have been an increasing number of studies that aim to identify loci subject to selection [46,47], often using the distribution of F_st among loci. An illustrative case stems from the work of Pogson and colleagues [48]. They studied a mixture of RFLPs and allozymes in populations of Atlantic cod Gadus morhua. On the basis of tests closely related to those of Lewontin and Krakauer [2], they showed that the mean F_st in allozymes was lower than that of RFLPs, and suggested that the allozymes

Testing models of adaptation and speciation

Once interesting genomic regions have been identified through an analysis of F_st, or related method, how might the information be used? In conjunction with demographic information, such as immigration rate, it might be possible to quantify the distribution of fitness effects that are necessary to lead to the observed distribution of estimates of F_st, and thereby test evolutionary hypotheses [47]. Although predictions of phenotypic evolution are often still based on the infinitesimal model [53],

Conclusions

The abandonment of Lewontin and Krakauer's idea could be regarded as a major advance in the wider acceptance of the usefulness of the neutral theory, and the importance of demographic events in shaping gene frequency data. This can then be seen as leading directly to the significant research programme of the 1980s and 1990s, that of trying to recover population history from genetic information. Ironically, however, the reinstatement of Lewontin and Krakauer's ideas depends largely on a

Acknowledgements

I thank Jay Storz, Renaud Vitalis and three anonymous referees for their useful comments on previous versions of the article. This work was supported by an Advanced Fellowship from the Natural Environment Research Council.

Glossary

AFLP: amplified fragment length polymorphism. A way of assaying nucleotide sites for polymorphisms, typically resulting in a dominant marker system. A relatively inexpensive way of obtaining many markers.
Ascertainment bias: bias in demographic inferences owing to the use of (typically) low mutation rate markers, such as SNPs, that have been previously identified in earlier smaller scale studies. The SNPs so identified will form a biased subset, with alleles at intermediate frequencies (otherwise

References (70)

M. Nei et al. Drift variance of F_st and G_st statistics obtained from a finite number of isolated populations. Theor. Popul. Biol. (1977)
M. Nei. Mean and variance of F_st in a finite number of incompletely isolated populations. Theor. Popul. Biol. (1977)
J. Wakeley. The coalescent in an island model of population subdivision with variation among demes. Theor. Popul. Biol. (2001)
D.J. Balding et al. DNA profile match probability calculations: how to allow for population stratification, relatedness, database selection and single bands. For. Sci. Int. (1994)
D.J. Balding. Likelihood-based inference for genetic correlation coefficients. Theor. Popul. Biol. (2003)
C. Schlotterer. Towards a molecular characterization of adaptation in local populations. Curr. Opin. Genet. Dev. (2002)
C. Schlotterer. Hitchhiking mapping – functional genomics from the population genetics perspective. Trends Genet. (2003)
B. Guinand. How to detect polymorphisms undergoing selection in marine fishes? A review of methods and case studies, including flatfishes. J. Sea Res. (2004)
L.H. Rieseberg. Chromosomal rearrangements and speciation. Trends Ecol. Evol. (2001)
R.A. Gibbs. The International HapMap Project. Nature (2003)
R.C. Lewontin et al. Distribution of gene frequency as a test of the theory of the selective neutrality of polymorphisms. Genetics (1973)
M. Nei et al. Lewontin–Krakauer test for neutral genes. Genetics (1975)
A. Robertson. Remarks on the Lewontin–Krakauer test. Genetics (1975)
J.M. Akey. Interrogating a high-density SNP map for signatures of natural selection. Genome Res. (2002)
R. Nielsen. Statistical tests of selective neutrality in the age of genomics. Heredity (2001)
A.G. Clark. Inferring nonneutral evolution from human–chimp–mouse orthologous gene trios. Science (2003)
E.B. Ford. Ecological Genetics (1975)
A.J. Cain et al. Selection in the polymorphic land snail Cepaea nemoralis. Heredity (1950)
H.B.D. Kettlewell. A survey of the frequencies of Biston Betularia (L) (Lep.) and its melanic forms in Great Britain. Heredity (1958)
J.C. Avise. Molecular Markers, Natural History, and Evolution (1994)
L.L. Cavalli-Sforza. Population structure and human evolution. Proc. R. Soc. Lond. B Biol. Sci. (1966)
B.S. Weir et al. Estimating F-statistics for the analysis of population structure. Evolution (1984)
R. Vitalis. Interpretation of variation across marker loci as evidence of selection. Genetics (2001)
B.S. Weir et al. Estimating F-statistics. Annu. Rev. Genet. (2002)
T. Arends. Intra-tribal genetic differentiation among the Yanomama Indians of Southern Venezuela. Proc. Natl. Acad. Sci. U. S. A. (1966)
A. Robertson. Gene frequency distribution as a test of selective neutrality. Genetics (1975)
A.M. Bowcock. Drift, admixture, and selection in human-evolution – a study with DNA polymorphisms. Proc. Natl. Acad. Sci. U. S. A. (1991)
M.A. Beaumont et al. Evaluating loci for use in the genetic analysis of population structure. Proc. R. Soc. Lond. B Biol. Sci. (1996)
M. Nordborg. Structured coalescent processes on different time scales. Genetics (1997)
J. Wakeley. Nonequilibrium migration in human history. Genetics (1999)
J. Wakeley et al. Gene genealogies in a metapopulation. Genetics (2001)
J.F. Wilkins. A separation-of-timescales approach to the coalescent in a continuous population. Genetics (2005)
B. Rannala et al. Estimating gene flow in island populations. Genet. Res. (1996)
A.H. Porter. A test for deviation from island-model population structure. Mol. Ecol. (2003)
M.A. Beaumont et al. Identifying adaptive genetic divergence among populations from genome scans. Mol. Ecol. (2004)
