Hostname: page-component-89b8bd64d-9prln Total loading time: 0 Render date: 2026-05-06T18:20:29.480Z Has data issue: false hasContentIssue false

On the measurements of genetic differentiation among populations

Published online by Cambridge University Press:  08 January 2013

J. WANG*
Affiliation:
Institute of Zoology, Zoological Society of London, London NW1 4RY, UK
*
*Corresponding author: Institute of Zoology, Regent's Park, London NW1 4RY, UK. Tel: 0044 20 74496620. Fax: 0044 20 75862870. E-mail: jinliang.wang@ioz.ac.uk
Rights & Permissions [Opens in a new window]

Summary

FST, a measurement of the genetic differentiation among subpopulations, is a fundamental parameter in population genetics, with many valuable applications in molecular biology, evolutionary biology, conservation and forensics. One of its close relatives, GST, has been widely used to measure differentiation from highly polymorphic markers such as microsatellites. However, because of the high mutation rate of such markers, GST may underestimate the genomic differentiation due to demographic causes such as migration rate and subpopulation size. A new statistic proposed recently, Jost's D, was claimed to have better properties than GST and was advocated to replace GST as a measure of differentiation. This paper shows that D is not a proper measure of differentiation because it fails to meet some fundamental requirements as a differentiation statistic, and is hardly estimable without bias in practice. D is highly dependent on the gene diversity of a marker and on the unknown parameter of the number of subpopulations, is highly sensitive to how alleles and loci are defined and how data are analysed, does not increase monotonically with either divergence time or drift, and does not always have a maximal value of 1. The maximal D value can be zero or close to zero, depending on the number of alleles at a locus relative to the number of subpopulations. I suggest continuing the use of GST, with caution in its interpretation when highly polymorphic markers are used, before a better estimator of FST that explicitly accounts for mutations is developed.

Information

Type
Research Papers
Copyright
Copyright © Cambridge University Press 2012
Figure 0

Fig. 1. Values of GST and D at generation t as a function of initial gene diversity H0. The parameters are s = ∞ and u = m = 0 for panel (a), s = ∞ and u = m = 0·001 for panel (b), s = 10 and u = m = 0·001 for panel (c) and s = 10, u = 0·01 and m = 0·001 for panel (d). In all four cases, the subpopulation size is N = 100.

Figure 1

Fig. 2. Simulated and theoretical D and GST values as a function of generations (a) and subpopulation size (b). The parameters used in generating the graphs are m = 0·01, u = 0·001, s = 2, N = 100 (a only), t = 200 (b only), and initially 10 alleles of an equal frequency for both subpopulations. Simulated values were obtained from 10 000 replicates, and theoretical values were obtained from recurrence eqn (10).

Figure 2

Fig. 3. Distributions of D values at complete differentiation (HS = 0) of a population subdivided into s = 10 subpopulations under the pure drift model (no mutation, no migration and no selection). The distribution is obtained from 100 000 replicate simulations for a locus with k = 2, 5 and 10 alleles of an equal frequency initially. For all three cases, GST = 1 with frequency 1.

Figure 3

Fig. 4. Effects of pooling alleles on D and GST values as a function of generations, since a population becomes subdivided. The simulations (10 000 replicates) assumed a population subdivided at t = 0 into s = 10 subpopulations, with m = 0·01, u = 0·001, N = 100 and initially 10 equifrequent alleles at a locus.

Figure 4

Fig. 5. Effects of the number of subpopulations on D, GST and FST values. The results are obtained assuming a population subdivided into s subpopulations, with m = 0·01, u = 0·001, N = 100 in the island model and IAM. (a) Shows the equilibrium D, GST and FST values, and (b) shows the D and GST values at generations 10, 100 and 1000 since the subdivision.