Hostname: page-component-89b8bd64d-j4x9h Total loading time: 0 Render date: 2026-05-05T22:42:14.189Z Has data issue: false hasContentIssue false

A Bayesian method for jointly estimating allele age and selection intensity

Published online by Cambridge University Press:  20 February 2008

MONTGOMERY SLATKIN*
Affiliation:
Department of Integrative Biology, 3060 VLSB, University of California at Berkeley, Berkeley, CA 94720-3140, USA
*
*Telephone: +1 (510) 6436300. Fax: +1 (510) 6436264. e-mail: slatkin@berkeley.edu
Rights & Permissions [Opens in a new window]

Summary

The problem of jointly estimating the intensity of past selection affecting an allele and the allele's age is formulated in a Bayesian framework. The prior distribution of allele age given its frequency is obtained from existing population genetics theory. The prior distribution of selection intensity is assumed to reflect the fact that positive selection on a new mutant is more likely to be weak than strong. The general approach is illustrated by the development of an importance sampling method applicable to low-frequency alleles. This method can be used either when the haplotypes of closely linked marker loci are known or when the lengths of linked ancestral chromosomal segments can be inferred. The method is illustrated with an application to the A− allele of G6PD in Africa. Because changes in allele frequency and recombination are both intrinsically stochastic, there are limits to the accuracy achievable with any method.

Information

Type
Paper
Copyright
Copyright © Cambridge University Press 2008
Figure 0

Fig. 1. The intra-allelic genealogy of a mutant M found in four copies (i=4), illustrating the definitions of allele age, t1, and the intra-allelic coalescence times, t2, t3 and t4.

Figure 1

Fig. 2. The joint posterior distribution of s and t1 computed for the A− allele of G6PD using the importance sampling method described in the text. The data and genetic map were taken from Saunders et al. (2005). In both cases, 106 replicates were used. (a) Pr(s, t1) estimated from the lengths of ancestral chromosomal segments linked to A−. The identification of the ancestral segments is described in the text. (b) Pr(s, t1) estimated from data for four marker loci (BGN, IDH3G, 1CAM and G0·9MT) linked to G6PD. The details of the genetic map are described in the text.

Figure 2

Fig. 3. Marginal distributions of s and t1 obtained by summing over the joint distributions. (a) and (b) are based on the data presented in Fig. 2a and (c) and (d) are based on the data presented in Fig. 2b.