Hostname: page-component-89b8bd64d-shngb Total loading time: 0 Render date: 2026-05-07T01:16:48.137Z Has data issue: false hasContentIssue false

Maximum likelihood estimation of individual inbreeding coefficients and null allele frequencies

Published online by Cambridge University Press:  18 July 2012

NATHAN HALL
Affiliation:
Department of Mathematics, Western Washington University, Bellingham, WA 98225, USA
LAINA MERCER
Affiliation:
Department of Mathematics, Western Washington University, Bellingham, WA 98225, USA
DAISY PHILLIPS
Affiliation:
Department of Mathematics, Western Washington University, Bellingham, WA 98225, USA
JONATHAN SHAW
Affiliation:
North Carolina Wildlife Resources Commission, Raleigh, NC 27695, USA
AMY D. ANDERSON*
Affiliation:
Department of Mathematics, Western Washington University, Bellingham, WA 98225, USA
*
§Corresponding author: Department of Mathematics, Western Washington University, Bellingham, WA 98225, USA. E-mail: amy.anderson@wwu.edu
Rights & Permissions [Opens in a new window]

Summary

In this paper, we developed and compared several expectation–maximization (EM) algorithms to find maximum likelihood estimates of individual inbreeding coefficients using molecular marker information. The first method estimates the inbreeding coefficient for a single individual and assumes that allele frequencies are known without error. The second method jointly estimates inbreeding coefficients and allele frequencies for a set of individuals that have been genotyped at several loci. The third method generalizes the second method to include the case in which null alleles may be present. In particular, it is able to jointly estimate individual inbreeding coefficients and allele frequencies, including the frequencies of null alleles, and accounts for missing data. We compared our methods with several other estimation procedures using simulated data and found that our methods perform well. The maximum likelihood estimators consistently gave among the lowest root-mean-square-error (RMSE) of all the estimators that were compared. Our estimator that accounts for null alleles performed particularly well and was able to tease apart the effects of null alleles, randomly missing genotypes and differing degrees of inbreeding among members of the datasets we analysed. To illustrate the performance of our estimators, we analysed previously published datasets on mice (Mus musculus) and white-tailed deer (Odocoileus virginianus).

Information

Type
Research Papers
Copyright
Copyright © Cambridge University Press 2012
Figure 0

Table 1. Conditional genotype probabilities of the form Pr(gij*|gij, Xij, Θ) for each combination of gij, gij* and Xij

Figure 1

Table 2. RMSE, allele frequencies unknown, 10 possible alleles at each marker

Figure 2

Table 3. Estimated bias in the presence of null alleles, 10 alleles per marker

Figure 3

Table 4. Mean estimated null allele frequencies, 10 alleles per locus

Figure 4

Fig. 1. Results from the analyses of the Arizona Mice (top row) and Deer (bottom row) datasets. The plots on the left contain a point for each individual (mouse or deer) in the dataset. The x- and y-coordinates of each point are the estimated inbreeding coefficients for that individual from the Method 3 MLE (M3) and the Vogl estimator with the uniform prior (V1), respectively. In the plots on the right, each point represents a marker in the dataset and the coordinates are the estimated frequencies of the null allele at that marker.