Hostname: page-component-6766d58669-88psn Total loading time: 0 Render date: 2026-05-21T01:32:18.779Z Has data issue: false hasContentIssue false

Triadic IBD coefficients and applications to estimating pairwise relatedness

Published online by Cambridge University Press:  26 September 2007

JINLIANG WANG*
Affiliation:
Institute of Zoology, Zoological Society of London, Regent's Park, London NW1 4RY, UK
*
*Corresponding author. Telephone: +44 20 74496620. Fax: +44 20 75862870. e-mail: jinliang.wang@ioz.ac.uk
Rights & Permissions [Opens in a new window]

Summary

Knowledge of the genetic relatedness among individuals is essential in diverse research areas such as behavioural ecology, conservation biology, quantitative genetics and forensics. How to estimate relatedness accurately from genetic marker information has been explored recently by many methodological studies. In this investigation I propose a new likelihood method that uses the genotypes of a triad of individuals in estimating pairwise relatedness (r). The idea is to use a third individual as a control (reference) in estimating the r between two other individuals, thus reducing the chance of genes identical in state being mistakenly inferred as identical by descent. The new method allows for inbreeding and accounts for genotype errors in data. Analyses of both simulated and human microsatellite and SNP datasets show that the quality of r estimates (measured by the root mean squared error, RMSE) is generally improved substantially by the new triadic likelihood method (TL) over the dyadic likelihood method and five moment estimators. Simulations also show that genotyping errors/mutations, when ignored, result in underestimates of r for related dyads, and that incorporating a model of typing errors in the TL method improves r estimates for highly related dyads but impairs those for loosely related or unrelated dyads. The effects of inbreeding were also investigated through simulations. It is concluded that, because most dyads in a natural population are unrelated or only loosely related, the overall performance of the new triadic likelihood method is the best, offering r estimates with a RMSE that is substantially smaller than the five commonly used moment estimators and the dyadic likelihood method.

Information

Type
Research Article
Copyright
Copyright © Cambridge University Press 2007
Figure 0

Table 1. Dyadic (four-gene) IBD states and their probabilities

Figure 1

Table 2. Triadic (six-gene) IBD states and their probabilities

Figure 2

Table 3. Effect of the number of reference individuals on the triadic likelihood estimator

Figure 3

Fig. 1. Comparison of seven relatedness estimators using simulated data with different numbers of loci. The root mean squared errors (y-axis, on a logarithmic scale; A) and biases (B) are plotted as a function of the number of loci (x-axis) for four relationships (indicated by column heads) and three allele frequency distributions (indicated by row heads on the right). Each locus is assumed to have eight alleles with known frequencies in an EF, UF or CF distribution. The seven estimators indicated by different lines are the new triadic likelihood (TL), Wang's (2002) estimator (W), Lynch (1988) and Li et al.'s (1993) estimator (LL), Lynch and Ritland's (1999) estimator (LR), Ritland's (1996) estimator (R), Queller and Goodnight's (1989) estimator (QG) and Milligan's dyadic likelihood estimator (M).

Figure 4

Fig. 2. Comparison of seven relatedness estimators using simulated data with different numbers of alleles per locus. The biases and root mean squared errors (y-axis, on a logarithmic scale) are plotted as a function of the number of alleles per locus (x-axis, on a logarithmic scale) for four relationships (indicated by column heads). The number of loci is fixed at 30, and the alleles at each locus have known frequencies drawn from a uniform Dirichlet distribution. The seven estimators indicated by different lines are specified in Fig. 1 and the text.

Figure 5

Fig. 3. Effect of inbreeding on the biases and RMSEs of relatedness estimators. Each individual in the PO, FS, FC or UR dyads has the same inbreeding coefficient, F. With known allele frequencies, biases and RMSEs are calculated by the seven relatedness estimators (specified in Fig. 1 and the text) assuming non-inbreeding, and by the dyadic (denoted by unbroken lines with filled circles) and triadic (denoted by unbroken lines with open circles) likelihood methods allowing for inbreeding. (A) Biases and RMSEs plotted as a function of the inbreeding coefficients of individuals. Each individual is genotyped at 15 loci, with each locus having eight alleles in a triangular frequency distribution. (B) Biases and RMSEs plotted as a function of the number of loci, each having eight alleles in a triangular frequency distribution. The F value of each individual is fixed at 0·16.

Figure 6

Fig. 4. RMSEs of relatedness estimators as a function of the genotyping error rate (e, x-axis) in generating the simulated data. Each individual in the parent–offspring, full-sib, first cousin and unrelated dyads is genotyped at 20 loci, with each locus having eight alleles in a uniform Dirichlet frequency distribution. RMSEs are calculated for the triadic likelihood estimator assuming known allele frequencies and an error rate of ê=0, ê=e, ê=0·8e or ê=1·2e.

Figure 7

Fig. 5. RMSEs of relatedness estimators as a function of the number of microsatellite (A) or SNP (B) loci used in the CEPH dataset. The RMSE of an estimator is calculated for each type and number of markers, and for each of the four relationships (parent–offspring, PO; full-sibs, FS; grandparent–grandoffspring, GG; unrelated, UR). Note that for the case of SNPs, estimators W and LL are the same while LR and QG have indistinguishable RMSEs.

Figure 8

Fig. 6. Triangular plot of IBD coefficients for PO, FS, GG and UR dyads in the CEPH dataset. The numbers of dyads shown in the plot are 795, 1304, 970, and 311 for PO, FS, GG and UR relationships, respectively. Estimates of Δi (i=7, 8, 9) for each dyad are obtained from the triadic likelihood method using 100 microsatellites with known allele frequencies. The top, left and right points of the triangle have IBD coefficients {Δ7, Δ8, Δ9}={0,0,1}, {0,1,0} and {1,0,0}, respectively, representing UR, PO and identical twin relationships, respectively.