Hostname: page-component-6766d58669-bkrcr Total loading time: 0 Render date: 2026-05-19T17:00:10.634Z Has data issue: false hasContentIssue false

How to deal with genotype uncertainty in variance component quantitative trait loci analyses

Published online by Cambridge University Press:  18 July 2011

XIA SHEN*
Affiliation:
The Linnaeus Centre for Bioinformatics, Uppsala University, Uppsala, Sweden School of Technology and Business Studies, Dalarna University, Borlänge, Sweden
LARS RÖNNEGÅRD
Affiliation:
School of Technology and Business Studies, Dalarna University, Borlänge, Sweden Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Uppsala, Sweden
ÖRJAN CARLBORG
Affiliation:
The Linnaeus Centre for Bioinformatics, Uppsala University, Uppsala, Sweden Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Uppsala, Sweden
*
*Corresponding author: The Linnaeus Centre for Bioinformatics, Uppsala University, Uppsala, Sweden. E-mail: xia.shen@lcb.uu.se
Rights & Permissions [Opens in a new window]

Summary

Dealing with genotype uncertainty is an ongoing issue in genetic analyses of complex traits. Here we consider genotype uncertainty in quantitative trait loci (QTL) analyses for large crosses in variance component models, where the genetic information is included in identity-by-descent (IBD) matrices. An IBD matrix is one realization from a distribution of potential IBD matrices given available marker information. In QTL analyses, its expectation is normally used resulting in potentially reduced accuracy and loss of power. Previously, IBD distributions have been included in models for small human full-sib families. We develop an Expectation–Maximization (EM) algorithm for estimating a full model based on Monte Carlo imputation for applications in large animal pedigrees. Our simulations show that the bias of variance component estimates using traditional expected IBD matrix can be adjusted by accounting for the distribution and that the calculations are computationally feasible for large pedigrees.

Information

Type
Research Papers
Copyright
Copyright © Cambridge University Press 2011
Figure 0

Fig. 1. Pedigree I – a simulated F2 intercross with 10 individuals including two founders. (a) A three-generation intercross with five offspring, three parents and two grandparents. Squares and circles denote male and female animals, respectively, with indices inside. Bubbles with arrows pointing to each animal indicate the true genotypes that are not observed. (b) For the distribution method and the expectation method, asymptotic trend of bias in estimating the narrow sense heritability is displayed with respect to the heritability of the trait.

Figure 1

Fig. 2. Pedigree II – a simulated F2 intercross with 18 individuals including four founders. Squares and circles denote male and female animals, respectively, with indices inside. Each dashed curve connects the same individual for the purpose of clear display.

Figure 2

Table 1. The tabular form of pedigree II. Two markers that have a complete link to the QTL were simulated. ℓD and ℓE are log-likelihood from the distribution method and the expectation method, respectively. Given marker I, D>E, while given marker II, D<E.

Figure 3

Table 2. Heritability estimates for pedigree I. Using the distribution method, variance components were estimated with different number of Monte Carlo imputes. Compared to the simulated true value, the heritability estimated by the distribution method had much less bias than that estimated by the expectation method.

Figure 4

Fig. 3. Simulation results for power of interval mapping. A QTL was simulated at 25 cM of pig chromosome 6. The two markers flanking the interval harbouring the QTL are located at 8·6 cM and 36·6 cM. The real experimental genotypes for 191 F2 individuals were used to simulate phenotypes, assuming a narrow sense heritability of 0·9, 0·1, and 0. 1000 simulations were used for comparing the log-likelihood from the expectation and the distribution methods. The points above/below the diagonal are in red/blue, indicating that the distribution method has larger power than the expectation method (a, b), or that the distribution method has lower false positive rate than the expectation method (c). The numbers in colour show the corresponding percentages of the sets of points.

Figure 5

Fig. 4. QTL scan using LRT statistic along pig chromosome 6. The meat quality trait (reflectance value, EEL) is strongly affected by the halothane gene located at 80·4 cM on the chromosome. By adjusting bias of likelihood estimates, thedistribution method refined the peak of the traditional variance component QTL scan using the expectation method and thereby shortened the confidence interval for the QTL. Information for each micro-satellite marker is shown as their PIC.