Hostname: page-component-89b8bd64d-4ws75 Total loading time: 0 Render date: 2026-05-06T12:11:14.069Z Has data issue: false hasContentIssue false

RWGAIM: an efficient high-dimensional random whole genome average (QTL) interval mapping approach

Published online by Cambridge University Press:  04 February 2013

ARŪNAS P. VERBYLA*
Affiliation:
School of Agriculture, Food and Wine, The University of Adelaide, PMB 1, Glen Osmond, SA 5064, Australia Mathematics, Informatics and Statistics and Food Futures National Research Flagship, CSIRO, Glen Osmond SA 5064, Australia
JULIAN D. TAYLOR
Affiliation:
School of Agriculture, Food and Wine, The University of Adelaide, PMB 1, Glen Osmond, SA 5064, Australia Mathematics, Informatics and Statistics and Food Futures National Research Flagship, CSIRO, Glen Osmond SA 5064, Australia
KLARA L. VERBYLA
Affiliation:
Mathematics, Informatics and Statistics and Food Futures National Research Flagship, CSIRO, Acton ACT 2601, Australia
*
*Corresponding author: School of Agriculture, Food and Wine, The University of Adelaide, PMB 1, Glen Osmond, SA 5064, Australia. Tel: +618 8303 8769. e-mail: ari.verbyla@adelaide.edu.au
Rights & Permissions [Opens in a new window]

Summary

Mapping of quantitative trait loci (QTLs) underlying variation in quantitative traits continues to be a powerful tool in genetic study of plants and other organisms. Whole genome average interval mapping (WGAIM), a mixed model QTL mapping approach using all intervals or markers simultaneously, has been demonstrated to outperform composite interval mapping, a common approach for QTL analysis. However, the advent of high-throughput high-dimensional marker platforms provides a challenge. To overcome this, a dimension reduction technique is proposed for WGAIM for efficient analysis of a large number of markers. This approach results in reduced computing time as it is dependent on the number of genetic lines (or individuals) rather than the number of intervals (or markers). The approach allows for the full set of potential QTL effects to be recovered. A proposed random effects version of WGAIM aims to reduce bias in the estimated size of QTL effects. Lastly, the two-stage outlier procedure used in WGAIM is replaced by a single stage approach to reduce possible bias in the selection of putative QTL in both WGAIM and the random effects version. Simulation is used to demonstrate the efficiency of the dimension reduction approach as well as demonstrate that while the approaches are very similar, the random WGAIM performs better than the original and modified fixed WGAIM by reducing bias and in terms of mean square error of prediction of estimated QTL effects. Finally, an analysis of a doubled haploid population is used to illustrate the three approaches.

Information

Type
Research Papers
Copyright
Copyright © Cambridge University Press 2013
Figure 0

Fig. 1. Scheme for WGAIM, WGAIMI and RWGAIM. Difference between the approaches is indicated in italics and the equations in the text corresponding to each step are indicated in brackets where relevant.

Figure 1

Fig. 2. Computation time (in minutes) for analysis using the dimension reduction approach (solid line) and the approach using markers explicitly (broken line) for numbers of markers from 210 to 1010.

Figure 2

Table 1. High-dimensional simulations for varying population sizes and number of markers. Non-redundant markers varied with population size. Both the processor time and processor time per step of the forward selection process are presented

Figure 3

Table 2. Proportion of 500 simulations in which the QTL was detected for WGAIM, WGAIMI and random WGAIM (RWGAIM) analyses. The standard error for a proportion can be based on binomial distribution, that is $\sqrt {\hat{p}\lpar 1 \minus \hat{p}\rpar \sol 500} $. For $\hat{p} \equals 0{\cdot}1$ (and hence $\hat{p} \equals 0{\cdot}9$), the standard error is 0·013, for $\hat{p} \equals 0.3$ the standard error is 0·020 and for $\hat{p} \equals 0.5$ the standard error is 0·022

Figure 4

Table 3. Two way tables for the QTL in repulsion (C1·4 and C1·8) with proportions of the 500 simulations for each population size for the combinations of non-detected $\overline{D}$ and detected D QTL. C1·4 is on the left and C1·8 on the top of each 2 × 2 table. The standard error for a proportion can be based on binomial distribution, that is $\sqrt {\hat{p}\lpar 1 \minus \hat{p}\rpar \sol 500} $. For $\hat{p} \equals 0{\cdot}1$ (and hence $\hat{p} \equals 0{\cdot}9$) the standard error is 0·013, for $\hat{p} \equals 0{\cdot}3$ the standard error is 0·020 and for $\hat{p} \equals 0{\cdot}5$ the standard error is 0·022

Figure 5

Table 4. Two way tables for the QTL in coupling (C2·4 and C2·8) with proportions of 500 simulations for each population size for the combinations of non-detected $\overline{D}$ and detected D QTL. C2·4 is on the left and C2·8 on the top of each 2 × 2 table. The standard error for a proportion can be based on binomial distribution, that is $\sqrt {\hat{p}\lpar 1 \minus \hat{p}\rpar \sol 500} $. For $\hat{p} \equals 0{\cdot}1$ (and hence $\hat{p} \equals 0{\cdot}9$) the standard error is 0·013, for $\hat{p} \equals 0{\cdot}3$ the standard error is 0·020 and for $\hat{p} \equals 0{\cdot}5$ the standard error is 0·022

Figure 6

Table 5. Proportion of false QTLs in 500 simulations detected for the three methods of analysis. Both linked (putative QTLs are on chromosomes with QTL, C1–C4, C5 has only one interval and cannot lead to false positives) and unlinked (putative QTLs are on chromosomes without QTLs, C6–C9) are presented. The standard error for a proportion can be based on binomial distribution, that is $\sqrt {\hat{p}\lpar 1 \minus \hat{p}\rpar \sol 500} $. For $\hat{p} \equals 0{\cdot}1$ (and hence $\hat{p} \equals 0{\cdot}9$) the standard error is 0·013, for $\hat{p} \equals 0{\cdot}3$ the standard error is 0·020 and for $\hat{p} \equals 0{\cdot}5$ the standard error is 0·022

Figure 7

Table 6. Mean estimated size of QTL effects (true size is 0·378 for all but C1·8 which has a size of −0·378) with empirical standard error (se) and mean square error of prediction (MSEP) for each method across 500 simulations for each population size

Figure 8

Table 7. Estimated variance components for the baseline model and the final WGAIM, WGAIMI and RWGAIM models. ints refers to the model term ME−s.a−s qtl followed by a linkage group refers to the QTL random effect using RWGAIM

Figure 9

Table 8. Putative QTL found using WGAIM, WGAIMI and RWGAIM