Hostname: page-component-89b8bd64d-r6c6k Total loading time: 0 Render date: 2026-05-08T02:17:43.708Z Has data issue: false hasContentIssue false

Detecting association of rare and common variants by adaptive combination of P-values

Published online by Cambridge University Press:  06 October 2015

YAJING ZHOU
Affiliation:
Department of Mathematics, School of Science, Harbin Institute of Technology, Harbin 150001, China School of Mathematical Sciences, Heilongjiang University, Harbin 150080, China
YONG WANG*
Affiliation:
Department of Mathematics, School of Science, Harbin Institute of Technology, Harbin 150001, China
*
* Corresponding author:Harbin Institute of Technology, No. 92 Xidazhi Street, Nangang District, Harbin 150001, P. R. China. E-mail: mathwy@hit.edu.cn
Rights & Permissions [Opens in a new window]

Summary

Genome-wide association studies (GWAS) can detect common variants associated with diseases. Next generation sequencing technology has made it possible to detect rare variants. Most of association tests, including burden tests and nonburden tests, mainly target rare variants by upweighting rare variant effects and downweighting common variant effects. But there is increasing evidence that complex diseases are caused by both common and rare variants. In this paper, we extend the ADA method (adaptive combination of P-values; Lin et al., 2014) for rare variants only and propose a RC-ADA method (common and rare variants by adaptive combination of P-values). Our proposed method combines the per-site P-values with the weights based on minor allele frequencies (MAFs). The RC-ADA is robust to directions of effects of causal variants and inclusion of a high proportion of neutral variants. The performance of the RC-ADA method is compared with several other association methods. Extensive simulation studies show that the RC-ADA method is more powerful than other association methods over a wide range of models.

Information

Type
Research Papers
Copyright
Copyright © Cambridge University Press 2015 
Figure 0

Table 1. The estimated type I error rates for all tests.

Figure 1

Fig. 1. Power comparisons of five tests for different percentages of deleterious rare variants based on case 1. RVT represents the rare variant threshold for (a) 0·01, (b) 0·03 and (c) 0·05. x-axis represents the percentage of deleterious rare variants. Sample size is 1000. Power is estimated at the 0·05 significance level.

Figure 2

Fig. 2. Power comparisons of five tests for different percentages of deleterious rare variants based on case 2. RVT represents the rare variant threshold for (a) 0·01, (b) 0·03 and (c) 0·05. x-axis represents the percentage of deleterious rare variants. Sample size is 1000. Power is estimated at the 0·05 significance level.

Figure 3

Fig. 3. Power comparisons of five tests for different percentages of neutral variants among all rare variants based on case 1. RVT represents the rare variant threshold for (a) 0·01, (b) 0·03 and (c) 0·05. A total of 80% of rare causal variants are deleterious variants. x-axis represents the percentage of neutral variants among all rare variants. Sample size is 1000. Power is estimated at the 0·05 significance level.

Figure 4

Fig. 4. Power comparisons of five tests for different sample sizes based on case 1. RVT represents the rare variant threshold for (a) 0·01, (b) 0·03 and (c) 0·05. A total of 80% of rare causal variants are deleterious variants. x-axis represents sample sizes. Power is estimated at the 0·05 significance level.

Figure 5

Table 2. Power(%) of RC-ADA method with three sets of candidate truncation thresholds.