Hostname: page-component-77f85d65b8-zzw9c Total loading time: 0 Render date: 2026-03-28T01:38:27.358Z Has data issue: false hasContentIssue false

Optimal sample sizes for group testing in two-stage sampling

Published online by Cambridge University Press:  28 November 2014

Osval Antonio Montesinos-López*
Affiliation:
Facultad de Telemática, Universidad de Colima, Avenida Universidad 333, Col. Las Víboras, C.P. 28040 Colima, Colima, México
Kent Eskridge
Affiliation:
Department of Statistics, University of Nebraska, Lincoln, Nebraska, USA
Abelardo Montesinos-López
Affiliation:
Departamento de Estadística, Centro de Investigación en Matemáticas (CIMAT), Guanajuato, Guanajuato, México
José Crossa
Affiliation:
Biometrics and Statistics Unit, Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, Mexico, D.F., Mexico
*
*Correspondence E-mail: oamontes1@ucol.mx
Rights & Permissions [Opens in a new window]

Abstract

Optimal sample sizes under a budget constraint for estimating a proportion in a two-stage sampling process have been derived using individual testing. However, when group testing is used, these optimal sample sizes are not appropriate. In this study, optimal sample sizes at the cluster and individual levels are derived for group testing. First, optimal allocations of clusters and individuals are obtained under the assumption of equal cluster sizes. Second, we obtain the relative efficiency (RE) of unequal versus equal cluster sizes when estimating the average population proportion, $$\tilde {>\pi } $$ . By multiplying the sample of clusters obtained assuming equal cluster size by the inverse of the RE, we adjust the sample size required in the context of unequal cluster sizes. We also show the adjustments that need to be made to allocate clusters and individuals correctly in order to estimate the required budget and achieve a certain power or precision.

Information

Type
Research Papers
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited
Copyright
Copyright © Cambridge University Press 2014
Figure 0

Figure 1 Ratio of the number of clusters and number of pools per cluster. (a) Ratio of the number of clusters and number of pools per cluster m/g as a function of the proportion ($$\tilde {>\pi } $$), for $$C = 10,000 $$, $$c _{1} = 250, c _{2} = 800, s = 10, S _{ e } = 0.98, S _{ p } = 0.96 $$ and five different values of $$\sigma _{ b }^{2} $$. (b) Ratio of m/g as a function of $$c _{2} $$, for $$C = 10,000 $$, $$\sigma _{ b }^{2} = 0.5, \tilde {>\pi } = 0.05 $$, $$s = 10, S _{ e } = 0.98, S _{ p } = 0.96 $$ and several values of $$c _{1} $$. (c) Required number of clusters, m, as a function of the desired confidence interval width $$\left ( \omega \right ) $$, for $$c _{1} = 50, c _{2} = 1600, \tilde {>\pi } = 0.05, s = 10, S _{ e } = 0.98, S _{ p } = 0.96 $$, and five different values of $$\sigma _{ b }^{2} $$. (d) Required number of clusters, m, as a function of the desired power $$(1 - \gamma ) $$, for $$c _{1} = 50, c _{2} = 1600, \tilde {>\pi } = 0.04, d = 0.015, s = 10, S _{ e } = 0.98,S_{ p } = 0.96, \alpha = 0.05 $$ and five different values of $$\sigma _{ b }^{2} $$.

Figure 1

Table 1 Cluster size distribution used for calculating relative efficiency

Figure 2

Figure 2 Relative efficiency of unequal versus equal cluster sizes as a function of the intraclass correlation $$( \rho ) $$ for four distributions of cluster size: (a) uniform, (b) unimodal, (c) bimodal and (d) positively skewed distribution.

Figure 3

Table 2 Optimal sample sizes (g and m) for group testing for two stages given a pool size that minimizes the variance of the proportion $$( \circ {>\pi } ) $$

Figure 4

Table 3 Optimal sample sizes (g and m) for confidence interval estimation using group testing in two stages given a pool size

Figure 5

Table 4 Optimal sample sizes (g and m) for power estimation using group testing in two stages given a pool size