Hostname: page-component-89b8bd64d-4ws75 Total loading time: 0 Render date: 2026-05-08T22:21:03.289Z Has data issue: false hasContentIssue false

Sample size estimation to substantiate freedom from disease for clustered binary data with a specific risk profile

Published online by Cambridge University Press:  07 September 2012

P. KOSTOULAS*
Affiliation:
Laboratory of Epidemiology, Biostatistics and Animal Health Economics, University of Thessaly, Karditsa, Greece
S. S. NIELSEN
Affiliation:
Department of Large Animal Sciences, Faculty of Health and Medical Sciences, University of Copenhagen, Frederiksberg, Denmark
W. J. BROWNE
Affiliation:
School of Clinical Veterinary Sciences, University of Bristol, Bristol, UK
L. LEONTIDES
Affiliation:
Laboratory of Epidemiology, Biostatistics and Animal Health Economics, University of Thessaly, Karditsa, Greece
*
*Author for correspondence: Dr P. Kostoulas, Laboratory of Epidemiology, Biostatistics and Animal Health Economics, School of Veterinary Medicine, University of Thessaly, 224 Trikalon St, 43100 Karditsa, Greece. (Email: pkost@vet.uth.gr)
Rights & Permissions [Opens in a new window]

Summary

Disease cases are often clustered within herds or generally groups that share common characteristics. Sample size formulae must adjust for the within-cluster correlation of the primary sampling units. Traditionally, the intra-cluster correlation coefficient (ICC), which is an average measure of the data heterogeneity, has been used to modify formulae for individual sample size estimation. However, subgroups of animals sharing common characteristics, may exhibit excessively less or more heterogeneity. Hence, sample size estimates based on the ICC may not achieve the desired precision and power when applied to these groups. We propose the use of the variance partition coefficient (VPC), which measures the clustering of infection/disease for individuals with a common risk profile. Sample size estimates are obtained separately for those groups that exhibit markedly different heterogeneity, thus, optimizing resource allocation. A VPC-based predictive simulation method for sample size estimation to substantiate freedom from disease is presented. To illustrate the benefits of the proposed approach we give two examples with the analysis of data from a risk factor study on Mycobacterium avium subsp. paratuberculosis infection, in Danish dairy cattle and a study on critical control points for Salmonella cross-contamination of pork, in Greek slaughterhouses.

Information

Type
Original Papers
Copyright
Copyright © Cambridge University Press 2012
Figure 0

Fig. 1. Flow diagram illustrating the proposed modelling approach for sample size estimation to substantiate freedom from disease. VPC, Variance partition coefficient.

Figure 1

Table 1. Estimated VPCs (medians and 95% credible intervals) and predicted risk of Mycobacterium avium subsp. paratuberculosis (MAP) infection for subgroups of herds that share common characteristics – common covariate pattern. Estimations were based on models that adjusted for the age-specific diagnostic accuracy of the milk ELISA for MAP. Estimates from an intercept-only model are also given

Figure 2

Table 2. Medians (95% credible intervals) of VPCs and predicted risk of carcass cross-contamination with Salmonella. Significant fitted covariates are the daily Salmonella status of the eviscerator and the trimmer. Estimates from an intercept-only model are also given

Figure 3

Table 3. Estimated number of herds (j) required for sampling to substantiate freedom from (i) MAP infection (assuming a within-herd sample of i=50 animals) and (ii) Salmonella cross-contamination (assuming a within-day sample of i=30 pork carcasses), for selected covariate patterns that exhibit markedly greater or less heterogeneity. Estimates are based on the analysis of 100 simulated datasets for each considered combination of i and j. For each selected covariate pattern, priors on the minimum expected prevalence (μ) were based on the mean predicted risk of MAP infection and carcass cross-contamination for (i) and (ii), respectively. Priors on the variability of the within-herd prevalence (ψ) were based on the corresponding VPC estimates. Estimates from an intercept-only model that ignored the covariate-pattern-specific heterogeneity and correspond to the whole population are also given for comparisons

Supplementary material: File

Kostoulas Supplementary Material

Supplementary Material.doc

Download Kostoulas Supplementary Material(File)
File 28.7 KB