Hostname: page-component-6766d58669-nqrmd Total loading time: 0 Render date: 2026-05-20T10:20:19.242Z Has data issue: false hasContentIssue false

A comparison of frequentist and Bayesian methods for meta-analysis of diagnostic test accuracy studies

Published online by Cambridge University Press:  30 April 2026

Xanthoula Rousou*
Affiliation:
Public and One Health, University of Thessaly, School of Health Sciences, Greece
Luis Furuya-Kanamori
Affiliation:
The University of Queensland Centre for Clinical Research, Australia
Polychronis Kostoulas
Affiliation:
Public and One Health, University of Thessaly, Greece
Suhail A. R. Doi
Affiliation:
Population Medicine, Qatar University College of Medicine, Doha, Qatar
Eleftherios Meletis
Affiliation:
Public and One Health, University of Thessaly, School of Health Sciences, Greece
Nikolaos Solomakos
Affiliation:
Veterinary Medicine, University of Thessaly School of Health Sciences, Greece
*
Corresponding author: Xanthoula Rousou; Email: xanrous@yahoo.com
Rights & Permissions [Opens in a new window]

Abstract

Diagnostic test accuracy studies assess a diagnostic test’s performance against a reference standard. In this review, we explore and compare statistical methods used in meta-analyses of diagnostic test accuracy studies. Specifically, we evaluate two frequentist methods – split component synthesis (SCS) and bivariate model (BM) – alongside two Bayesian approaches: Bayesian hierarchical summary receiver operating characteristic (BHSROC) and Bayesian bivariate model (BBM). We also include their latent class variants (LC-BHSROC and LC-BBM). Using a meta-analysis of various multiplex nucleic acid amplification tests (NAATs/PCRs) against Campylobacter spp. as a case study we illustrate the practical applications of these methods. The reference standard was culture, and due to differences in cut-off values and primers among the NAAT/PCR brands, substantial heterogeneity was anticipated. Our findings reveal that the BM and BBM methods tend to estimate higher sensitivities than the other approaches, even when the number of studies is substantial, and heterogeneity is moderate – as observed in this case study. In such scenario, the SCS method or the BHSROC model may offer more robust and reliable outcomes. While our review is based on a real-life meta-analysis rather than simulations, it offers practical insights into the strengths and limitations of these statistical approaches for diagnostic test accuracy studies.

Information

Type
Review
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press
Figure 0

Table 1. Mean and range for the sensitivity (Se) and specificity (Sp) priors of culture and the corresponding beta distributions, beta (a, b)

Figure 1

Table 2. Prior distributions for the prevalence (pi), sensitivity (Se), and specificity (Sp) of multiplex NAAT/PCR (αi, θi, β, Λ, Θ, σ1, σ2)

Figure 2

Figure 1. Sensitivity and specificity estimates with corresponding confidence/credible intervals (error bars) of multiplex NAAT/PCR against Campylobacter spp., compared to culture with or without the assumption of gold standard.Abbreviations: BBM, Bayesian bivariate model; BHSROC, Bayesian hierarchical summary receiver operating characteristic model; BM, Bivariate method; LC_BBM, Latent class Bayesian bivariate model; LC_BHSROC, Latent class Bayesian hierarchical summary receiver operating characteristic model; SCS, split component synthesis.

Figure 3

Table 3. Accuracy estimates with 95% credible intervals (95% CC) from the meta-analysis of the included studies with the Bayesian methods bivariate and the HSROC without the assumption of a gold standard

Figure 4

Figure 2. Sensitivity and specificity estimates with corresponding confidence/credible intervals (error bars) of index test Allplex/Seeplex, Filmarray, BDMax, and Luminex with number of studies 5, 5, 6(7 datasets), and 11(12 datasets), respectively. Heterogeneity (I2) for each subgroup was estimated 45.16%, 0%, 0%, and 62.48%, respectively.Abbreviations: BBM, Bayesian bivariate model; BHSROC, Bayesian hierarchical summary receiver operating characteristic model; BM, Bivariate method; LC_BBM, Latent class Bayesian bivariate model; LC_BHSROC, Latent class Bayesian hierarchical summary receiver operating characteristic model; SCS, split component synthesis.

Supplementary material: File

Rousou et al. supplementary material

Rousou et al. supplementary material
Download Rousou et al. supplementary material(File)
File 282 KB