Hostname: page-component-77f85d65b8-6bnxx Total loading time: 0 Render date: 2026-03-27T21:20:59.851Z Has data issue: false hasContentIssue false

Diagnostic test accuracy meta-analysis based on an exact within-study variance calculation method

Published online by Cambridge University Press:  04 February 2026

Olana Angesa Dabi
Affiliation:
Department of Statistics and Actuarial Science, University of Waterloo , Canada
Zelalem Firisa Negeri*
Affiliation:
Department of Statistics and Actuarial Science, University of Waterloo , Canada
*
Corresponding author: Zelalem Firisa Negeri; Email: znegeri@uwaterloo.ca
Rights & Permissions [Opens in a new window]

Abstract

A meta-analysis of diagnostic test accuracy (DTA) studies typically synthesizes study-specific test sensitivity ($Se$) and specificity ($Sp$) to quantify the accuracy of an index test of interest. The bivariate linear mixed effects model with logit transformation of $Se$ and $Sp$ (BLMM-Logit) is commonly used to make statistical inferences, but may lead to misleading results due to the need for Haldane–Anscombe correction and an approximate estimation of variance within the study. Alternative models based on the arcsine square root and Freeman–Tukey double arcsine transformation have been proposed to address these issues; however, they still rely on approximate variance estimation, which is suitable only for large sample sizes. The bivariate generalized linear mixed effects model (BGLMM) is another option, but it faces convergence issues with small meta-analyses or sparse primary studies. To address these limitations, we proposed an exact within-study variance calculation method that does not require Haldane–Anscombe correction and is applicable regardless of the transformation used or the number of studies and participants. We evaluated this method against existing approaches using real-life and simulated DTA meta-analyses. The methods were comparable for large meta-analyses. However, BLMM-Logit demonstrated substantial negative bias in estimating variances between studies and consistently underestimated summary $Se$ and $Sp$ in all simulation scenarios. In contrast, the proposed exact methods (Exact-Logit, Exact-ASR, and Exact-FTDA) and BGLMM had minimal bias and better performance metrics, particularly for meta-analyses with sparse primary studies. Thus, the proposed exact methods should be preferred for DTA meta-analyses with small or sparse studies.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - ND
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NoDerivatives licence (https://creativecommons.org/licenses/by-nd/4.0), which permits re-use, distribution, and reproduction in any medium, provided that no alterations are made and the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of The Society for Research Synthesis Methodology
Figure 0

Figure 1 Forest plot for sensitivity (left) and specificity (right) of the US-Children data obtained after a Haldane–Anscombe correction is applied.

Figure 1

Figure 2 Forest plot for sensitivity (left) and specificity (middle) of the Mini-Mental State Examination (MMSE) data obtained without applying a Haldane–Anscombe correction.

Figure 2

Table 1 Data structure of a diagnostic test result for a single study

Figure 3

Table 2 True parameter settings for the simulation study

Figure 4

Figure 3 Bias for sensitivity (Se) and specificity (Sp) when $\mathcal {\sigma }_1^2=1.59$, $\mathcal {\sigma }_{12}=-0.03$, $\sigma _2^2=1.83$, $n_1=50,$ and $n_2=100$.

Figure 5

Figure 4 RMSE for sensitivity (Se) and specificity (Sp) when $\mathcal {\sigma }_1^2=1.59$, $\mathcal {\sigma }_{12}=-0.03$, $\sigma _2^2=1.83$, $n_1=50,$ and $n_2=100$.

Figure 6

Figure 5 Coverage probability for sensitivity (Se) and specificity (Sp) when $\mathcal {\sigma }_1^2=1.59$, $\mathcal {\sigma }_{12}=-0.03$, $\mathcal {\sigma }_2^2=1.83$, $n_1=50,$ and $n_2=100$.

Figure 7

Figure 6 CI widths for sensitivity ($Se$) and specificity ($Sp$) when $\mathcal {\sigma }_1^2=1.59$, $\mathcal {\sigma }_{12}=-0.03$, $\mathcal {\sigma }_2^2=1.83$, $n_1=50,$ and $n_2=100$.

Figure 8

Figure 7 Bias for $\mathcal {\sigma }_1^2$ and $\mathcal {\sigma }_2^2 $ when the true $\mathcal {\sigma }_1^2=1.59$, $\mathcal {\sigma }_{12}=-0.03$, $\mathcal {\sigma }_2^2=1.83$, $n_1=50,$ and $n_2=100$.

Figure 9

Figure 8 RMSE for $\mathcal {\sigma }_1^2$ and $\mathcal {\sigma }_2^2 $ when the true $\sigma _1^2=1.59$, $\mathcal {\sigma }_{12}=-0.03$, $\sigma _2^2=1.83$, $n_1=50,$ and $n_2=100$.

Figure 10

Figure 9 Bias and RMSE for $\mathcal {\sigma }_{12}$ when the true $\mathcal {\sigma }_1^2=1.59$, $\mathcal {\sigma }_{12}=-0.03$, $\mathcal {\sigma }_2^2=1.83$, $n_1=50,$ and $n_2=100$.

Figure 11

Table 3 Estimates of mean Se and mean Sp with their corresponding 95% CI and estimates of the between-study heterogeneity parameters for the US-Children dataset

Figure 12

Figure 10 SROC curves along with their AUCs and 95% confidence and prediction regions based on the BLMM, BGLMM, Exact-Logit, Exact-ASR, and Exact-FTDA methods for the US-Children dataset.

Figure 13

Table 4 Estimates of mean Se and mean Sp with their corresponding 95% CI and estimates of the between-study heterogeneity parameters for the MMSE dataset

Figure 14

Figure 11 SROC curves along with their AUCs and 95% confidence and prediction regions for the BLMM, BGLMM, Exact-Logit, Exact-ASR, and Exact-FTDA methods for the MMSE dataset.

Supplementary material: File

Dabi and Negeri supplementary material

Dabi and Negeri supplementary material
Download Dabi and Negeri supplementary material(File)
File 2.4 MB