Hostname: page-component-77f85d65b8-8v9h9 Total loading time: 0 Render date: 2026-03-28T07:54:53.287Z Has data issue: false hasContentIssue false

Assessing the properties of the prediction interval in random-effects meta-analysis

Published online by Cambridge University Press:  09 January 2026

Péter Mátrai*
Affiliation:
Institute of Bioanalysis, Medical School, University of Pécs, Pécs, Hungary Institute for Translational Medicine, Medical School, University of Pécs, Pécs, Hungary
Tamás Kói
Affiliation:
Department of Stochastics, Institute of Mathematics, Budapest University of Technology and Economics, Budapest, Hungary Centre for Translational Medicine, Semmelweis University, Budapest, Hungary
Zoltán Sipos
Affiliation:
Institute of Bioanalysis, Medical School, University of Pécs, Pécs, Hungary Institute for Translational Medicine, Medical School, University of Pécs, Pécs, Hungary
Nelli Farkas
Affiliation:
Institute of Bioanalysis, Medical School, University of Pécs, Pécs, Hungary Institute for Translational Medicine, Medical School, University of Pécs, Pécs, Hungary
*
Corresponding author: Péter Mátrai; Email: peter.matrai@pte.hu
Rights & Permissions [Opens in a new window]

Abstract

Random-effects meta-analysis is a widely applied methodology to synthesize research findings of studies related to a specific scientific question. Besides estimating the mean effect, an important aim of the meta-analysis is to summarize the heterogeneity, that is, the variation in the underlying effects caused by the differences in study circumstances. The prediction interval is frequently used for this purpose: a 95% prediction interval contains the true effect of a similar new study in 95% of the cases when it is constructed, or in other words, it covers 95% of the true effects distribution on average in repeated sampling. In this article, after providing a clear mathematical background, we present an extensive simulation investigating the performance of all frequentist prediction interval methods published to date. The work focuses on the distribution of the coverage probabilities and how these distributions change depending on the amount of heterogeneity and the number of involved studies. Although the single requirement that a prediction interval has to fulfill is to keep a nominal coverage probability on average, we demonstrate why the distribution of coverages should not be disregarded. We show that for meta-analyses with small number of studies, this distribution has an unideal, asymmetric shape. We argue that assessing only the mean coverage can easily lead to misunderstanding and misinterpretation. The length of the intervals and the robustness of the methods concerning the non-normality of the true effects are also investigated.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Open Practices
Open materials
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of The Society for Research Synthesis Methodology
Figure 0

Table 1 Default and other available prediction interval estimators in common meta-analysis software

Figure 1

Table 2 HTS-type prediction interval estimators tested in the simulation

Figure 2

Figure 1 Histograms showing the coverage probability distribution of the HTS-DL (tK-2), the parametric bootstrap, and the ensemble prediction interval methods for a low heterogeneity simulation scenario (N = 100, τ2 = 0.2, I2 = 33%, v = 0.5). The vertical green lines on the histograms indicate 95% coverage probability. The number of involved studies is constant in each column, showing the distribution for 5, 10, 30, and 100 studies. Results for the HTS-DL (tK-2) method are represented in the first row of histograms with letters (a), (b), (c), and (d), the parametric bootstrap method is represented in the second row of histograms with letters (e), (f), (g), and (h), and the ensemble method is represented in the third row of histograms with letters (i), (j), (k), and (l).Abbreviations: HTS-DL (tK-2), Higgins–Thompson–Spiegelhalter method with the DerSimonian and Laird estimation of ${\tau}^2$ parameter and using the t distribution with K-2 degrees of freedom.

Figure 3

Figure 2 Histograms showing the coverage probability distribution of the HTS-HKSJ (tK-2), the HTS-DL (tK-1), and the HTS-DL (z) prediction interval methods for a low heterogeneity simulation scenario (N = 100, τ2 = 0.2, I2 = 33%, v = 0.5). The vertical green lines on the histograms indicate 95% coverage probability. The number of involved studies is constant in each column, showing the distribution for 5, 10, 30, and 100 studies. Results for the HTS-HKSJ (tK-2) method are represented in the first row of histograms with letters (a), (b), (c), and (d), the HTS-DL (tK-1) method is represented in the second row of histograms with letters (e), (f), (g), and (h), and the HTS-DL (z) method is represented in the third row of histograms with letters (i), (j), (k), and (l).Abbreviations: HTS-HKSJ (tK-2), Higgins–Thompson–Spiegelhalter method with the Hartung–Knapp–Sidik–Jonkman variance estimation for the μ parameter and using the t distribution with K–2 degrees of freedom; HTS-DL (tK-1), Higgins–Thompson–Spiegelhalter method with the DerSimonian and Laird estimation of ${\tau}^2$ parameter and using the t distribution with K–1 degrees of freedom; HTS-DL (z), Higgins–Thompson–Spiegelhalter method with the DerSimonian and Laird estimation of ${\tau}^2$ parameter and using the standard normal distribution.

Figure 4

Figure 3 Mean coverage probability (a), Median coverage probability (b), and Mean observed length relative to the theoretical length (c) of the investigated prediction interval methods as a function of the number of involved studies (horizontal axis) for a low heterogeneity simulation scenario (N = 100, τ2 = 0.2, I2 = 33%, v = 0.5).Abbreviations: HTS, Higgins–Thompson–Spiegelhalter method; DL, DerSimonian and Laird estimation of ${\unicode{x3c4}}^2$ parameter; HKSJ, Hartung–Knapp–Sidik–Jonkman variance estimation for the μ parameter; tK-2, method calculated with t distribution with K-2 degrees of freedom; z, method calculated with standard normal distribution.

Figure 5

Figure 4 Histograms showing the coverage probability distribution of the HTS-DL (tK-2), the parametric bootstrap, and the ensemble prediction interval methods for a high heterogeneity simulation scenario (N = 100, τ2 = 1, I2 = 71%, v = 2.5). The vertical green lines on the histograms indicate 95% coverage probability. The number of involved studies is constant in each column, showing the distribution for 5, 10, 30, and 100 studies. Results for the HTS-DL (tK-2) method are represented in the first row of histograms with letters (a), (b), (c), and (d), the parametric bootstrap method is represented in the second row of histograms with letters (e), (f), (g), and (h), and the ensemble method is represented in the third row of histograms with letters (i), (j), (k), and (l).Abbreviations: HTS-DL (tK-2), Higgins–Thompson–Spiegelhalter method with the DerSimonian and Laird estimation of ${\unicode{x3c4}}^2$ parameter and using the t distribution with K-2 degrees of freedom.

Figure 6

Figure 5 Histograms showing the coverage probability distribution of the HTS-HKSJ (tK-2), the HTS-DL (tK-1), and the HTS-DL (z) prediction interval methods for a high heterogeneity simulation scenario (N = 100, τ2 = 1, I2 = 71%, v = 2.5). The vertical green lines on the histograms indicate 95% coverage probability. The number of involved studies is constant in each column, showing the distribution for 5, 10, 30, and 100 studies. Results for the HTS-HKSJ (tK-2) method are represented in the first row of histograms with letters (a), (b), (c), and (d), the HTS-DL (tK-1) method is represented in the second row of histograms with letters (e), (f), (g), and (h), and the HTS-DL (z) method is represented in the third row of histograms with letters (i), (j), (k), and (l).Abbreviations: HTS-HKSJ (tK-2), Higgins–Thompson–Spiegelhalter method with the Hartung–Knapp–Sidik–Jonkman variance estimation for the μ parameter and using the t distribution with K-2 degrees of freedom; HTS-DL (tK-1), Higgins–Thompson–Spiegelhalter method with the DerSimonian and Laird estimation of ${\unicode{x3c4}}^2$ parameter and using the t distribution with K-1 degrees of freedom; HTS-DL (z), Higgins–Thompson–Spiegelhalter method with the DerSimonian and Laird estimation of ${\unicode{x3c4}}^2$ parameter and using the standard normal distribution.

Figure 7

Figure 6 Mean coverage probability (a), Median coverage probability (b), and Mean observed length relative to the theoretical length (c) of the investigated prediction interval methods as a function of the number of involved studies (horizontal axis) for a high heterogeneity simulation scenario (N = 100, τ2 = 1, I2 = 71%, v = 2.5).Abbreviations: HTS, Higgins–Thompson–Spiegelhalter method; DL, DerSimonian and Laird estimation of ${\unicode{x3c4}}^2$ parameter; HKSJ, Hartung–Knapp–Sidik–Jonkman variance estimation for the μ parameter; tK-2, method calculated with t distribution with K-2 degrees of freedom; z, method calculated with standard normal distribution.

Figure 8

Figure 7 Forest plot showing the short-term efficacy of the active agent vortioxetine 10 mg in adult patients suffering from major depressive disorder based on the meta-analysis of Thase and his co-authors.35Abbreviations: N, total sample size; MD, mean difference; SE, standard error; CI, confidence interval; PI, prediction interval; MADRS, Montgomery–Åsberg Depression Rating Scale; HTS, Higgins–Thompson–Spiegelhalter method; REML, Restricted maximum likelihood estimation of ${\unicode{x3c4}}^2$ parameter; DL, DerSimonian, and Laird estimation of ${\unicode{x3c4}}^2$ parameter; HKSJ, Hartung–Knapp–Sidik–Jonkman variance estimation for the μ parameter; tK-2, method calculated with t distribution with K-2 degrees of freedom; z, method calculated with standard normal distribution.

Supplementary material: File

Mátrai et al. supplementary material

Mátrai et al. supplementary material
Download Mátrai et al. supplementary material(File)
File 1.2 MB