Hostname: page-component-89b8bd64d-rbxfs Total loading time: 0 Render date: 2026-05-07T13:48:39.472Z Has data issue: false hasContentIssue false

A novel robust meta-analysis model using the t distribution for outlier accommodation and detection

Published online by Cambridge University Press:  13 March 2025

Yue Wang
Affiliation:
School of Statistics and Mathematics, Yunnan University of Finance and Economics, Kunming, China
Jianhua Zhao*
Affiliation:
School of Statistics and Mathematics, Yunnan University of Finance and Economics, Kunming, China
Fen Jiang
Affiliation:
School of Statistics and Mathematics, Yunnan University of Finance and Economics, Kunming, China
Lei Shi
Affiliation:
School of Statistics and Mathematics, Yunnan University of Finance and Economics, Kunming, China
Jianxin Pan
Affiliation:
Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science, BNU-HKBU United International College, Zhuhai, China
*
Corresponding author: Jianhua Zhao; Email: jhzhao.ynu@gmail.com
Rights & Permissions [Opens in a new window]

Abstract

Random effects meta-analysis model is an important tool for integrating results from multiple independent studies. However, the standard model is based on the assumption of normal distributions for both random effects and within-study errors, making it susceptible to outlying studies. Although robust modeling using the t distribution is an appealing idea, the existing work, that explores the use of the t distribution only for random effects, involves complicated numerical integration and numerical optimization. In this article, a novel robust meta-analysis model using the t distribution is proposed (tMeta). The novelty is that the marginal distribution of the effect size in tMeta follows the t distribution, enabling that tMeta can simultaneously accommodate and detect outlying studies in a simple and adaptive manner. A simple and fast EM-type algorithm is developed for maximum likelihood estimation. Due to the mathematical tractability of the t distribution, tMeta frees from numerical integration and allows for efficient optimization. Experiments on real data demonstrate that tMeta is compared favorably with related competitors in situations involving mild outliers. Moreover, in the presence of gross outliers, while related competitors may fail, tMeta continues to perform consistently and robustly.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of The Society for Research Synthesis Methodology
Figure 0

Figure 1 Top row: forest plots on two datasets: (a) Mag and (b) Hipfrac, where each effect size$y_i$and 95% confidence interval are shown as circle and solid line, respectively. Bottom row: evolement of log-likelihood of$\mathcal {L}$versus number of iterations: (c) Mag and (d) Hipfrac.

Figure 1

Table 1 Results of parameter estimates, negative log-likelihood, and CPU time (in seconds) by various methods on Mag dataset

Figure 2

Figure 2 Results on outlier detection by various methods on Mag dataset: (a) tMeta; (b) tRE-Meta; (c) MIX-Meta; (d) SYM-Meta; (e) SKM-Meta. The marker solid point$\bullet $in blue represents normal studies judged by a method.

Figure 3

Table 2 Results of parameter estimates, negative log-likelihood, BIC, and CPU time (in seconds) by various methods on Hipfrac dataset

Figure 4

Figure 3 Results on outlier detection by various methods on Hipfrac dataset: (a) tMeta; (b) tRE-Meta; (c) MIX-Meta; (d) SYM-Meta; (e) SKM-Meta. The vertical line indicates the critical value for tMeta (red) and the threshold 0.9 (magenta) for MIX-Meta. The vertical line indicates the critical value for tMeta and the threshold 0.9 for MIX-Meta. The marker solid point$\bullet $in blue represents normal studies judged by a method. Star ‘*’ signals outlying studies, with red for tMeta and magenta for the other methods.

Figure 5

Figure 4 Top row: forest plots on the fluoride toothpaste dataset: (a) Flu and (b) modified Flu, where each effect size$y_i$and 95% confidence interval are shown as circle and solid line, respectively. Bottom row: evolement of log-likelihood of$\mathcal {L}$versus number of iterations: (c) Flu and (d) modified Flu.

Figure 6

Table 3 Results by various methods on the original and modified fluoride toothpaste dataset, including parameter estimates, negative log-likelihood, BIC, and CPU time (in seconds)

Figure 7

Figure 5 Results on outlier detection by various methods on fluoride toothpaste dataset. Top row: the original dataset; Bottom row: the modified dataset. (a), (f) tMeta; (b), (g) tRE-Meta; (c), (h) MIX-Meta; (d), (i) SYM-Meta; (e), (j) SKM-Meta. The vertical line indicates the critical value for tMeta and the threshold 0.9 for MIX-Meta. The marker solid point$\bullet $in blue represents normal studies judged by a method. Star ‘*’ signals outlying studies, with red for tMeta and magenta for the other methods.

Figure 8

Figure 6 Top row: forest plots on CDP-choline dataset: (a) original dataset; (b) modified dataset. Bottom row: evolement of log-likelihood of$\mathcal {L}$versus number of iterations: (c) original dataset and (d) modified dataset.

Figure 9

Figure 7 Results on outlier detection by various methods on CDP-choline dataset. Top row: the original dataset; Bottom row: the modified dataset. (a), (f) tMeta; (b), (g) tRE-Meta; (c), (h) MIX-Meta; (d), (i) SYM-Meta; (e), (j) SKM-Meta. The vertical line indicates the critical value for tMeta and the threshold 0.9 for MIX-Meta. The marker solid point$\bullet $in blue represents normal studies judged by a method. Star ‘*’ signals outlying studies, with red for tMeta and magenta for the other methods.

Figure 10

Table 4 Results by various methods on the CDP-choline dataset, including parameter estimates, negative log-likelihood, BIC, and CPU time (in seconds)

Supplementary material: File

Wang et al. supplementary material

Wang et al. supplementary material
Download Wang et al. supplementary material(File)
File 70.3 KB