Hostname: page-component-89b8bd64d-9prln Total loading time: 0 Render date: 2026-05-14T01:08:59.880Z Has data issue: false hasContentIssue false

ZIBGLMM: Zero-inflated bivariate generalized linear mixed model for meta-analysis with double-zero-event studies

Published online by Cambridge University Press:  21 March 2025

Lu Li
Affiliation:
Center for Health Analytics and Synthesis of Evidence, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA Applied Mathematics and Computational Science Graduate Program, University of Pennsylvania, Philadelphia, PA, USA
Lifeng Lin
Affiliation:
Department of Epidemiology and Biostatistics, University of Arizona, Tucson, AZ, USA
Joseph C. Cappelleri
Affiliation:
Statistical Research and Data Science Center, Pfizer Inc, New York, NY, USA
Haitao Chu
Affiliation:
Statistical Research and Data Science Center, Pfizer Inc, New York, NY, USA Division of Biostatistics and Health Data Science, University of Minnesota Twin Cities, Minneapolis, MN, USA
Yong Chen*
Affiliation:
Center for Health Analytics and Synthesis of Evidence, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA Applied Mathematics and Computational Science Graduate Program, University of Pennsylvania, Philadelphia, PA, USA
*
Corresponding author: Yong Chen; Email: ychen123@pennmedicine.upenn.edu
Rights & Permissions [Opens in a new window]

Abstract

Double-zero-event studies (DZS) pose a challenge for accurately estimating the overall treatment effect in meta-analysis (MA). Current approaches, such as continuity correction or omission of DZS, are commonly employed, yet these ad hoc methods can yield biased conclusions. Although the standard bivariate generalized linear mixed model (BGLMM) can accommodate DZS, it fails to address the potential systemic differences between DZS and other studies. In this article, we propose a zero-inflated bivariate generalized linear mixed model (ZIBGLMM) to tackle this issue. This two-component finite mixture model includes zero inflation for a subpopulation with negligible or extremely low risk. We develop both frequentist and Bayesian versions of ZIBGLMM and examine its performance in estimating risk ratios against the BGLMM and conventional two-stage MA that excludes DZS. Through extensive simulation studies and real-world MA case studies, we demonstrate that ZIBGLMM outperforms the BGLMM and conventional two-stage MA that excludes DZS in estimating the true effect size with substantially less bias and comparable coverage probability.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of The Society for Research Synthesis Methodology
Figure 0

Figure 1 Estimated effect size differences using four methods, that is, bivariate generalized linear mixed model (BGLMM) including double-zero-event studies (DZS), BGLMM excluding DZS, conventional two-stage meta-analysis (MA) excluding DZS (MA), and zero-inflated bivariate generalized linear mixed model (ZIBGLMM) including DZS. In each subfigure, the y-axis is the difference in log risk ratios (RRs) between the two methods, and the x-axis is the average of the log RRs of the two methods being compared. Subfigure (a) contrasts the BGLMM with and without DZS. Subfigure (b) explores the difference in effect size between ZIBGLMM and MA, both excluding DZS. Subfigure (c) presents a similar comparison between ZIBGLMM including DZS and BGLMM excluding DZS. Subfigures (a–c) shed light on how the inclusion or exclusion of DZS significantly impacts effect sizes based on 1,111 Cochrane meta-analyses. Subfigure (d) provides a comparison between ZIBGLMM and BGLMM, both incorporating DZS. It illustrates a more concentrated distribution, signifying a smaller difference in log RR. Each subfigure displays the number of MAs lying outside the 95% limits of agreement, the mean difference between log RRs, 95% limits of agreements, and 99% range of the averages of log RRs at its upper left corner.

Figure 1

Table 1 Motivation example study data33

Figure 2

Table 2 Descriptions of notation

Figure 3

Figure 2 PRISMA plot of the 1,111 Cochrane Database of Systematic Reviews meta-analyses included in this article. We extracted 1,111 studies with sample sizes between 10 and 50 and with double-zero ratios between 0.15 and 0.4 from a total of 72,716 studies.

Figure 4

Table 3 Summary of the comparative analysis of different methods in Figure 1

Figure 5

Figure 3 Model goodness of fit in terms of Akaike information criterion (AIC) and deviance information criterion (DIC) differences between the ZIBGLMM and BGLMM methods for simulation meta-analyses (MAs) in subfigure (a) and Cochrane MAs in subfigure (b). Each violin plot66 included a box plot where the box limits indicated the range of the central 50% of the data (i.e., the range between the 25th and 75th percentiles), and the median value was marked by a central black line, along with a kernel smoothed density plot representing the probability distribution. The y-axis in each subplot represents the goodness of fit of ZIBGLMM subtracting the goodness of fit of BGLMM. The dashed line represents when the difference is 0. Subfigure (a) illustrates the distribution of AIC and DIC differences for 18,000 simulation datasets. The ZIBGLMM exhibited superior fit for 8,839 of 16,215 (54.51%) simulation studies as measured by AIC, and for 17,805 out of 18,000 (98.92%) simulation studies when measured by DIC. Subfigure (b) illustrates the distribution of AIC and DIC differences for 1,111 Cochrane MAs. The ZIBGLMM exhibited superior fit for 365 of 1,010 (36.13%) Cochrane MAs as measured by AIC and for 986 out of 1,111 (88.74%) Cochrane MAs when measured by DIC.

Figure 6

Table 4 Specifications for the simulation studies

Figure 7

Table 5 Coverage probability of the five methods: two-stage meta-analysis excluding DZS (MA), BGLMM, Bayesian BGLMM, ZIBGLMM, and Bayesian ZIBGLMM

Figure 8

Figure 4 Coverage probability and mean CI length of conventional two-stage meta-analysis (MA), bivariate generalized linear mixed models (BGLMM), Bayesian BGLMM, zero-inflated bivariate generalized linear mixed models (ZIBGLMM), and Bayesian ZIBGLMM for meta-analyses with 10 studies (a), 25 studies (b), and 50 studies (c). The y-axis displays the coverage probability, and the x-axis displays the mean 95% confidence/credible interval widths. The number of studies in an MA is denoted by n. The Bayesian BGLMM and Bayesian ZIBGLMM displayed comparable, consistently high coverage probabilities to MA, while maintaining the shortest mean credible interval widths across all settings. The coverage probabilities for all methods decreased as the average marginal risk ratio increased and as the size of studies increased.

Figure 9

Figure 5 Bias in the estimation of risk ratio from conventional two-stage meta-analysis excluding DZS (MA), alongside the frequentist and Bayesian versions of BGLMM and ZIBGLMM. For meta-analyses (MAs) with a smaller size (10 studies), the frequentist ZIBGLMM exhibits the least bias. For MAs with a moderate size (25 studies), Bayesian BGLMM manifests the least bias. The Bayesian BGLMM and ZIBGLMM archives the smallest bias for large (50 studies) MAs. Frequentist ZIBGLMM consistently demonstrates less bias than the frequentist BGLMM. For all settings, both frequentist and Bayesian BGLMM and ZIBGLMM consistently yield smaller biases compared to MA.

Figure 10

Figure 6 Bias in the estimation of proportion of zero inflation $\pi $. Bayesian ZIBGLMM produces an estimate of the proportion of zero inflation with a smaller bias than the frequentist ZIBGLMM across all study sizes and proportions of zero inflation. However, it tends to overestimate the proportion of zero inflation for larger meta-analyses (MAs) (50 studies). In addition, Bayesian ZIBGLMM yields better estimates when the proportion of zero inflation is 50% than when it is 25%. The frequentist ZIBGLMM appears to have a more substantial bias as the size of the MAs grows.

Supplementary material: File

Li et al. supplementary material

Li et al. supplementary material
Download Li et al. supplementary material(File)
File 52.4 KB