Hostname: page-component-5db58dd55d-pjp64 Total loading time: 0 Render date: 2026-05-31T11:07:02.518Z Has data issue: false hasContentIssue false

Robust Inference for Generalized Linear Mixed Models: A “Two-Stage Summary Statistics” Approach Based on Score Sign Flipping

Published online by Cambridge University Press:  03 January 2025

Angela Andreella*
Affiliation:
University of Trento, Italy
Jelle Goeman
Affiliation:
Leiden University Medical Center, The Netherlands
Jesse Hemerik
Affiliation:
Erasmus University Rotterdam, The Netherlands
Livio Finos
Affiliation:
University of Padova, Italy
*
Corresponding author: Angela Andreella; Email: angela.andreella@unitn.it
Rights & Permissions [Opens in a new window]

Abstract

Despite the versatility of generalized linear mixed models in handling complex experimental designs, they often suffer from misspecification and convergence problems. This makes inference on the values of coefficients problematic. In addition, the researcher’s choice of random and fixed effects directly affects statistical inference correctness. To address these challenges, we propose a robust extension of the “two-stage summary statistics” approach using sign-flipping transformations of the score statistic in the second stage. Our approach efficiently handles within-variance structure and heteroscedasticity, ensuring accurate regression coefficient testing for 2-level hierarchical data structures. The approach is illustrated by analyzing the reduction of health issues over time for newly adopted children. The model is characterized by a binomial response with unbalanced frequencies and several categorical and continuous predictors. The proposed approach efficiently deals with critical problems related to longitudinal nonlinear models, surpassing common statistical approaches such as generalized estimating equations and generalized linear mixed models.

Information

Type
Application and Case Studies - Original
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Psychometric Society
Figure 0

Figure 1 Estimated type I error considering $N \in \{10,50,100\}$ number of subjects and $n_j \in \{5,10,25,50\}$ repeated measurements. Each line represents one model, and the grey area around the dashed black line represents the $0.95$ confidence bound for $\alpha = 0.05$. The term “GLMM” refers to the GLMM with random intercept and random slope specified, whereas “misspecified GLMM” refers to the GLMM with solely the random intercept included in the model specification. The term “flip2sss” refers to the results using the maximum likelihood estimator in the first stage, while the term “flip2sss Firth” refers to the results using the bias correction proposed by Firth (1995). Finally, the terms GLM and GEE refer to a GLM with a logit link function and a GEE with an independent working correlation matrix, respectively.

Figure 1

Figure 2 Zoom of Figure 1 considering the estimated type I error $\in [0,0.1]$. Estimated type I error considering $N \in \{10,50,100\}$ number of subjects and $n_j \in \{5,10,25,50\}$ repeated measurements. Each line represents one model, and the grey area around the dashed black line represents the $0.95$ confidence bound for $\alpha = 0.05$. The term “GLMM” refers to the GLMM with random intercept and random slope specified, whereas “misspecified GLMM” refers to the GLMM with solely the random intercept included in the model specification. The term “flip2sss” refers to the results using the maximum likelihood estimator in the first stage, while the term “flip2sss Firth” refers to the results using the bias correction proposed by Firth (1995). Finally, the terms GLM and GEE refer to a GLM with a logit link function and a GEE with an independent working correlation matrix, respectively.

Figure 2

Figure 3 Estimated power considering $N \in \{10,50,100\}$ number of subjects and $n_j \in \{5,10,25,50\}$ repeated measurements. Each line represents one model with corresponding colored confidence bounds at $0.95$ level. The term “flip2sss” refers to the results using the maximum likelihood estimator in the first stage, while the term “flip2sss Firth” refers to the results using the bias correction proposed by Firth (1995).

Figure 3

Figure 4 Estimated type I error considering $N \in \{10,50,100\}$ number of subjects and $n_j \sim \text {Pois}(\lambda )$ where $\lambda \in \{5, 10, 25, 50\}$ repeated measurements, i.e., in the case of unbalanced data. Each line represents one model, and the grey area around the dashed black line represents the $0.95$ confidence bound for $\alpha = 0.05$. The term “GLMM” refers to the GLMM with random intercept and random slope specified, whereas “misspecified GLMM” refers to the GLMM with solely the random intercept included in the model specification. The term “flip2sss” refers to the results using the classic GLM in the first stage, while the term “flip2sss Firth” refers to the results using the bias correction proposed by Firth (1995). Finally, the terms GLM and GEE refer to a GLM with a logit link function and a GEE with an independent working correlation matrix, respectively.

Figure 4

Figure 5 Estimated power considering $N \in \{10,50,100\}$ number of subjects and $n_j \sim \text {Pois}(\lambda )$ where $\lambda \in \{5, 10, 25, 50\}$ repeated measurements, i.e., in the case of unbalanced data. Each line represents one model with corresponding colored confidence bounds at $0.95$ level. The term “flip2sss” refers to the results using the classic GLM in the first stage, while the term “flip2sss Firth” refers to the results using the bias correction proposed by Firth (1995).

Figure 5

Table 1 Description of the socio-demographic variables.

Figure 6

Figure 6 Sample proportion of children with an unhealthy status across time for each country. The colored bars indicate a $\approx 0.3$ confidence interval for each time point/country combination. If bars are absent, it indicates that the variance of Unhealth equals $0$.

Figure 7

Table 2 flip2sss model results: summary of the model and related Anova-like table.

Figure 8

Table 3 GLMM model results: summary of the model and related analysis of variance table.

Figure 9

Table 4 GEE results: summary of the model and related analysis of variance table.

Figure 10

Figure A1 Estimated type I error considering $N \in \{10,50,100\}$ number of subjects and $n_j \in \{5,10,25,50\}$ repeated measurements. Each line represents one model, and the grey area around the dashed black line represents the $0.95$ confidence bounds for $\alpha = 0.05$. The term “GLMM” refers to the GLMM with random intercept and random slope specified, whereas “misspecified GLMM” refers to the GLMM with solely the random intercept included in the model specification. The term “flip2sss” refers to the results using the classic GLM in the first stage, while the term “flip2sss Firth” refers to the results using the bias correction proposed by Firth (1995). Finally, the terms GLM and GEE refer to a GLM with a logit link function and a GEE with an independent working correlation matrix, respectively.

Figure 11

Figure A2 Estimated power considering $N \in \{10,50,100\}$ number of subjects and $n_j \in \{5,10,25,50\}$ repeated measurements. Each line represents one model with corresponding colored confidence bounds at level $0.95$. The term “flip2sss” refers to the results using the classic GLM in the first stage, while the term “flip2sss Firth” refers to the results using the bias correction proposed by Firth (1995).

Figure 12

Figure A3 Estimated type I error considering $N \in \{10,50,100\}$ number of subjects and $n_j \in \{5,10,25,50\}$ repeated measurements. Each line represents one model, and the grey area around the dashed black line represents the $0.95$ confidence bounds for $\alpha = 0.05$. The term “misspecified GLMM 2” refers to the GLMM with random intercept and random slope for $X_{ij}$ specified, whereas “misspecified GLMM” refers to the GLMM with solely the random intercept included in the model specification. The term “flip2sss” refers to the results using the classic GLM in the first stage, while the term “flip2sss Firth” refers to the results using the bias correction proposed by Firth (1995). Finally, the terms GLM and GEE refer to a GLM with a logit link function and a GEE with an independent working correlation matrix, respectively.

Figure 13

Figure A4 Estimated power considering $N \in \{10,50,100\}$ number of subjects and $n_j \in \{5,10,25,50\}$ repeated measurements. Each line represents one model with corresponding colored confidence bounds at level $0.95$. The term “flip2sss” refers to the results using the classic GLM in the first stage, while the term “flip2sss Firth” refers to the results using the bias correction proposed by Firth (1995).

Figure 14

Figure A5 Estimated type I error considering $N \in \{10,50,100\}$ number of subjects and $n_j \in \{5,10,25,50\}$ repeated measurements in the case of heteroscedasticity. Each line represents one model, and the grey area around the dashed black line represents the $0.95$ confidence bounds for $\alpha = 0.05$. The “LMM” refers to the LMM specifying random intercept and random slope, while the “LMM misspecified” refers to the LMM with only random intercept. Finally, the terms LM and GEE refer to a normal linear model and a GEE with an independent working correlation matrix, respectively.

Figure 15

Figure A6 Zoom of Figure A5 considering the estimated type I error $\in [0,0.1]$. Estimated type I error considering $N \in \{10,50,100\}$ number of subjects and $n_j \in \{5,10,25,50\}$ repeated measurements in the case of heteroscedasticity. Each line represents one model, and the grey area around the dashed black line represents the $0.95$ confidence bounds for $\alpha = 0.05$. The “LMM” refers to the LMM specifying random intercept and random slope, while the “LMM misspecified” refers to the LMM with only random intercept. Finally, the terms LM and GEE refer to a normal linear model and a GEE with an independent working correlation matrix, respectively.

Figure 16

Table B1 GLMM results assuming only the random intercept: summary of the model and related analysis of variance table.