Hostname: page-component-89b8bd64d-dvtzq Total loading time: 0 Render date: 2026-05-07T20:32:15.087Z Has data issue: false hasContentIssue false

An Optimally Regularized Estimator of Multilevel Latent Variable Models with Improved MSE Performance

Published online by Cambridge University Press:  22 September 2025

Valerii Dashuk
Affiliation:
Hector Research Institute of Education Sciences and Psychology, University of Tübingen , Germany Department of Psychology, Faculty of Human Sciences, MSH Medical School Hamburg , Germany
Martin Hecht
Affiliation:
Department of Methodology and Statistics for Psychology, Helmut Schmidt University Hamburg , Germany
Oliver Lüdtke*
Affiliation:
Educational Measurement and Data Science, Leibniz Institute for Science and Mathematics Education at the University of Kiel , Germany Educational Measurement and Data Science, Centre for International Student Assessment, Germany
Alexander Robitzsch
Affiliation:
Educational Measurement and Data Science, Leibniz Institute for Science and Mathematics Education at the University of Kiel , Germany Educational Measurement and Data Science, Centre for International Student Assessment, Germany
Steffen Zitzmann
Affiliation:
Department of Psychology, Faculty of Human Sciences, MSH Medical School Hamburg , Germany
*
Corresponding author: Oliver Lüdtke; Email: oluedtke@leibniz-ipn.de
Rights & Permissions [Opens in a new window]

Abstract

We propose an optimally regularized Bayesian estimator of multilevel latent variable models that aims to outperform traditional maximum likelihood (ML) estimation in mean squared error (MSE) performance. We focus on the between-group slope in a two-level model with a latent covariate. Our estimator combines prior information with data-driven insights for optimal parameter estimation. We present a “proof of concept” by computer simulations, involving varying numbers of groups, group sizes, and intraclass correlations (ICCs), which we conducted to compare the newly proposed estimator with ML. Additionally, we provide a step-by-step tutorial on applying the regularized Bayesian estimator to real-world data using our MultiLevelOptimalBayes package.

Encouragingly, our results show that our estimator offers improved MSE performance, especially in small samples with low ICCs. These findings suggest that the estimator can be an effective means for enhancing estimation accuracy.

Information

Type
Theory and Methods
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Psychometric Society
Figure 0

Figure 1 A multilevel structural equation model using the within-between framework that decomposes the variables X and Y into within-group and between-group components.Note: The within-group components are denoted by subscript w, and the between-group components are denoted by subscript b. The between-group components ($X_b$ and $Y_b$) are connected through a regression, where $Y_b$ serves as the dependent variable and $X_b$ as the predictor. Similarly, the within-group components ($X_w$ and $Y_w$) are related to each other in an analogous manner. The notation includes $\beta _b$ for the between-group slope and $\beta _w$ for the within-group slope.

Figure 1

Figure 2 Log of root MSE (RMSE) in estimating the between-group slope $\beta _b$ for the ML and the two Bayesian estimators as a function of the sample size at the group level (J) and the ICC of the predictor ICC$_X$.Note: The scale of the y-axis differs between the four subplots. Results are shown for $n = 15$ people per group, and constant within-group and between-group slopes of $\beta _w = 0.5$ and $\beta _b = 0.2$, respectively.

Figure 2

Figure 3 Log of root MSE (RMSE) in estimating the between-group slope $\beta _b$ for the two Bayesian estimators as a function of the sample size at the group level (J) and the ICC of the predictor ICC$_X$.Note: The scale of the y-axis differs between the four subplots. Results are shown for $n = 15$ people per group, and constant within-group and between-group slopes of $\beta _w = 0.5$ and $\beta _b = 0.2$, respectively.

Figure 3

Figure 4 Relative bias in estimating the between-group slope $\beta _b$ for the ML and the two Bayesian estimators as a function of the sample size at the group level (J) and the ICC of the predictor ICC$_X$.Note: The scale of the y-axis differs between the four subplots. Results are shown for $n = 15$ people per group, and constant within- and between-group slopes of $\beta _w = 0.5$ and $\beta _b = 0.2$, respectively.

Figure 4

Table 1 Average RMSE and relative bias values of the ML (RMSE$_{\text {ML}}$ and Bias$_{\text {ML}}$, respectively), the Bayesian estimator with $\beta _b$ (RMSE$_{\text {Bay}}$ and Bias$_{\text {Bay}}$, respectively), and the Bayesian estimator with $\hat {\beta }_b$ (RMSE$_{\text {BML}}$ and Bias$_{\text {BML}}$, respectively) for different values of n and J. Values in bold indicate the smallest RMSE and the smallest relative bias for each combination of n and J

Figure 5

Table 2 RMSE values of the ML (RMSE$_{\text {ML}}$) and the Bayesian estimators (RMSE$_{\text {Bay}}$ represents the Bayesian with $\beta _b$ and RMSE$_{\text {BML}}$ represents the Bayesian with $\hat {\beta }_b$) for ICC$_X=0.05$ and different values of n, J, $\beta _b$, and $\beta _w$

Figure 6

Table 3 RMSE values of the ML (RMSE$_{\text {ML}}$) and the Bayesian estimators (RMSE$_{\text {Bay}}$ represents the Bayesian with $\beta _b$ and RMSE$_{\text {BML}}$ represents the Bayesian with $\hat {\beta }_b$) for ICC$_X=0.1$ and different values of n, J, $\beta _b$, and $\beta _w$

Figure 7

Table 4 RMSE values of the ML (RMSE$_{\text {ML}}$) and the Bayesian estimators (RMSE$_{\text {Bay}}$ represents the Bayesian with $\beta _b$ and RMSE$_{\text {BML}}$ represents the Bayesian with ${\hat {\beta }}_b$) for ICC$_X=0.3$ and different values of n, J, $\beta _b$, and $\beta _w$

Figure 8

Table 5 RMSE values of the ML (RMSE$_{\text {ML}}$) and the Bayesian estimators (RMSE$_{\text {Bay}}$ represents the Bayesian with $\beta _b$ and RMSE$_{\text {BML}}$ represents the Bayesian with $\hat {\beta }_b$) for ICC$_X=0.5$ and different values of n, J, $\beta _b$, and $\beta _w$

Figure 9

Table 6 Relative bias in % of the ML (BiasML) and the Bayesian estimators (BiasBay represents the Bayesian with $\beta _b$ and BiasBML represents the Bayesian with $\hat {\beta }_b$) for ICC$_X=0.05$ and different values of n, J, $\beta _b$, and $\beta _w$

Figure 10

Table 7 Relative bias in % of the ML (Bias$_{\text {ML}}$) and the Bayesian estimators (Bias$_{\text {Bay}}$) represents the Bayesian with $\beta _b$ and Bias$_{\text {BML}}$ represents the Bayesian with $\hat {\beta }_b$) for ICC$_X=0.1$ and different values of n, J, $\beta _b$, and $\beta _w$

Figure 11

Table 8 Relative bias in % of the ML (Bias$_{\text {ML}}$) and the Bayesian estimators (Bias$_{\text {Bay}}$ represents the Bayesian with $\beta _b$ and BiasBML represents the Bayesian with $\hat {\beta }_b$) for ICC$_X=0.3$ and different values of n, J, $\beta _b$, and $\beta _w$

Figure 12

Table 9 Relative bias in % of the ML (Bias$_{\text {ML}}$) and the Bayesian estimators (Bias$_{\text {Bay}}$ represents the Bayesian with $\beta _b$ and Bias$_{\text {BML}}$ represents the Bayesian with $\hat {\beta }_b$) for ICC$_X=0.5$ and different values of n, J, $\beta _b$, and $\beta _w$