Hostname: page-component-89b8bd64d-r6c6k Total loading time: 0 Render date: 2026-05-07T12:09:33.739Z Has data issue: false hasContentIssue false

Multi-Group Regularized Gaussian Variational Estimation: Fast Detection of DIF

Published online by Cambridge University Press:  03 January 2025

Weicong Lyu
Affiliation:
Faculty of Education, University of Macau, Macau, China
Chun Wang*
Affiliation:
College of Education, University of Washington, Seattle, WA, USA
Gongjun Xu*
Affiliation:
Department of Statistics, University of Michigan, Ann Arbor, MI, USA.
*
Corresponding author: Chun Wang and Gongjun Xu; Emails: wang4066@uw.edu; gongjun@umich.edu
Corresponding author: Chun Wang and Gongjun Xu; Emails: wang4066@uw.edu; gongjun@umich.edu
Rights & Permissions [Opens in a new window]

Abstract

Data harmonization is an emerging approach to strategically combining data from multiple independent studies, enabling addressing new research questions that are not answerable by a single contributing study. A fundamental psychometric challenge for data harmonization is to create commensurate measures for the constructs of interest across studies. In this study, we focus on a regularized explanatory multidimensional item response theory model (re-MIRT) for establishing measurement equivalence across instruments and studies, where regularization enables the detection of items that violate measurement invariance, also known as differential item functioning (DIF). Because the MIRT model is computationally demanding, we leverage the recently developed Gaussian Variational Expectation–Maximization (GVEM) algorithm to speed up the computation. In particular, the GVEM algorithm is extended to a more complicated and improved multi-group version with categorical covariates and Lasso penalty for re-MIRT, namely, the importance weighted GVEM with one additional maximization step (IW-GVEMM). This study aims to provide empirical evidence to support feasible uses of IW-GVEMM for re-MIRT DIF detection, providing a useful tool for integrative data analysis. Our results show that IW-GVEMM accurately estimates the model, detects DIF items, and finds a more reasonable number of DIF items in a real world dataset. The proposed method has been integrated into R package VEMIRT (https://map-lab-uw.github.io/VEMIRT).

Information

Type
Theory and Methods
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Psychometric Society
Figure 0

Table 1 Mean running times (in Seconds) of the first five replications

Figure 1

Table 2 DIF Parameters in simulation study I

Figure 2

Table 3 Means (standard deviations) of true positive rates across replications of simulation study I

Figure 3

Table 4 Means (standard deviations) of false positive rates in simulation study I

Figure 4

Table 5 DIF parameters in simulation study II

Figure 5

Table 6 Means (standard deviations) of true positive rates in simulation study II

Figure 6

Table 7 Means (standard deviations) of false positive rates in simulation study II

Figure 7

Figure 1 Relationship between number of non-zero DIF parameters and c of GIC in PROMIS data.

Figure 8

Table 8 DIF detection results of PROMIS anxiety and depression scales

Figure 9

Table 9 Estimated mean and covariance matrix (impact) of PROMIS anxiety and depression scales using IW-GVEMM

Supplementary material: File

Lyu et al. supplementary material

Lyu et al. supplementary material
Download Lyu et al. supplementary material(File)
File 209.6 KB