Hostname: page-component-77f85d65b8-8v9h9 Total loading time: 0 Render date: 2026-04-21T12:30:23.280Z Has data issue: false hasContentIssue false

Detecting Differential Item Functioning across Multiple Groups Using Group Pairwise Penalty

Published online by Cambridge University Press:  11 August 2025

Weicong Lyu
Affiliation:
Faculty of Education, University of Macau , Macau, China
Chun Wang*
Affiliation:
College of Education, University of Washington , Seattle, WA, USA
Gongjun Xu*
Affiliation:
Department of Statistics, University of Michigan , Ann Arbor, MI, USA
*
Corresponding authors: Chun Wang and Gongjun Xu; Emails: wang4066@uw.edu; gongjun@umich.edu
Corresponding authors: Chun Wang and Gongjun Xu; Emails: wang4066@uw.edu; gongjun@umich.edu
Rights & Permissions [Opens in a new window]

Abstract

We introduce a novel regularization method for detecting differential item functioning (DIF) in two-parameter logistic (2PL) models. Existing regularization methods require choosing a reference group and using an $L_1$ penalty (LP) to shrink the item parameters of focal groups toward those of the reference. This approach has two key limitations: (1) shrinking all focal groups toward a reference is inherently unfair, as results are affected by the choice of reference and direct comparison among focal groups is unavailable and (2) the LP leads to biased estimates because it overly shrinks large nonzero parameters toward zero. These limitations are particularly problematic for intersectional DIF, where various identity aspects intersect to create multiple smaller groups. Our method addresses these issues by penalizing item parameter differences between all pairs of groups using a truncated LP, thereby treating groups equally and avoiding excessive penalization of large differences. Simulations demonstrate that the proposed method outperforms existing approaches by accurately identifying items exhibiting DIF even with multiple small groups. Application to two real-world datasets further illustrates its utility. We recommend this method as a more equitable and precise tool for DIF detection. The proposed method is available as D2PL_pair_em() in the R package VEMIRT (https://map-lab-uw.github.io/VEMIRT).

Information

Type
Theory and Methods
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Psychometric Society
Figure 0

Figure 1 $L_1$ truncated L1 penalties.

Figure 1

Figure 2 An illustration of the union-find data structure.

Figure 2

Table 1 Impact and DIF parameters for $S=3$ groups

Figure 3

Table 2 Impact and DIF parameters for $S=10$ groups

Figure 4

Table 3 Means (standard deviations) of true positive rates across replications of Simulation I

Figure 5

Table 4 Means (standard deviations) of false positive rates across replications of Simulation I

Figure 6

Figure 3 Mean true positive rates across replications of Simulation I.

Figure 7

Figure 4 Mean false positive rates across replications of Simulation I.

Figure 8

Table 5 Group sizes for $S=3$ groups under unbalanced design

Figure 9

Table 6 Group sizes for $S=10$ groups under unbalanced design

Figure 10

Table 7 Means (standard deviations) of true positive rates across replications of Simulation II

Figure 11

Table 8 Means (standard deviations) of false positive rates across replications of Simulation II

Figure 12

Figure 5 Mean true positive rates across replications of Simulation II.

Figure 13

Figure 6 Mean false positive rates across replications of Simulation II.

Figure 14

Table 9 Countries and economies in the PISA analysis

Figure 15

Table 10 Frequency table of numbers of distinct groups

Figure 16

Figure 7 ICCs of PISA items.

Figure 17

Table 11 Sample sizes and estimated impact of economies

Figure 18

Figure 8 Numbers of DIF items between pairs of economies using TLP with $\rho =0.5$.

Figure 19

Table 12 Groups in the language assessment

Figure 20

Figure 9 Numbers of DIF items between pairs of groups in the language assessment using TLP with $\rho =0.25$.

Figure 21

Figure 10 Numbers of DIF items between pairs of groups in the language assessment using IW-GVEMM.