Hostname: page-component-77f85d65b8-pkds5 Total loading time: 0 Render date: 2026-04-17T23:33:10.659Z Has data issue: false hasContentIssue false

Explaining Performance Gaps with Problem-Solving Process Data via Latent Class Mediation Analysis

Published online by Cambridge University Press:  11 August 2025

Sunbeom Kwon
Affiliation:
Department of Psychology, University of Illinois Urbana-Champaign , Champaign, IL, USA
Susu Zhang*
Affiliation:
Department of Psychology, University of Illinois Urbana-Champaign , Champaign, IL, USA Department of Statistics, University of Illinois Urbana-Champaign
*
Corresponding author: Susu Zhang; Email: szhan105@illinois.edu
Rights & Permissions [Opens in a new window]

Abstract

Process data, in particular, log data collected from a computerized test, documents the sequence of actions performed by an examinee in pursuit of solving a problem, affording an opportunity to understand test-taking behavioral patterns that account for demographic group differences in key outcomes of interest, for instance, final score on a cognitive item. Addressing this aim, this article proposes a latent class mediation analysis procedure. Using continuous process features extracted from action sequence data as indicators, latent classes underlying the test-taking behavior are identified in a latent class mediation model, where an examinee’s nominal latent class membership enters as the mediator between the observed grouping and outcome variables. A headlong search algorithm for selecting the subset of process features that maximizes the total indirect effect of the latent class mediator is implemented. The proposed procedure is validated with a series of simulations. An application to a large-scale assessment highlights how the proposed method can be used to explain performance gaps between students with learning disability and their typically developing peers on the National Assessment of Educational Progress (NAEP) math assessment.

Information

Type
Theory and Methods
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - ND
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (https://creativecommons.org/licenses/by-nc-nd/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided that no alterations are made and the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use and/or adaptation of the article.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Psychometric Society
Figure 0

Figure 1 Item VH336968 from the 2017 NAEP Grade 8 Math Assessment.Note: https://www.nationsreportcard.gov/nqt/.

Figure 1

Table 1 Descriptive statistics of the NAEP Math Assessment Item VH336968

Figure 2

Figure 2 Latent class mediation model.Note: $M_k$ represents a process feature, $\Omega $ is a latent class variable, G is a binary group membership (e.g., LD = 1 versus TD = 0), Y is a binary outcome (e.g., correct = 1 versus incorrect = 0)., and X is a covariate. Solid arrows indicate predictive relationships: G and X predict $\Omega $, while $\Omega $ and X predict Y. The dashed arrows indicate that the $M_k$s serve as measurement indicators of $\Omega $.

Figure 3

Figure 3 True mean structures in the simulation study.Note: The columns represent the four latent classes, and the rows represent the ten indicators. The first S rows are the signal indicators, and the rest are the noisy indicators.

Figure 4

Figure 4 Scatter plots of simulated indicators from the simulation conditions.

Figure 5

Table 2 Simulation study results

Figure 6

Table 3 Tool usage rates of latent classes from the NAEP math assessment from the NAEP math assessment item VH336968

Figure 7

Table 4 Model implied response probabilities and class probabilities from the NAEP math assessment item VH336968

Figure 8

Figure 5 t-SNE plot of the selected process features from the NAEP math assessment item VH336968.Source: U.S. Department of Education, National Center for Education Statistics, “Response Process Data from the NAEP 2017 Grade 8 Mathematics Assessment.”

Figure 9

Table D1 Parameter recovery with fixed L and $\mathbf {M}_\kappa $

Figure 10

Table E2 Results from the additional simulation with alternative data generating models