Hostname: page-component-89b8bd64d-dvtzq Total loading time: 0 Render date: 2026-05-06T01:10:03.095Z Has data issue: false hasContentIssue false

Exploratory General-Response Cognitive Diagnostic Models with Higher-Order Structures

Published online by Cambridge University Press:  16 April 2025

Jia Liu
Affiliation:
Department of Statistics, Columbia University, New York, NY, USA
Seunghyun Lee
Affiliation:
Department of Statistics, Columbia University, New York, NY, USA
Yuqi Gu*
Affiliation:
Department of Statistics, Columbia University, New York, NY, USA
*
Corresponding author: Yuqi Gu; Email: yuqi.gu@columbia.edu
Rights & Permissions [Opens in a new window]

Abstract

Cognitive Diagnostic Models (CDMs) are popular discrete latent variable models in educational and psychological measurement. While existing CDMs mainly focus on binary or categorical responses, there is a growing need to extend them to model a wider range of response types, including but not limited to continuous and count-valued responses. Meanwhile, incorporating higher-order latent structures has become crucial for gaining deeper insights into cognitive processes. We propose a general modeling framework for higher-order CDMs for rich types of responses. Our framework features a highly flexible data layer that is adaptive to various response types and measurement models for CDMs. Importantly, we address a challenging exploratory estimation scenario where the item-attribute relationship, specified by the Q-matrix, is unknown and needs to be estimated along with other parameters. In the higher-order layer, we employ a probit-link with continuous latent traits to model the binary latent attributes, highlighting its benefits in terms of identifiability and computational efficiency. Theoretically, we propose transparent identifiability conditions for the exploratory setting. Computationally, we develop an efficient Monte Carlo Expectation–Maximization algorithm, which incorporates an efficient direct sampling scheme and requires significantly reduced simulated samples. Extensive simulation studies and a real data example demonstrate the effectiveness of our methodology.

Information

Type
Theory and Methods
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Psychometric Society
Figure 0

Table 1 RMSE and aBias for the main-effect HO-GRCDM

Figure 1

Table 2 Proportion of correctly recovered rows ($P_R$) and entries ($P_E$) in Q-matrix for the main-effect HO-GRCDM

Figure 2

Table 3 RMSE and aBias for the all-effect HO-GRCDM

Figure 3

Table 4 Proportion of correctly recovered rows ($P_R$) and entries ($P_E$) in Q-matrix for the all-effect HO-GRCDM

Figure 4

Table 5 RMSE and aBias for the DINA HO-GRCDM

Figure 5

Table 6 Proportion of correctly recovered rows ($P_R$) and entries ($P_E$) in Q-matrix for the DINA HO-GRCDM

Figure 6

Table 7 Q-matrix of the TIMSS 2019 data set

Figure 7

Table 8 $Q^{(H)\top }$ matrix of the TIMSS 2019 data set

Figure 8

Figure 1 TIMSS data analysis.Note: Probability histogram and fitted density curves (empirical density, Gamma model, and log-normal model) for response time data (in minutes).

Figure 9

Figure 2 TIMSS data analysis. (a) Heatmap for estimated bottom-layer parameters. (b) Correlation plot of the estimated latent attributes.

Supplementary material: File

Liu et al. supplementary material

Liu et al. supplementary material
Download Liu et al. supplementary material(File)
File 416.3 KB