Hostname: page-component-89b8bd64d-5bvrz Total loading time: 0 Render date: 2026-05-05T14:20:41.958Z Has data issue: false hasContentIssue false

Bayesian Transition Diagnostic Classification Models with Polya-Gamma Augmentation

Published online by Cambridge University Press:  08 August 2025

Joseph Resch
Affiliation:
Department of Statistics & Data Science, University of California , Los Angeles, CA, USA
Samuel Baugh
Affiliation:
Department of Statistics, The Pennsylvania State University , University Park, PA, USA
Hao Duan
Affiliation:
Department of Statistics & Data Science, University of California , Los Angeles, CA, USA
James Tang
Affiliation:
Department of Statistics & Data Science, University of California , Los Angeles, CA, USA
Matthew J. Madison
Affiliation:
Department of Education, University of Georgia , Athens, GA, USA
Michael Cotterell
Affiliation:
Department of Computer Science, University of Georgia , Athens, GA, USA
Minjeong Jeon*
Affiliation:
Department of Education, University of California , Los Angeles, CA, USA
*
Corresponding author: Minjeong Jeon; Email: mjjeon@ucla.edu
Rights & Permissions [Opens in a new window]

Abstract

Diagnostic classification models assume the existence of latent attribute profiles, the possession of which increases the probability of responding correctly to questions requiring the corresponding attributes. Through the use of longitudinally administered exams, the degree to which students are acquiring core attributes over time can be assessed. While past approaches to longitudinal diagnostic classification modeling perform inference on the overall probability of acquiring particular attributes, there is particular interest in the relationship between student progression and student covariates such as intervention effects. To address this need, we propose an integrated Bayesian model for student progression in a longitudinal diagnostic classification modeling framework. Using Pòlya-gamma augmentation with two logistic link functions, we achieve computationally efficient posterior estimation with a conditionally Gibbs sampling procedure. We show that this approach achieves accurate parameter recovery when evaluated using simulated data. We also demonstrate the method on a real-world educational testing data set.

Information

Type
Theory and Methods
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Psychometric Society
Figure 0

Table 1 List of notation used throughout the article

Figure 1

Table 2 The $2^T$ types of transitions (denoted r) for each attribute for (a) $T=2$, (b) $T=3$, and (c) $T=4$

Figure 2

Table 3 Q matrix for empirical data (Bottge et al., 2014, 2015) indicating attribute requirements for each $J=21$ test item questions

Figure 3

Figure 1 Comparison of $\mathcal {B}$ point estimates between the standard TDCM(left) and the extended TDCM (right) for the single-group setting.

Figure 4

Figure 2 Posterior sample distributions for $\mathcal {B}$ (left) and $\Gamma $ (right) bounded by two standard deviations.Note: For $\mathcal {B}$, each question item contains an intercept in red and a main effect in blue. The four segments for $\Gamma $ plot correspond to the four attributes: RPR, MD, NF, and GG. The indices within each attribute segment denote transition $0\to 1$ intercept, $1\to 0$ intercept, and $1\to 1$ intercept, respectively.

Figure 5

Table 4 Comparison of implicit conditional transition probabilities for each $K=4$ attributes of the single-group empirical study for the standard TDCM (left) and extended TDCM (right) posterior means and standard deviations in parenthesis

Figure 6

Table 5 Probabilities of extended TDCM fit to match the empirical response data are averaged over all $849$ respondents for each of the $21$ test items, done for both time points of the study in the single-group setting

Figure 7

Figure 3 Distribution of probability of extended TDCM fit to match the empirical response data average for $849$ respondents averaged over $21$ total questions, done for both time points of the study in the single-group setting.

Figure 8

Figure 4 Percentage of $849$ respondents that answered each of the $21$ questions correctly, with the value for the original data shown by the red points and simulated posterior predictive data shown by distributions.Note: Results for both time points are shown for the single-group setting.

Figure 9

Figure 5 Posterior $\mathcal {B}$ distributions plotted using the TDCM results (obtained using the Mplus code from Madison & Bradshaw (2018b)) bounded by two reported standard errors (left) is compared to the extended TDCM bounded by two posterior standard deviations (right).Note: Red indicates the question intercepts, while blue indicates the main effects.

Figure 10

Figure 6 The four segments for $\Gamma $ plot correspond to the four attributes: RPR, MD, NF, and GG.Note: The indices within each attribute segment denote transition $0\to 1$ intercept, $0\to 1$ intervention, $1\to 0$ intercept, $1\to 0$ intervention, and $1\to 1$ intercept, respectively.

Figure 11

Table 6 Transition probabilities for $0\to 1$ and $1\to 0$ transitions between the control and treatment groups, compared between the standard multiple-group TDCM (Standard) and the extended TDCM (Extended)

Figure 12

Table 7 Probabilities of extended TDCM fit to match the empirical response data are averaged over all $849$ respondents for each of the $21$ test items, done for both time points of the study in the multiple-group setting

Figure 13

Figure 7 Distribution of probability of extended TDCM fit to match the empirical response data average for $849$ respondents averaged over $21$ total questions, done for both time points of the study in the multiple-group setting.

Figure 14

Figure 8 Percentage of $849$ respondents that answered each of the $21$ questions correctly, with the value for the original data shown by the red points and simulated posterior predictive data shown by distributions.Note: Results for both time points are shown for the multiple-group setting.

Figure 15

Figure 9 Posterior $\mathcal {B}$ distributions of extended TDCM bounded by two posterior standard deviations (right).Note: Red indicates the question intercepts, while blue indicates the main effects.

Figure 16

Figure 10 The four segments for $\Gamma $ plot correspond to the four attributes: RPR, MD, NF, and GG.Note: The indices within each attribute segment denote transition $0\to 1$ intercept, $0\to 1$ intervention, $0\to 1$ gender, $0\to 1$ ESL, $1\to 0$ intercept, $1\to 0$ intervention, $1\to 0$ gender, $1\to 0$ ESL, and $1\to 1$ intercept, respectively.

Figure 17

Table 8 Probabilities of extended TDCM fit to match the empirical response data are averaged over all $755$ respondents for each of the $21$ test items, done for both time points of the study in the multiple-group covariates setting

Figure 18

Figure 11 Distribution of probability of extended TDCM fit to match the empirical response data average for $849$ respondents averaged over $21$ total questions, done for both time points of the study in the multiple-group covariates setting.

Figure 19

Figure 12 Percentage of $755$ respondents that answered each of the $21$ questions correctly, with the value for the original data shown by the red points and posterior predictive data shown by distributions.Note: Results for both time points are shown for the multiple-group with covariates setting.

Figure 20

Table 9 Probabilities of extended TDCM fit to match the empirical response data are averaged over all $755$ respondents for each of the $21$ test items, done for both time points of the study in the single-group, no-covariate setting

Figure 21

Figure 13 Posterior $\mathcal {B}$ (left) and $\Gamma $ (right) distributions bound by two standard deviations for the $K=3$ simulation setting.Note: The four indices for each attribute denote transition $0\to 1$ intercept, $0\to 1$ treatment (single group), $1\to 0$ intercept, and $1\to 1$ intercept, respectively.

Figure 22

Figure 14 True value coverage rates for $95\%$ credible intervals of $\mathcal {B}$ (left) and $\Gamma $ (right) distributions bound for the $K=3$ simulation setting.Note: The four indices for each attribute denote transition $0\to 1$ intercept, $0\to 1$ treatment (single group), $1\to 0$ intercept, and $1\to 1$ intercept, respectively.

Figure 23

Figure 15 Posterior $\mathcal {B}$ (left) and $\Gamma $ (right) distributions bound by two standard deviations for the simulation setting with four covariates.

Figure 24

Figure 16 True value coverage rates for $95\%$ credible intervals of $\mathcal {B}$ (left) and $\Gamma $ (right) distributions bound for the $K=3$ with additional covariates simulation setting.Note: The first five estimates correspond to the $0\to 1$ transition (intercept plus four covariate effects), followed by the next five for $1\to 0$, and the intercept for $1\to 1$.

Figure 25

Figure 17 Posterior $\mathcal {B}$ (left) and $\Gamma $ (right) distributions bound by two standard deviations for the simulation setting with four covariates.

Figure 26

Figure 18 True value coverage rates for $95\%$ credible intervals of $\mathcal {B}$ (left) and $\Gamma $ (right) distributions bound for the $K=3$ with additional covariates simulation setting.Note: The first five estimates correspond to $0\to 1$ transition (intercept plus four covariate effects), followed by the next five for $1\to 0$, and the intercept for $1\to 1$.

Figure 27

Figure 19 Posterior $\mathcal {B}$ (left) and $\Gamma $ (right) distributions bound by two standard deviations for the $T=3$ simulation setting.

Figure 28

Figure 20 True value coverage rates for $95\%$ credible intervals of $\mathcal {B}$ (left) and $\Gamma $ (right) distributions bound for the $K=3$ simulation setting.Note: The four indices for each attribute denote transition $0\to 1$ intercept, $0\to 1$ treatment (single group), $1\to 0$ intercept, and $1\to 1$ intercept, respectively.

Supplementary material: File

Resch et al. supplementary material

Resch et al. supplementary material
Download Resch et al. supplementary material(File)
File 920 KB