Hostname: page-component-89b8bd64d-dvtzq Total loading time: 0 Render date: 2026-05-09T14:08:09.390Z Has data issue: false hasContentIssue false

A Family of Sequential Item Response Models for Multiple-Choice, Multiple-Attempt Test Items

Published online by Cambridge University Press:  03 January 2025

Yikai Lu
Affiliation:
Department of Psychology, University of Notre Dame, Notre Dame, IN, USA
Jim Fowler
Affiliation:
Department of Mathematics, The Ohio State University, Columbus, OH, USA
Ying Cheng*
Affiliation:
Department of Psychology, University of Notre Dame, Notre Dame, IN, USA
*
Corresponding author: Ying Cheng; Email: ycheng4@nd.edu
Rights & Permissions [Opens in a new window]

Abstract

We consider a test which allows students to attempt a multiple-choice question multiple times (i.e., multiple attempts). The most extreme form of multiple attempts is the answer-until-correct (AUC) procedure. Previous research has demonstrated that multiple-attempt procedures such as AUC could potentially enhance learning and increase reliability. However, for multiple-choice items, guessing is still non-ignorable. Traditional models of sequential item response theory (SIRT) could model multiple-attempt procedures but fail to take guessing into account. The purpose of this study is to propose SIRT models for multiple-choice, multiple-attempt items (SIRT-MM). First, we defined a family of SIRT-MM models to account for the idiosyncrasies of items, answer options, and examinee behavior. We also explained how these models could improve person parameter estimates by taking into account partial (mis)information of examinees. Second, we conducted model comparisons between the SIRT-MM models, the graded response model, and the nominal response model. Third, we discussed how the item and person parameters can be estimated, and evaluated item and person parameter recovery of SIRT-MM models under different conditions through a simulation study. Finally, we applied the SIRT-MM models to a real dataset and demonstrated their utility through model selection, person parameter recovery, and information functions.

Information

Type
Theory and Methods
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Psychometric Society
Figure 0

Figure 1 Item category response function: $a = 1.7, b = 0.0, K = 5$.

Figure 1

Table 1 Probabilities of submitting a correct response at each attempt for two hypothetical test items

Figure 2

Figure 2 Item category response function: $a = 1.7, b = 0.0, \gamma _{3} = 0.5, \gamma _{4} = 0$, and $K = 5$ with different $\gamma _{2}$.

Figure 3

Figure 3 $\frac {\overline {p_D}(\theta ,u)}{p_T(\theta ,u)}$ when $a = 1.7, b = 0.0, \gamma _{2} = 1, \gamma _{3} = 0.1, K = 4$.

Figure 4

Figure 4 Item category response function when the maximum number of attempts is 3: $a = 1.7, b = 0.0, \gamma _{2} = 1$, and $K = 5$.

Figure 5

Table 2 Family of SIRT-MM models

Figure 6

Figure 5 Fisher information of SIRT-MM models with $K = 4$, $b = 0$, and $\gamma _{u} = 0$ for $u = 2 \text { and } 3$, and different a; and the corresponding 2.5PL models with $\gamma _{1} \equiv 0$.

Figure 7

Figure 6 Fisher information of SIRT-MM models with $a = 0.75$, $b = 0$, $\gamma _{u} = 0$ for $u = 2 \text { and } 3$, and different K; and the corresponding 2.5PL models with $\gamma _{1} \equiv 0$.

Figure 8

Figure 7 Fisher information of SIRT-MM models with $a = 0.75$, $b = 0$, $\gamma _{3} = 0$, $K = 4$, and different $\gamma _{2}$; and the corresponding 2.5PL models with $\gamma _{1} \equiv 0$.

Figure 9

Figure 8 Model selection performance of AIC and BIC for SIRT-MM models when data are generated from SIRT-MM models with $N = 500$, $M = 30$, $K = 4$, $\theta \sim N(0, 1)$, $b_j \sim \text {Unif}(-2, 2)$, $a_j \sim \text {Unif}(0.75, 1.33)$, and $\gamma _{j u} \sim \text {Unif}(-1, 1)$. The freely estimated $\gamma _{j u}$ are denoted as Ga where a is the number of $\gamma _{j u}$ parameters for all u.

Figure 10

Figure 9 Model selection performance of AIC and BIC for SIRT-MM models when data are generated from SIRT-MM models with $N = 4,000$, $M = 30$, $K = 4$, $\theta \sim N(0, 1)$, $b_j \sim \text {Unif}(-2, 2)$, $a_j \sim \text {Unif}(0.75, 1.33)$, and $\gamma _{j u} \sim \text {Unif}(-1, 1)$. The freely estimated $\gamma _{j u}$ are denoted as Ga where a is the number of $\gamma _{j u}$ parameters for all u.

Figure 11

Table 3 Item recovery statistics for items without $\gamma _{ju}$

Figure 12

Table 4 Item recovery statistics for items with $\gamma _{j2}$

Figure 13

Table 5 Item recovery statistics for an item with $\gamma _{j2}$ and $\gamma _{j3}$

Figure 14

Figure 10 Person parameter statistics when $\theta \sim N(0,1), a_j \sim \text {Unif}(0.75, 1.33), b_j \sim \text {Unif}(-2, 2)$, and $\gamma _{j 2} \sim \text {Unif}(-1, 1)$. M is the number of items administered. The scoring scheme used in classical test theory is denoted as SS in the correlation plot.

Figure 15

Figure 11 RMSE for $\theta $ estimates conditioning on $\theta $ when $N = 1,000, M = 25, \theta \sim N(0,1), a_j\sim \text {Unif}(0.75, 1.33), b_j \sim \text {Unif}(-2, 2)$.

Figure 16

Table 6 Model-fit statistics for an SIRT-MM model without $\gamma _{j u}$ and an SIRT-MM model with a freely estimated $\gamma _{j 2}$

Figure 17

Figure 12 Scatter plot of $\theta $ estimated by the SIRT-MM models using only one attempt and two attempts from the real data.

Figure 18

Figure 13 Test information functions of the real data with different maximum numbers of attempts using the item parameter estimates for the SIRT-MM model.

Supplementary material: File

Lu et al. supplementary material

Lu et al. supplementary material
Download Lu et al. supplementary material(File)
File 2.2 MB