Hostname: page-component-89b8bd64d-r6c6k Total loading time: 0 Render date: 2026-05-05T23:10:21.213Z Has data issue: false hasContentIssue false

Plausible and Proper Multiple-Choice Items for Diagnostic Classification

Published online by Cambridge University Press:  19 December 2025

Chia-Yi Chiu*
Affiliation:
Human Development, Teachers College at Columbia University, USA
Hans Friedrich Köhn
Affiliation:
Psychology, University of Illinois at Urbana–Champaign, USA
Yu Wang
Affiliation:
Educational Testing Service, Princeton, USA
*
Corresponding author: Chia-Yi Chiu; Email: cc5010@tc.columbia.edu
Rights & Permissions [Opens in a new window]

Abstract

The multiple-choice (MC) item format has been adapted to the cognitive diagnosis (CD) framework. Early approaches simply dichotomized the responses and analyzed them with a CD model for binary responses. Obviously, this strategy cannot exploit the additional diagnostic information provided by MC items. De la Torre’s (2009, Applied Psychological Measurement, 33, 163–183) MC-DINA model was the first for the explicit analysis of MC items. However, the q-vectors of the distractors were constrained to be nested within the key and each other, which imposes serious restrictions on item development. Relaxing the nestedness-constraint, comes at a price. First, distractors may become redundant: they do not improve the classification of examinees beyond the response options already available for an item. Second, undesirable diagnostic ambiguity can arise from distractors that are equally likely to be chosen by an examinee, but have distinct attribute profiles pointing at different diagnostic classifications. In this article, two criteria, plausible and proper, are developed for detecting these problematic cases. Two theorems that permit for the detection and amendment of improper and implausible items are presented. An R function serving this purpose is used in several practical applications. Results of simulation studies and real data analysis are also reported.

Information

Type
Theory and Methods
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Psychometric Society
Figure 0

Table 1 Example: An MC item from an introductory statistics test

Figure 1

Table 2 Ideal responses to the introductory statistics MC item

Figure 2

Table 3 Example: A Q-matrix containing plausible items (left panel) and its altered correspondent containing two implausible items (underlain in yellow)

Figure 3

Table 4 The effect of implausible items on the average PAR values obtained from MC-NPC and MC-DINA when $H_j=4$

Figure 4

Table 5 The effect of implausible items on the average PAR values obtained from MC-NPC and MC-DINA when $H_j=5$

Figure 5

Table 6 The proper Q-matrix $\mathbf {Q}_{P}$ (left panel) and the corresponding improper Q-matrix $\mathbf {Q}_{NP}$ (right panel); $K=4$, $J=5$

Figure 6

Table 7 Impact of improper items on the mean PAR values from MC-NPC and MC-DINA

Figure 7

Table 8 Attributes measured in the dataset used in Application II

Figure 8

Table 9 Estimated attribute profiles under the proper and improper Q-matrices and subscores; highlighted rows indicate classification changes