Hostname: page-component-89b8bd64d-5bvrz Total loading time: 0 Render date: 2026-05-13T19:24:27.404Z Has data issue: false hasContentIssue false

A Beta Mixture Model for Careless Respondent Detection in Visual Analogue Scale Data

Published online by Cambridge University Press:  23 September 2025

Lijin Zhang*
Affiliation:
Graduate School of Education, Stanford University , Stanford, CA, USA
Benjamin W. Domingue
Affiliation:
Graduate School of Education, Stanford University , Stanford, CA, USA
Leonie V. D. E. Vogelsmeier
Affiliation:
Department of Methodology, Tilburg University , Tilburg, The Netherlands
Esther Ulitzsch
Affiliation:
Centre for Educational Measurement (CEMO), University of Oslo , Oslo, Norway Centre for Research on Equality in Education (CREATE), University of Oslo , Oslo, Norway
*
Corresponding author: Lijin Zhang; Email: lijinzhang@stanford.edu
Rights & Permissions [Opens in a new window]

Abstract

Visual Analogue scales (VASs) are increasingly popular in psychological, social, and medical research. However, VASs can also be more demanding for respondents, potentially leading to quicker disengagement and a higher risk of careless responding. Existing mixture modeling approaches for careless response detection have so far only been available for Likert-type and unbounded continuous data but have not been tailored to VAS data. This study introduces and evaluates a model-based approach specifically designed to detect and account for careless respondents in VAS data. We integrate existing measurement models for VASs with mixture item response theory models for identifying and modeling careless responding. Simulation results show that the proposed model effectively detects careless responding and recovers key parameters. We illustrate the model’s potential for identifying and accounting for careless responding using real data from both VASs and Likert scales. First, we show how the model can be used to compare careless responding across different scale types, revealing a higher proportion of careless respondents in VAS compared to Likert scale data. Second, we demonstrate that item parameters from the proposed model exhibit improved psychometric properties compared to those from a model that ignores careless responding. These findings underscore the model’s potential to enhance data quality by identifying and addressing careless responding.

Information

Type
Application and Case Studies - Original
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Psychometric Society
Figure 0

Figure 1 Likert scale and VAS.

Figure 1

Figure 2 Beta IRM.Note: (a) ICC for different difficulty parameters, with dispersion fixed at 0. (b) ICC with response densities for $\theta _i - \delta _j = 0$ under different dispersion parameters. (c) Item Information Function for different dispersion parameters, with difficulty fixed at 0.

Figure 2

Figure 3 Model structure.Note: For attentive respondents, responses are assumed to follow a Beta IRM, governed by item and person parameters; a two-factor, six-item structure is used as an example. For careless respondents, responses are assumed to stem from a common Beta distribution, with Beta(2, 5) used as an illustrative example in the figure.

Figure 3

Figure 4 Density plot for different Beta distributions.

Figure 4

Table 1 Classification results based on the threshold

Figure 5

Table 2 Classification results based on $\pi _{\mathcal {P}}$

Figure 6

Table 3 Estimation accuracy of item parameters and $\pi _{\mathcal {P}}$

Figure 7

Figure 5 Beta approximation for careless responses.Note: Beta = Random responses generated from $\text {Beta}(0.5, 0.5)$; Preference = Overly consistent pattern; Normal = Random responses generated from a truncated normal distribution. Blue lines indicate the Beta approximation for careless responses across all simulated datasets, while red dashed lines represent the distribution used for data generation.

Figure 8

Table 4 Parameter estimation for the careless response distributions

Figure 9

Table 5 Classification results of the mixture CFA model when $1-\pi _{\mathcal {P}} = 0.15$.

Figure 10

Figure 6 Beta(0.388, 0.728): Model-implied distribution of careless responses in the empirical illustration.

Figure 11

Figure 7 Inter-item correlations for attentive and careless groups for VAS.Note: The item labels “h” and “a” represent the respective scales for “height” and “autonomy.”

Figure 12

Table 6 Summary of estimates for VAS data

Figure 13

Figure 8 Item parameter estimates for VAS obtained from different models.Note: The dashed line shows where the obtained estimates are equal; the labels “h” and “a” represent the item parameters of the “height” and “autonomy” scales.

Figure 14

Figure 9 Difference in factor score estimates.Note: $\pi _i$ denotes the probability that person i belongs to the attentive group; the labels “h” and “a” represent the respective factors of “height” and “autonomy.”

Figure 15

Table 7 Contingency table for classification.

Figure 16

Figure 10 Comparison of individual probabilities ($\pi _i$) between VAS and Likert scale.

Figure 17

Figure 11 Item parameter estimates for Likert scale obtained from different models.Note: The dashed line shows where the obtained estimates are equal; the labels “h” and “a” represent the item parameters of the “height” and “autonomy” scales.