Hostname: page-component-77f85d65b8-7lfxl Total loading time: 0 Render date: 2026-03-30T10:24:16.259Z Has data issue: false hasContentIssue false

Analysis of Log Data From an International Online Educational Assessment System: A Multi-State Survival Modeling Approach to Reaction Time Between and Across Action Sequence

Published online by Cambridge University Press:  01 September 2025

Jina Park
Affiliation:
Department of Applied Statistics, Yonsei University , Seoul, South Korea Department of Statistics and Data Science, Yonsei University , Seoul, South Korea
Ick Hoon Jin*
Affiliation:
Department of Applied Statistics, Yonsei University , Seoul, South Korea Department of Statistics and Data Science, Yonsei University , Seoul, South Korea
Minjeong Jeon
Affiliation:
School of Education and Information Studies, University of California , Los Angeles, CA, USA
*
Corresponding author: Ick Hoon Jin; Email: ijin@yonsei.ac.kr
Rights & Permissions [Opens in a new window]

Abstract

With increasingly available computer-based or online assessments, researchers have shown keen interest in analyzing log data to improve our understanding of test takers’ problem-solving processes. In this article, we propose a multi-state survival model (MSM) to action sequence data from log files, focusing on modeling test takers’ reaction times between actions, in order to investigate which factors and how they influence test takers’ transition speed between actions. We specifically identify the key actions that differentiate correct and incorrect answers, compare transition probabilities between these groups, and analyze their distinct problem-solving patterns. Through simulation studies and sensitivity analyses, we evaluate the robustness of our proposed model. We demonstrate the proposed approach using problem-solving items from the Programme for the International Assessment of Adult Competencies (PIAAC).

Information

Type
Application and Case Studies - Original
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Psychometric Society
Figure 0

Table 1 The description and mean (standard deviation) of the five selected covariates from the Public Use Files

Figure 1

Figure 1 A publicly available example of the PSTRE assessment of 2012 PIAAC about simulated job searches.Note: Figure (a) and (b) are the list of job search sites and the page for the first link, respectively.

Figure 2

Table 2 The numbers and percentages of participants who answered CD tally and Lamp Return correctly and incorrectly in the 2012 PIAAC data

Figure 3

Table 3 The $2 \times 2$ contingency for chi-square test of action i

Figure 4

Figure 2 Line plot of chi-square scores in descending order for selecting key actions of (a) CD Tally and (b) Lamp Return test items.Note: The red circle indicates the elbow point of the line plot. Actions with chi-square scores above this point are designated as key actions.

Figure 5

Table 4 The top 12 actions based on the chi-square scores, along with the average occurrence frequency and average occurrence time (min) per person for the CD Tally test item

Figure 6

Table 5 The top 15 actions based on the chi-square scores, along with the average occurrence frequency and average occurrence time (min) per person for the Lamp Return test item

Figure 7

Figure 3 An illustration of the illness–death model.

Figure 8

Figure 4 The process of interpreting the result of the proposed model which consist of parameter estimation and comparing the transition probability between correct and incorrect group.

Figure 9

Table 6 The five highest and lowest $\kappa _{1, m, l}$ and $\kappa _{0, m, l}$ for correct and incorrect groups in the USA CD Tally test item

Figure 10

Figure 5 The boxplots of posterior means of individual transition speed parameters ($\tau _i$) across the 14 countries for CD Tally test item.Note: Outliers omitted.

Figure 11

Table 7 Posterior means of $\boldsymbol {\alpha }$ for the CD Tally and Lamp Return test items

Figure 12

Table 8 Posterior means of key action effect ($\beta _{c_i, 1}$ and $\beta _{c_i, 2}$) for CD Tally and Lamp Return test items

Figure 13

Figure 6 Transition probability differences between correct and incorrect groups for USA CD Tally test item: (a) heatmap and (b) network visualization.Note: Blue indicates higher probability for correct group, red for incorrect. Color intensity in (a) and the arrow thickness in (b) represents difference magnitude between two groups. Key actions are highlighted in red text.

Figure 14

Table 9 The five highest and lowest $\kappa _{1, m, l}$ and $\kappa _{0, m, l}$ for correct and incorrect groups in the USA Lamp Return test item

Figure 15

Figure 7 The boxplots of posterior means of individual transition speed parameters ($\tau _i$) across the 14 countries for Lamp Return test item.Note: Outliers omitted.

Figure 16

Figure 8 Network visualization of transition probability differences for the Lamp Return test item in the USA.Note: (a) shows significantly higher transition probabilities for the correct group (blue arrows), while (b) shows higher probabilities for the incorrect group (red arrows). The arrow thickness indicates the magnitude of the difference in transition probability between the two groups. Key actions are highlighted with red text.

Figure 17

Table 10 Results of prior sensitivity analysis of $\beta _{1, 1}$, $\beta _{1, 2}$, $\beta _{0, 1}$, and $\beta _{0, 2}$ for USA CD Tally test item

Figure 18

Table 11 Summary of simulation scenarios

Figure 19

Figure 9 Boxplots summarizing the estimation errors across 200 simulation replications for each scenario.Note: Estimation error is defined as the mean squared error (MSE), calculated by averaging the element-wise squared differences between the estimated and true transition probability matrices for each individual, and then averaging across individuals per run. Each boxplot reflects the distribution of these simulation-level MSEs under varying levels of group heterogeneity and covariate effects.

Supplementary material: File

Park et al. supplementary material

Park et al. supplementary material
Download Park et al. supplementary material(File)
File 5.5 MB