Hostname: page-component-77f85d65b8-5ngxj Total loading time: 0 Render date: 2026-03-28T10:59:43.847Z Has data issue: false hasContentIssue false

An eye-tracking study of the impact of item presentation and item format on cognitive processing in L2 listening assessment

Published online by Cambridge University Press:  26 March 2026

Tingting Liu
Affiliation:
School of English Studies, Sichuan International Studies University , Chongqing, China National Institute of Education, Nanyang Technological University , Singapore
Vahid Aryadoust*
Affiliation:
National Institute of Education, Nanyang Technological University , Singapore
*
Corresponding author: Vahid Aryadoust; Email: vahid.aryadoust@nie.edu.sg
Rights & Permissions [Opens in a new window]

Abstract

Listening ability in second language (L2) assessment is elicited through test methods that often require other abilities, which may introduce construct-irrelevant variance and threaten the validity of the test. While previous research has examined the effects of item presentation and item format on test performance, studies on test-takers’ cognitive processing remain exploratory. This study investigates the effects of two test method variables, item presentation (operationalized as while-listening performance [WLP] vs. post-listening performance [PLP]) and item format (operationalized as multiple-choice questions [MCQs] vs. open-ended questions [OEQs]), on test-takers’ cognitive processing, using gaze behaviors as real-time indicators. A Graeco-Latin square design was employed to administer four psychometrically validated short-talk listening testlets across the test conditions. Eye-tracking data were collected while controlling for word count and typing speed as covariates. Linear mixed-effects modeling revealed higher total fixation duration, fixation counts, total visit duration, and visit counts in the WLP condition than in the PLP condition. Interaction effects for all four metrics further indicated that these differences were more pronounced for MCQs than for OEQs. Additionally, typing speed and word count contributed to the variance in eye-tracking measures. The findings contribute to our growing understanding of how test methods shape listening processes and offer implications for the design and interpretation of L2 listening assessments.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press
Figure 0

Figure 1. A visualization of the research design.Note. “Talk” refers to the script and original items of the Michigan English Test listening sections, whereas “Testlet” refers to the listening testlet adapted for use in the experiment.

Figure 1

Figure 2. A visualization of AOIs for four test conditions.Note. MCQ = multiple-choice question; OEQ = open-ended question; PLP = post-listening performance; WLP = while-listening performance. The actual items were replaced with Sample Test 1001 provided on the Michigan English Test official website for confidentiality.

Figure 2

Table 1. ANOVA Results of the Four Groups’ Language and Typing Proficiencies

Figure 3

Figure 3. Mean of the eye-tracking measures for testlets and AOIs across conditions.

Figure 4

Table 2. Summary of the Fixed and Random Effects for the Eye-tracking Measures (with WLP + MCQ Condition as the Reference Level)

Figure 5

Figure 4. Eye-tracking measures by item presentation and item format.

Figure 6

Figure 5. Effects of word count and typing speed on eye-tracking measures.

Supplementary material: File

Liu and Aryadoust supplementary material

Liu and Aryadoust supplementary material
Download Liu and Aryadoust supplementary material(File)
File 470.5 KB