Hostname: page-component-89b8bd64d-9prln Total loading time: 0 Render date: 2026-05-06T16:30:17.778Z Has data issue: false hasContentIssue false

The influence of word co-occurrence frequency on predictive processing in first and second languages: A webcam-based eye-tracking study

Published online by Cambridge University Press:  04 August 2025

Haerim Hwang*
Affiliation:
Department of English, The Chinese University of Hong Kong, Hong Kong, China
Sun Hee Park
Affiliation:
Department of Korean Studies, Ewha Womans University, Seoul, Republic of Korea
*
Corresponding author: Haerim Hwang; Email: haerimhwang@cuhk.edu.hk
Rights & Permissions [Opens in a new window]

Abstract

The questions of whether first language (L1) speakers and second language (L2) learners can both predict what follows based on given linguistic cues and what factors may influence this predictive processing are still underexplored. Prior research has focused on the success or failure of predictions in real-time processing, paying relatively less attention to the speed of prediction. This study addresses these gaps by investigating the role of word co-occurrence frequency and proficiency in L1 and L2 predictive processing, using the Korean classifier system. In a webcam-based visual-world eye-tracking experiment, both L1-Korean speakers and L2-Korean learners showed sound predictive processing, with the frequency of co-occurrence between classifiers and nouns playing a crucial role. Higher co-occurrence frequency expedited predictive processing for L1-Korean speakers and boosted the ability to make online predictions for L2-Korean learners. The study also revealed a proficiency effect, where more advanced L2-Korean learners made predictions regardless of co-occurrence frequency, unlike their less advanced counterparts. Our findings suggest that predictive mechanisms in L1 and L2 operate in a qualitatively similar way. In addition, the use of webcam eye-tracking is expected to create a more inclusive and equitable research environment for (applied) psycholinguistics.

Information

Type
Original Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - SA
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (https://creativecommons.org/licenses/by-nc-sa/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is used to distribute the re-used or adapted article and the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Table 1. Background information of participants

Figure 1

Figure 1. Example visual scene in the visual-world eye-tracking task.

Figure 2

Figure 2. Example item in the fill-in-the-blank task.

Figure 3

Table 2. Output from the logistic mixed-effects regression model for the fill-in-the-blank data

Figure 4

Figure 3. Elogit-transformed proportion of fixations to the target versus the competitor by group and co-occurrence frequency.Notes: The shaded area around the lines indicates 95% confidence intervals. The highlighted section between −2,537 ms (first classifier offset) and 0 ms (noun onset) indicates the predictive region.

Figure 5

Table 3. Output from the growth curve analysis model for the eye-tracking data including group and co-occurrence frequency

Figure 6

Table 4. Output from the generalized additive mixed model for the eye-tracking data including group and co-occurrence frequency

Figure 7

Figure 4. Difference between the elogit-transformed proportions of fixations to the target and the competitor over time during the predictive region by group and co-occurrence frequency.Notes. A value greater than zero on the y-axis indicates that the Target received more fixations than the Competitor, whereas a value less than zero suggests the opposite pattern, with the Competitor receiving more fixations than the Target. The vertical dashed lines represent the time period where significant differences between the conditions under comparison emerge.

Figure 8

Table 5. Output from the growth curve analysis model for the eye-tracking data including proficiency group and co-occurrence frequency

Figure 9

Table 6. Output from the generalized additive mixed model for the eye-tracking data including proficiency group and co-occurrence frequency

Figure 10

Figure 5. Difference between the elogit-transformed proportions of fixations to the target and the competitor over time during the predictive region by proficiency group and co-occurrence frequency. Notes: A value greater than zero on the y-axis indicates that the Target received more fixations than the Competitor, whereas a value less than zero suggests the opposite pattern, with the Competitor receiving more fixations than the Target. The vertical dashed lines represent the time period where significant differences between the conditions under comparison emerge.

Supplementary material: File

Hwang and Park supplementary material

Hwang and Park supplementary material
Download Hwang and Park supplementary material(File)
File 211.7 KB