Hostname: page-component-5b777bbd6c-w9n4q Total loading time: 0 Render date: 2025-06-19T04:08:20.459Z Has data issue: false hasContentIssue false

Eye-tracking research in instructed second language acquisition

Published online by Cambridge University Press:  22 May 2025

Aline Godfroid*
Affiliation:
Department of Linguistics, Languages, and Cultures, Michigan State University, East Lansing, MI, USA
Bronson Hui
Affiliation:
School of Languages, Literatures, and Cultures, University of Maryland, College Park, MD, USA
*
Corresponding author: Aline Godfroid; Email: godfroid@msu.edu
Rights & Permissions [Opens in a new window]

Abstract

Type
Research Timeline
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - SA
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (http://creativecommons.org/licenses/by-nc-sa/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is used to distribute the re-used or adapted article and the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use.
Copyright
© The Author(s), 2025. Published by Cambridge University Press.

1. Introduction

Eye movement registration, or eye tracking, is the recording of an individual’s eye movements during a task. Because eye movement registration happens online (in real time), it can reveal the nuances of learner attention and cognitive processing during task performance. The online nature of the eye gaze data collected has proven to be a valuable asset in the field of instructed second language acquisition (ISLA), and represents a recent development within eye tracking in the broader field of second language acquisition (SLA). It was not until around 2010 that ISLA researchers recognized the opportunities that eye-tracking technology affords. In this timeline, we present a brief history of eye tracking in SLA, with the goal of situating the development of eye tracking in ISLA within its wider historical context. We review three major time periods, which we refer to as follows: (1) early eye-tracking research with bilinguals (1997 – present); (2) eye tracking as a tool to study language learning processes (2010 – present); and (3) the shift to ecologically valid, remote, and wearable eye tracking (2018 – present). Whereas the first time period lays the historical foundations for this work, the second time period marks the emergence of eye tracking as a tool to study ISLA, and along with the third, is the focus of this timeline. Recently, in the third time period, the accessibility and options available for eye-gaze tracking have expanded significantly, driving further innovation in ISLA research. We highlight major and emerging lines of research in eye tracking in ISLA and review various applications of the technology in terms of research design, settings, and focus. We conclude the narrative portion of this research timeline with our coding scheme of ISLA eye-tracking studies.

2. Early eye-tracking research with bilinguals (1997 – present)

In the late 1990s and early 2000s, eye-tracking research on bilingualism was still a niche area compared to the extensive research with so-called monolinguals already being conducted in cognitive psychology and psycholinguistics (for landmark reviews, see Rayner, Reference Rayner1998, Reference Rayner2009; Tanenhaus & Trueswell, Reference Tanenhaus, Trueswell, Traxler and Gernsbacher2006). Two major paradigms were influential at the time and remain so to this date, namely, eye tracking with reading and the visual world eye-tracking paradigm.Footnote 1 Some of the pioneering eye-tracking research in these two paradigms that we would now consider to mark the beginning of eye tracking in SLA and bilingualism originated in the realms of psychology (Marian & Spivey, Reference Marian and Spivey2003a, Reference Marian and Spivey2003b; Spivey & Marian, Reference Spivey and Marian1999) and linguistics (Frenck-Mestre & Pynte, Reference Frenck-Mestre and Pynte1997), highlighting its interdisciplinary beginnings.

Researchers at the time used eye-movement registration to examine questions about linguistic representation or knowledge, often as it relates to morphology and syntax (e.g., Keating, Reference Keating2009), and sometimes, the lexicon (Marian & Spivey, Reference Marian and Spivey2003a, Reference Marian and Spivey2003b; Spivey & Marian, Reference Spivey and Marian1999). Another focus was crosslinguistic influence and L2-specific factors in parsing, which is the mental process of building and revising syntactic structures to enable language comprehension (Dussias & Sagarra, Reference Dussias and Sagarra2007; Frenck-Mestre & Pynte, Reference Frenck-Mestre and Pynte1997; Roberts et al., Reference Roberts, Gullberg and Indefrey2008). Although eye movements by definition are recorded while participants are performing a task, that is, in real time during processing, the theoretical focus was on the cognitive architecture supporting language acquisition that the eye movements were believed to reflect; that is, eye tracking was primarily a knowledge measure. The materials (sentences) that participants read or listened to were not intended to teach them new information about the language. They were not meant as an instructional intervention (and hence, we will not consider them further in our ISLA timeline), but rather, they were specifically designed to examine linguistic representations (existing knowledge) and parsing procedures that were already established in the bilingual mind as a result of exposure or prior instruction. This idea is sometimes referred to as the “processing outcomes of acquisition” (e.g., Godfroid, Reference Godfroid and Leow2019; VanPatten & Rothman, Reference VanPatten, Rothman and Rebuschat2015). It would be another decade before eye-tracking researchers in SLA, (psycho)linguistics, and applied linguistics discovered that eye tracking could also be used to study how processing can lead to acquisition and started exploring this powerful idea – termed “acquisition-as-processing” (e.g., Godfroid, Reference Godfroid and Leow2019; VanPatten & Rothman, Reference VanPatten, Rothman and Rebuschat2015) – in ISLA research.

3. Eye tracking as a tool to study acquisition-as-processing (2010 – present)

The second wave of eye-tracking research in SLA began around 2010, when eye-movement registration was newly introduced as a processing measureFootnote 2 (Godfroid et al., 2013*; Smith, 2013*). This innovative use of eye tracking empowered researchers to capture learning processes in real time and measure the effects of an instructional intervention designed to promote language learning (see Lai et al., Reference Lai, Tsai, Yang, Hsu, Liu, Lee, Lee, Chiou, Liang and Tsai2013, for a similar review in education on learning-related eye-tracking research). As such, the second wave of eye tracking marked the beginning of eye tracking in ISLA (also see Godfroid, Reference Godfroid and Leow2019, for a review).

Two foundational assumptions informed eye-tracking research in ISLA, namely the eye-mind link (Just & Carpenter, Reference Just and Carpenter1980; Reichle et al., Reference Reichle, Pollatsek and Rayner2006) and Schmidt’s noticing hypothesis (e.g., Schmidt, Reference Schmidt1990). The eye-mind link in applied eye-tracking research is the notion that the eye gaze, as a marker of overt attention, is linked with ongoing cognitive processing; in other words, overt and covert attention are connected, even if the strength of the connection can vary. Attention, of course, is a central component of Schmidt’s noticing hypothesis, which highlights a crucial role for focal attention and a low level of awareness, called noticing, in second language learning. Combined, these two assumptions make eye-movement registration an exquisite tool to examine attentional allocation in language learning.

One line of related language-learning research has examined the role of attention in vocabulary learning, using both unimodal (reading-only) and multimodal (reading-while-listening, captioned/subtitled videos) input. Results generally support the beneficial role of attention, especially for the acquisition of meaning, and especially in reading-only contexts. The more attention a reader pays to an unfamiliar word during reading (i.e., the longer they look at it, as measured with an eye-tracking camera), the more likely they are to recognize or recall that word on a surprise test after the reading (e.g., Elgort et al., 2018*; Godfroid et al., 2013*, 2018*; Pellicer-Sánchez, 2016*). In multimodal studies with captioned/subtitled videos, the attention-learning relationship appears to be more complicated (e.g., Montero Perez et al., 2015*), potentially due to the need for learners to integrate both auditory and visual information in real time. Understanding how L1 and L2 users are able to accomplish this complex task successfully is another focus of contemporary eye-movement research (e.g., for reading-while-listening: Conklin et al., 2020*; for captioned/subtitled videos: Liao et al., 2022*).

The basic question of an attention-learning relationship is now being examined across a range of learning conditions, from implicit (unsupported, meaning-focused, input-only) input conditions to highly explicit (directed, form-focused) interventions (e.g., Indrarathne & Kormos, 2017*). Researchers have designed different forms of modified input with the goal of increasing the salience of the target structures in the input. They do this because salience may impact attention, and attention may increase the likelihood that language learners will notice and potentially learn the new target structures. Examples of input modifications are aural and visual enhancement, glossing, and different manipulations of task instructions designed to direct learners’ attention towards the form and/or meaning of the target structures (as described in the classic work by Doughty & Williams, Reference Doughty, Williams, Doughty and Williams1998, and Sharwood Smith, Reference Sharwood Smith1993). Across all of these, the novel addition of eye-movement registration, again, can tell how effective these interventions are in directing a learner’s attention to specific structures in the input. Eye tracking, therefore, provides the missing link between instructional design and student reception.

4. The shift to ecologically valid, remote, and wearable eye tracking (2018 – present)

In the past five to ten years, eye tracking has emerged as a readily accessible technology for researchers investigating second language learning in different settings, including those that do not require a computer in a lab. The current utilization of eye tracking therefore covers a full spectrum of possibilities, spanning a continuum between experimental control and ecological validity. On the one end of this continuum, we continue to see studies conducted in controlled laboratory settings that allow researchers to systematically manipulate variables, isolate specific factor(s), and strengthen causal inferences through experimental designs. In many of these cases, researchers employ highly accurate and precise, high-speed eye trackers, thus optimizing data quality, at least in principle (see Godfroid et al., Reference Godfroid, Finch and Koh2025). In contrast, the adoption of eye tracking in more naturalistic settings permits the investigation of learners engaging in authentic language learning tasks and activities. A parallel trend, driven in part by the COVID-19 global health crisis, is the shift from lab-based to remote data collection, which provides access to larger and more diverse participant pools. By leveraging webcam-based eye-tracking functionality on platforms like Gorilla, researchers are no longer confined to physical labs. These technological advances – including virtual data collection platforms and a full range of eye-tracking devices with varying technical capabilities – have eliminated the need for participants to remain tethered to a computer setup with a chinrest in an eye-tracking lab. As a result, researchers are free to explore eye-tracking applications in diverse real-world situations, thereby enhancing the external validity of the study.

These applications of eye tracking outside the lab include but are not limited to classrooms (Morell, 2018*), interactional contexts (McDonough et al., 2015*), computer-assisted language learning environments (Fievez et al., Reference Fievez, Montero Perez, Cornillie and Desmet2023; Michel & O’Rourke, 2019*; Michel & Smith, Reference Michel, Smith, Gass, Spinner and Behney2018), and online instruction (Satar & Wigham, 2017*). In some studies, eye movements remain a primary source of data; in other studies, researchers engage in intense triangulation of different research methodologies, whereby eye-gaze tracking provides an additional source of information amidst many others. In the latter case, the various data sources provide researchers with a more complete picture of the sociocognitive and affective processes that take place in L2 instructional settings (e.g., Lindberg et al., 2021*; Morell, 2018*). Given the growing interest in using eye tracking for classroom-based research on second language learning and instruction, wearable eye trackers seem to hold special promise because of their portability. To our knowledge, such research has not been attempted yet in ISLA. However, in education, researchers are actively using wearable eye trackers in classrooms, investigating areas such as the professional vision of the teacher (e.g., selective attention to a classroom situation, leading to classroom management decisions), sometimes with a focus on teacher training (Witt et al., Reference Witt, Schorer, Loffing and Roden2024). A specific research question in this line of work is the role of teaching expertise in understanding a classroom situation (Miller et al., Reference Miller, Correa, Cortina, Phelps and Chamberlain2024).

Another potential application of wearable devices lies in the ability to integrate augmented and virtual realities. This advance has considerable commercial potential, prompting tech giants like Google and Apple to acquire companies specializing in eye-tracking technology. New headsets have interactive eye-tracking applications built into them, which allow the viewer, for example, to make selections in a user interface simply by looking at the relevant objects.

As commercial applications of eye tracking become ubiquitous, we are also beginning to witness the integration of these ideas into the realm of L2 research. For instance, Révész and colleagues (2023*) used a first potential application of interactive eye tracking in vocabulary studies that demonstrated the usefulness of interactive, gaze-contingent support in instructed language learning. In their study, lexical glosses that readers would otherwise overlook (as revealed in skips of the glosses in the eye-movement records) were made visually more salient through yellow highlighting, whereas glosses that readers looked at (as revealed by fixations on the glosses in the eye-movement records) remained unenhanced. The interactive addition of enhancement – that is, only in cases where learners needed it – highlights how eye-movement registration could be integrated as a part of an instructional treatment or language assessment to tailor the input to an individual learner’s behavior. Considering the rapid, continuous pace of technological advancements, we foresee that an increasing number of pioneering studies will begin to incorporate eye tracking in these innovative, interactive applications (e.g., Smith & González-Lloret, Reference Smith and González-Lloret2021).

While these developments are exciting, researchers are reminded of the importance of the balance between ecological validity and methodological rigor. For example, by opting for commonly available technology like video cameras or webcams for recording eye movements, researchers often compromise the accuracy and precision of their recordings. This means that certain topics and questions (e.g., those requiring small interest areas, like words or units below the word level in reading) cannot be investigated in a valid manner with off-the-shelf eye-tracking devices like webcams, even though such work does get attempted sometimes. Despite the many technological advances, we believe that substantive questions should continue to drive the research process more than the technology. Eye-movement registration is simply a tool, and nothing more than that. Researchers enjoy the freedom to use that tool in ways that best suit their research purposes. They are, at the same time, presented with the challenge to judiciously select their eye-tracking equipment and match their research designs with the eye-tracker’s technical capabilities (Godfroid & Hui, Reference Godfroid and Hui2020).

In summary, eye tracking has continued to gain momentum as a valuable research method in ISLA research. Its versatility allows researchers to strike a balance between experimental control and ecological validity, catering to a wide range of research purposes.

5. Preview of research timeline

In what follows, we present a timeline of eye tracking in ISLA research. We review the historical significance of each study and situate it within the broader context of earlier and later works. Our timeline highlights the emergence and evolution of eye tracking as a tool for studying acquisition-as-processing – using eye movements to examine how attention and processing varies across individuals and learning conditions, and how these variations influence or mediate eventual learning outcomes (which are often measured independently through offline tests).

Major themes emerging from this research include investigations into incidental and intentional vocabulary learning during reading. Here, researchers have manipulated variables such as exposure frequency, input enhancement, glosses, and explicit instruction to examine their influence on attention, processing, and learning. Eye-movement registration has also been employed to evaluate and compare the effectiveness of instructional approaches, including processing instruction. Additionally, the affordances of multimodal input for language learning – including captions, reading-while-listening, and picture books – have been a focal point since the early days of eye-tracking in ISLA.

Increasingly, eye movements are being triangulated with other data sources, including stimulated recall, interviews, screen capture, keystroke logging, and skin conductance. These multi-method approaches provide rich accounts of complex activities, adding important insights to research on writing, face-to-face or computer-mediated interaction, and online or in-person teaching. We are also witnessing the emergence of eye tracking in social approaches to ISLA, such as analyzing eye gaze as a multimodal or semiotic resource in classroom or peer-to-peer interactions, or using eye tracking as a tool to promote learner development in L2 writing.

The timeline below includes 73 studies, each coded for the key characteristics outlined in Table 1.

Table 1. Codes for study characteristics

Competing interests

The authors have no competing interests to declare.

Aline Godfroid is Professor in Second Language Studies and TESOL at Michigan State University, where she co-directs the Second Language Studies Eye-Tracking Lab. She specializes in cognitive processes in second language acquisition, L2 psycholinguistics, eye-tracking, vocabulary acquisition, and quantitative research methods. In her research, she seeks to build bridges between psychology, psycholinguistics, second language acquisition, and real-world second language learning. Aline is currently an Associate Editor of Language Learning and Vice President of the European Second Language Association. She is the author of Eye tracking in second language acquisition and bilingualism (Routledge, 2020) and the co-editor (with Holger Hopp) of The Routledge handbook of second language acquisition and psycholinguistics (2023).

Bronson Hui is Assistant Professor of Second Language Acquisition at the University of Maryland. His main research interests are vocabulary acquisition, applied psycholinguistics, and quantitative research methods. His publications have appeared in international, peer-reviewed journals, such as Studies in Second Language Acquisition, Language Learning, Applied Psycholinguistics, and The Modern Language Journal, among other venues. He also serves on the editorial boards of Studies in Second Language Acquisition and TESOL Quarterly. He is associate editor of Digital Studies in Language and Literature.

Footnotes

1. Visual world eye tracking is an audiovisual paradigm used to examine auditory linguistic processing in the presence of real-world objects or images on the screen that are related to the sentences that participants are listening to.

2. Two book chapters appeared in 2010 that possibly mark the earliest use of eye tracking in this time period (Godfroid et al., Reference Godfroid, Housen, Boers, Pütz and Sicola2010; Smith, Reference Smith and Hult2010). We cite the later references by the same authors because they represent more complete reports of the work in full-length research articles.

3. Author names in small capitals indicate the work is mentioned elsewhere in this timeline.

* Indicates that the full reference for this work can be found in the subsequent timeline.

References

Doughty, C., & Williams, J. (1998). Pedagogical choices in focus on form. In Doughty, C., & Williams, J. (Eds.), Focus on form in classroom second language acquisition (pp. 197261). Cambridge University Press.Google Scholar
Dussias, P. E., & Sagarra, N. (2007). The effect of exposure on syntactic parsing in Spanish–English bilinguals. Bilingualism: Language and Cognition, 10(1), 101116. doi:10.1017/S1366728906002847CrossRefGoogle Scholar
Fievez, I., Montero Perez, M., Cornillie, F., & Desmet, P. (2023). Promoting incidental vocabulary learning through watching a French Netflix series with glossed captions. Computer Assisted Language Learning, 36(1–2), 2651. doi:10.1080/09588221.2021.1899244CrossRefGoogle Scholar
Frenck-Mestre, C., & Pynte, J. (1997). Syntactic ambiguity resolution while reading in second and native languages. The Quarterly Journal of Experimental Psychology A: Human Experimental Psychology, 50A(1), 119148. doi:10.1080/027249897392251CrossRefGoogle Scholar
Godfroid, A. (2019). Investigating instructed second language acquisition using L2 learners’ eye-tracking data. In Leow, R. P. (Ed.), The Routledge handbook of second language research in classroom learning (pp. 4457). Routledge.10.4324/9781315165080-4CrossRefGoogle Scholar
Godfroid, A., Finch, B., & Koh, J. (2025). Reporting eye-tracking research in second language acquisition and bilingualism: A synthesis and field-specific guidelines. Language Learning, 75(1), 250294. doi:10.1111/lang.12664CrossRefGoogle Scholar
Godfroid, A., Housen, A., & Boers, F. (2010). A procedure for testing the Noticing Hypothesis in the context of vocabulary acquisition. In Pütz, M., and Sicola, L. (Eds. ), Inside the learner’s mind: Cognitive processing and second language acquisition, John Benjamins. (pp. 169197). doi:10.1075/celcr.13.14godCrossRefGoogle Scholar
Godfroid, A., & Hui, B. (2020). Five common pitfalls in eye-tracking research. Second Language Research, 36(3), 277305. doi:10.1177/0267658320921218CrossRefGoogle Scholar
Just, M. A., & Carpenter, P. A. (1980). A theory of reading: From eye fixations to comprehension. Psychological Review, 87(4), 329354. doi:https://doi.org/10.1037/0033-295X.87.4.329CrossRefGoogle ScholarPubMed
Keating, G. D. (2009). Sensitivity to violations of gender agreement in native and nonnative Spanish: An eye-movement investigation. Language Learning, 59(3), 503535. doi:https://doi.org/10.1111/j.1467-9922.2009.00516.xCrossRefGoogle Scholar
Lai, M., Tsai, M., Yang, F., Hsu, C., Liu, T., Lee, S., Lee, M., Chiou, G., Liang, J., & Tsai, C. (2013). A review of using eye-tracking technology in exploring learning from 2000 to 2012. Educational Research Review, 10, 90115. doi:http://dx.doi.org/10.1016/j.edurev.2013.10.001CrossRefGoogle Scholar
Marian, V., & Spivey, M. (2003a). Bilingual and monolingual processing of competing lexical items. Applied Psycholinguistics, 24(2), 173193. doi:https://doi.org/10.1017/S0142716403000092CrossRefGoogle Scholar
Marian, V., & Spivey, M. (2003b). Competing activation in bilingual language processing: Within- and between-language competition. Bilingualism: Language and Cognition, 6(2), 97115. doi:https://doi.org/10.1017/S1366728903001068CrossRefGoogle Scholar
Michel, M., & Smith, B.(2018). Measuring lexical alignment during L2 peer interaction via written synchronous computer-mediated communication: An eye-tracking study. In Gass, S. M., Spinner, P., & Behney, J. (Eds.), Salience in Second Language Acquisition (pp. . Second language acquisition research series. Routledge.Google Scholar
Miller, K. F., Correa, C., Cortina, K., Phelps, L., & Chamberlain, L. (2024). Learning from (re)experience: What mobile eye-tracking video can help us learn about the cognitive processes of teaching. Frontiers in Education, 9, 1299896. doi:https://doi.org/10.3389/feduc.2024.1299896CrossRefGoogle Scholar
Rayner, K. (1998). Eye movements in reading and information processing: 20 years of research. Psychological Bulletin, 124 (3), 372422. doi:https://doi.org/10.1037/0033-2909.124.3.372CrossRefGoogle ScholarPubMed
Rayner, K. (2009). Eye movements and attention in reading, scene perception, and visual search. The Quarterly Journal of Experimental Psychology, 62(8), 14571506. doi:https://doi.org/10.1080/17470210902816461CrossRefGoogle ScholarPubMed
Reichle, E. D., Pollatsek, A., & Rayner, K. (2006). E–Z Reader: A cognitive-control, serial-attention model of eye-movement behavior during reading. Cognitive Systems Research, 7(1), 422. doi:https://doi.org/10.1016/j.cogsys.2005.07.002CrossRefGoogle Scholar
Roberts, L., Gullberg, M., & Indefrey, P. (2008). Online pronoun resolution in L2 discourse: L1 influence and general learner effects. Studies in Second Language Acquisition, 30(3), 333357. doi:https://doi.org/10.1017/S0272263108080480CrossRefGoogle Scholar
Schmidt, R. (1990). The role of consciousness in second language learning. Applied Linguistics, 11 (2), 129158. doi:https://doi.org/10.1093/applin/11.2.129CrossRefGoogle Scholar
Sharwood Smith, M. (1993). Input enhancement in instructed SLA: Theoretical bases. Studies in Second Language Acquisition, 15(2), 165179. doi:https://doi.org/10.1017/S0272263100011943CrossRefGoogle Scholar
Smith, B. (2010). Employing eye-tracking technology in researching the effectiveness of recasts in CMC. In Hult, F. (Ed.), Directions and prospects for educational linguistics: Educational linguistics (pp. 7997). Springer. doi:https://doi.org/10.1007/978-90-481-9136-9_6CrossRefGoogle Scholar
Smith, B., & González-Lloret, M. (2021). Technology-mediated task-based language teaching: A research agenda. Language Teaching, 54 (4), 518534. doi:https://doi.org/10.1017/S0261444820000233CrossRefGoogle Scholar
Spivey, M. J., & Marian, V. (1999). Cross talk between native and second languages: Partial activation of an irrelevant lexicon. Psychological Science, 10(3), 281284. doi:https://doi.org/10.1111/1467-9280.00151CrossRefGoogle Scholar
Tanenhaus, M. K., & Trueswell, J. C. (2006). Eye movements and spoken language comprehension. In Traxler, M. J., and Gernsbacher, M. A. (Eds.), Handbook of psycholinguistics (2nd ed., pp. 863900). Elsevier. doi:https://doi.org/10.1016/B978-012369374-7/50023-7CrossRefGoogle Scholar
VanPatten, B., & Rothman, J. (2015). What does current generative theory have to say about the explicit-implicit debate? In Rebuschat, P. (Ed.), Implicit and explicit learning of languages, John Benjamins (pp. 91116). doi:https://doi.org/10.1075/sibil.48CrossRefGoogle Scholar
Witt, J., Schorer, J., Loffing, F., & Roden, I. (2024). Eye-tracking research on teachers’ professional vision: A scoping review. Teaching and Teacher Education, 144, 104568. doi:https://doi.org/10.1016/j.tate.2024.104568CrossRefGoogle Scholar
Figure 0

Table 1. Codes for study characteristics