Hostname: page-component-89b8bd64d-j4x9h Total loading time: 0 Render date: 2026-05-07T22:34:40.692Z Has data issue: false hasContentIssue false

Text reading in English as a second language: Evidence from the Multilingual Eye-Movements Corpus

Published online by Cambridge University Press:  08 March 2022

Victor Kuperman*
Affiliation:
McMaster University, Hamilton, ON, Canada
Noam Siegelman
Affiliation:
Haskins Laboratories, New Haven, CT, USA
Sascha Schroeder
Affiliation:
University of Goettingen, Goettingen, Germany
Cengiz Acartürk
Affiliation:
Jagiellonian University, Kraków, Poland Middle East Technical University, Ankara, Turkey
Svetlana Alexeeva
Affiliation:
Saint Petersburg State University, Saint Petersburg, Russia
Simona Amenta
Affiliation:
University of Trento, Trento, Italy
Raymond Bertram
Affiliation:
University of Turku, Turku, Finland
Rolando Bonandrini
Affiliation:
University of Milano-Bicocca, Milan, Italy
Marc Brysbaert
Affiliation:
Ghent University, Ghent, Belgium
Daria Chernova
Affiliation:
Saint Petersburg State University, Saint Petersburg, Russia
Sara Maria Da Fonseca
Affiliation:
University of Oslo, Oslo, Norway
Nicolas Dirix
Affiliation:
Ghent University, Ghent, Belgium
Wouter Duyck
Affiliation:
Ghent University, Ghent, Belgium
Argyro Fella
Affiliation:
University of Nicosia, Nicosia, Cyprus
Ram Frost
Affiliation:
The Hebrew University, Jerusalem, Israel
Carolina A. Gattei
Affiliation:
Universidad de Buenos Aires, Buenos Aires, Argentina Universidad Torcuato di Tella, Buenos Aires, Argentina Pontificia Universidad Católica Argentina, Buenos Aires, Argentina
Areti Kalaitzi
Affiliation:
University of Oslo, Oslo, Norway
Kaidi Lõo
Affiliation:
University of Tartu, Tartu, Estonia
Marco Marelli
Affiliation:
University of Milano-Bicocca, Milan, Italy
Kelly Nisbet
Affiliation:
McMaster University, Hamilton, ON, Canada
Timothy C. Papadopoulos
Affiliation:
University of Cyprus, Nicosia, Cyprus
Athanassios Protopapas
Affiliation:
University of Oslo, Oslo, Norway
Satu Savo
Affiliation:
University of Turku, Turku, Finland
Diego E. Shalom
Affiliation:
Universidad de Buenos Aires, Buenos Aires, Argentina Universidad Torcuato di Tella, Buenos Aires, Argentina
Natalia Slioussar
Affiliation:
Saint Petersburg State University, Saint Petersburg, Russia Higher School of Economics (HSE) Moscow, Moscow, Russia
Roni Stein
Affiliation:
The Hebrew University, Jerusalem, Israel
Longjiao Sui
Affiliation:
Ghent University, Ghent, Belgium
Analí Taboh
Affiliation:
Universidad de Buenos Aires, Buenos Aires, Argentina Universidad Torcuato di Tella, Buenos Aires, Argentina
Veronica Tønnesen
Affiliation:
University of Oslo, Oslo, Norway
Kerem Alp Usal
Affiliation:
Middle East Technical University, Ankara, Turkey
*
*Corresponding author. E-mail: vickup@mcmaster.ca
Rights & Permissions [Opens in a new window]

Abstract

Research into second language (L2) reading is an exponentially growing field. Yet, it still has a relatively short supply of comparable, ecologically valid data from readers representing a variety of first languages (L1). This article addresses this need by presenting a new data resource called MECO L2 (Multilingual Eye Movements Corpus), a rich behavioral eye-tracking record of text reading in English as an L2 among 543 university student speakers of 12 different L1s. MECO L2 includes a test battery of component skills of reading and allows for a comparison of the participants’ reading performance in their L1 and L2. This data resource enables innovative large-scale cross-sample analyses of predictors of L2 reading fluency and comprehension. We first introduce the design and structure of the MECO L2 resource, along with reliability estimates and basic descriptive analyses. Then, we illustrate the utility of MECO L2 by quantifying contributions of four sources to variability in L2 reading proficiency proposed in prior literature: reading fluency and comprehension in L1, proficiency in L2 component skills of reading, extralinguistic factors, and the L1 of the readers. Major findings included (a) a fundamental contrast between the determinants of L2 reading fluency versus comprehension accuracy, and (b) high within-participant consistency in the real-time strategy of reading in L1 and L2. We conclude by reviewing the implications of these findings to theories of L2 acquisition and outline further directions in which the new data resource may support L2 reading research.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2022. Published by Cambridge University Press
Figure 0

Figure 1. Publications on L2 and bilingualism presented as percentage of the total number of documents in Web of Science per year, in 1980–2019. The number of publications per year is reported within the plot.

Figure 1

Table 1. Information regarding participants in available samples

Figure 2

Table 2. The number of sentences, number of words and average word length, as well as readability indices of the texts used in the eye-tracking English reading task

Figure 3

Figure 2. Means of measures from the eye-tracking task across samples. Error bars stand for ± 1 SE. accuracy: percent answers correct; acc: comprehension accuracy; dur: total fixation time; firstfix.dur: first fixation duration; firstrun.dur: gaze duration; nfix: number of fixations; rate: reading rate; refix: likelihood of second fixation on the word; reg.in: regression rate; reread: likelihood of second pass; skip: skipping rate; du: Dutch; ee: Estonian; en: English; fi: Finnish; ge: German; gr: Greek; he: Hebrew; it: Italian; no: Norwegian; ru: Russian; sp: Spanish; tr: Turkish. See online version for color figures.

Figure 4

Figure 3. Means of measures of individual differences of English proficiency across samples. Error bars stand for ± 1 SE. cft: score in the CFT test; towre: pde: TOWRE, Phonemic Decoding Efficiency subtest (pseudoword naming); towre: swe: TOWRE, Sight Word Efficiency subtest (word naming); vocabulary: vocabulary knowledge (Groups 2–5); du: Dutch; ee: Estonian; en: English; fi: Finnish; ge: German; gr: Greek; he: Hebrew; it: Italian; no: Norwegian; ru: Russian; sp: Spanish; tr: Turkish. See online version for color figures.

Figure 5

Table 3. Correlation table for reading measures (data aggregated across samples, N = 543). Values above the diagonal show Pearson correlation coefficients; values below the diagonal show p values (p-value shown as 0 stands for p < .001), and significant correlations (p < .05) appear in bold text.

Figure 6

Figure 4. Stepwise partitioning of variance in L2 fluency and comprehension: Step 1: Relative contribution of L1 reading proficiency (i.e., corresponding L1 variable); Step 2: component skills of L2 reading and extralinguistic factors; and Step 3: cross-sample variability in L2 sites. acc: comprehension accuracy; dur: total fixation time; firstfix.dur: first fixation duration; firstrun.dur: gaze duration; nfix: number of fixations; rate: reading rate; refix: likelihood of second fixation on the word; reg.in: regression rate; reread: likelihood of second pass; skip: skipping rate. See online version for color figures.

Figure 7

Figure 5. Correlation between reading rate in L1 (scaled within samples) and L2, among L2 readers of English, N = 412.

Figure 8

Figure 6. Alternative partitioning of variance in L2 fluency and comprehension. L1 reading: unique contribution of L1 reading skill (i.e., corresponding L1 variable); skill tests: component skills of L2 reading and extralinguistic factors; L2 sample: cross-sample variability in L2 sites; shared: portions of variance explained by more than one variable; acc: comprehension accuracy; dur: total fixation time; firstfix.dur: first fixation duration; firstrun.dur: gaze duration; nfix: number of fixations; rate: reading rate; refix: likelihood of second fixation on the word; reg.in: regression rate; reread: likelihood of second pass; skip: skipping rate. See online version for color figures.

Supplementary material: File

Kuperman et al. supplementary material

Kuperman et al. supplementary material

Download Kuperman et al. supplementary material(File)
File 49 KB