New data on text reading in English as a second language

The Wave 2 expansion of the Multilingual Eye-Movement Corpus (MECO)

Published online by Cambridge University Press: 12 March 2025

Victor Kuperman*
Affiliation:
McMaster University
Sascha Schroeder
Affiliation:
University of Goettingen
Cengiz Acartürk
Affiliation:
Jagiellonian University
Niket Agrawal
Affiliation:
Indian Institute of Technology Kanpur
Dominick M. Alexandre
Affiliation:
Universidade Federal do Ceará
Lena S. Bolliger
Affiliation:
University of Zurich
Jan Brasser
Affiliation:
University of Zurich
César Campos-Rojas
Affiliation:
Pontificia Universidad Católica de Valparaíso; Millennium Nucleus for the Science of Learning
Denis Drieghe
Affiliation:
University of Southampton
Dušica Filipović Đurđević
Affiliation:
University of Belgrade; University of Novi Sad
Luiz Vinicius Gadelha de Freitas
Affiliation:
Universidade Federal do Ceará
Sofya Goldina
Affiliation:
Université Paris Cité; National Research University Higher School of Economics Moscow
Romualdo Ibáñez Orellana
Affiliation:
Pontificia Universidad Católica de Valparaíso; Millennium Nucleus for the Science of Learning
Lena A. Jäger
Affiliation:
University of Zurich; University of Potsdam
Ómar I. Jóhannesson
Affiliation:
University of Iceland
Anurag Khare
Affiliation:
Indian Institute of Technology Kanpur
Nik Kharlamov
Affiliation:
Aalborg University
Hanne B. S. Knudsen
Affiliation:
Aalborg University
Árni Kristjánsson
Affiliation:
University of Iceland
Charlotte E. Lee
Affiliation:
University of Southampton
Jun Ren Lee
Affiliation:
National Taiwan Normal University
Marina P. T. Leite
Affiliation:
Universidade Federal de Minas Gerais
Simona Mancini
Affiliation:
Basque Center on Cognition, Brain and Language; Ikerbasque, Basque Foundation for Science
Nataša Mihajlović
Affiliation:
University of Novi Sad
Ksenija Mišić
Affiliation:
University of Belgrade
Miloslava Orekhova
Affiliation:
National Research University Higher School of Economics Moscow
Olga Parshina
Affiliation:
National Research University Higher School of Economics Moscow; Middlebury College
Milica Popović Stijačić
Affiliation:
University of Novi Sad; Singidunum University
Athanassios Protopapas
Affiliation:
University of Oslo
David R. Reich
Affiliation:
University of Potsdam
Anurag Rimzhim
Affiliation:
College of the Holy Cross
Rui Rothe-Neves
Affiliation:
Universidade Federal de Minas Gerais
Thais M. M. Sá
Affiliation:
Universidade Federal de Lavras
Andrea Santana Covarrubias
Affiliation:
Pontificia Universidad Católica de Valparaíso
Irina Sekerina
Affiliation:
College of Staten Island of the City University of New York
Heida M. Sigurdardottir
Affiliation:
University of Iceland
Anna Smirnova
Affiliation:
National Research University Higher School of Economics Moscow; University of Groningen
Priyanka Srivastava
Affiliation:
International Institute of Information Technology Hyderabad
Elisangela N. Teixeira
Affiliation:
Universidade Federal do Ceará
Ivana Ugrinic
Affiliation:
University of Oslo
Kerem Alp Usal
Affiliation:
Middle East Technical University
Karolina Vakulya
Affiliation:
University of Plymouth
João M. M. Vieira
Affiliation:
University of Southampton
Ark Verma
Affiliation:
Indian Institute of Technology Kanpur
Denise H. Wu
Affiliation:
National Central University
Jin Xue
Affiliation:
Beijing Institute of Technology; University of Science and Technology Beijing
Sunčica Zdravković
Affiliation:
University of Belgrade; University of Novi Sad
Junjing Zhuo
Affiliation:
University of Science and Technology Beijing; Northeast Normal University
Laoura Ziaka
Affiliation:
University of Oslo; Oslo University Hospital
Noam Siegelman
Affiliation:
Hebrew University of Jerusalem
*Corresponding author: Victor Kuperman; Email: vickup@mcmaster.ca

Abstract

This paper reports an expansion of the English as a second language (L2) component of the Multilingual Eye-Movement Corpus (MECO L2), an international database of eye movements during text reading. Whereas Wave 1 of the MECO project (Kuperman et al., 2023) contained L2 English reading data from readers with 12 different first language (L1) backgrounds, the newly collected dataset adds eye-tracking data on English text reading from 13 distinct L1 backgrounds (N = 660), as well as participants’ scores on component skills of English proficiency and information about their demographics, language background, and language use. The paper reports reliability estimates, descriptive statistics, and correlational analyses as a means of validating the expansion dataset. Consistent with prior literature and MECO Wave 1, trends in the MECO Wave 2 data include a weak correlation between reading comprehension and oculomotor measures of reading fluency, and a greater L1-L2 contrast in reading fluency than in reading comprehension. Together with Wave 1, the MECO project now includes English reading data from more than 1,200 readers representing a diversity of native writing systems (logographic, abjad, abugida, and alphabetic) and 19 distinct L1 backgrounds. We point to multiple new avenues by which L2 reading researchers can mine this rich, publicly available dataset.

Information

Type
Data Report
Creative Commons
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (http://creativecommons.org/licenses/by-nc-nd/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided that no alterations are made and the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use and/or adaptation of the article.
Open Practices
Open data
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Table 1. Information regarding participants in available samples

Figure 1. Means of measures from the eye-tracking task across samples. Error bars stand for ± 1 SE. accuracy = percent comprehension answers correct; ba = Basque; bp = Brazilian Portuguese; ch_s = Chinese simplified; ch_t = Chinese traditional; da = Danish; en_uk = English (UK sample); ge_po = German (Potsdam sample); ge_zu = German (Zurich sample); hi_iiith = Hindi (Hyderabad sample); hi_iitk = Hindi (Kanpur sample); ic = Icelandic; n Fixations = number of fixations; no = Norwegian; refixation = likelihood of second fixation on the word; regressionIn = regression rate; rereading = likelihood of second pass; ru_mo = Russian (Moscow sample); skipping = skipping rate; se = Serbian; sp_ch = Spanish (Chile sample); tr = Turkish.
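As a rough illustration of how per-sample means and ± 1 SE error bars of this kind can be computed, the following sketch groups per-participant values by sample code. It assumes that the SE is the standard error of the mean (SD divided by the square root of the number of participants) taken over per-participant averages; the function name `mean_and_se` and the toy values are hypothetical and are not drawn from the MECO data.

```python
from collections import defaultdict
from math import sqrt
from statistics import mean, stdev


def mean_and_se(rows):
    """Group (sample_code, value) pairs and return {sample: (mean, se)}.

    Assumption: SE = sample SD / sqrt(n), computed over one value per
    participant. Each sample needs at least two values.
    """
    by_sample = defaultdict(list)
    for sample, value in rows:
        by_sample[sample].append(value)
    return {
        sample: (mean(values), stdev(values) / sqrt(len(values)))
        for sample, values in by_sample.items()
    }


# Hypothetical per-participant skipping rates; NOT real MECO values.
toy_rows = [
    ("da", 0.20), ("da", 0.30), ("da", 0.25),
    ("tr", 0.10), ("tr", 0.20), ("tr", 0.15),
]

summary = mean_and_se(toy_rows)
for sample, (m, se) in sorted(summary.items()):
    print(f"{sample}: mean = {m:.3f}, SE = {se:.3f}")
```

The same grouping would apply to any of the measures named in the caption (for example accuracy or regressionIn), given one aggregated value per participant.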

Figure 2. Means of measures of individual differences of English proficiency across samples. Error bars stand for ± 1 SE. ba = Basque; bp = Brazilian Portuguese; cft = score in the CFT test; ch_s = Chinese simplified; ch_t = Chinese traditional; da = Danish; en_uk = English (UK sample); ge_po = German (Potsdam sample); ge_zu = German (Zurich sample); hi_iiith = Hindi (Hyderabad sample); hi_iitk = Hindi (Kanpur sample); ic = Icelandic; no = Norwegian; ru_mo = Russian (Moscow sample); se = Serbian; sp_ch = Spanish (Chile sample); towre: pde = TOWRE, phonemic decoding efficiency subtest (pseudoword naming); towre: swe = TOWRE, sight word efficiency subtest (word naming); vocabulary = vocabulary knowledge (Groups 2-5); tr = Turkish.

Table 2. Correlation table for reading measures (data aggregated across samples, N = 660)

Supplementary material: File

Kuperman et al. supplementary material

File 36.3 KB