Hostname: page-component-5db58dd55d-lqwgf Total loading time: 0 Render date: 2026-06-03T10:44:59.126Z Has data issue: false hasContentIssue false

Lexical distributions of Spanish dialects and their processing implications for Chilean Spanish

Published online by Cambridge University Press:  15 January 2026

Julian van Bijnen
Affiliation:
Donders Centre for Cognition, Radboud Universiteit Nijmegen, Netherlands
Roberto A. Ferreira
Affiliation:
Millennium Nucleus for the Science of Learning (MiNSoL), Chile Facultad de Ciencias de la Educación, Universidad de Talca, Chile
Ton Dijkstra*
Affiliation:
Donders Centre for Cognition, Radboud Universiteit Nijmegen, Netherlands
*
Corresponding author: Ton Dijkstra; Email: t.dijkstra@donders.ru.nl
Rights & Permissions [Opens in a new window]

Abstract

Word frequency databases like SPALEX and SUBTLEX-ESP treat Spanish as a uniform language, but prior studies and an initial survey (Experiment 1) revealed significant lexical differences between Spanish in Spain and Latin American countries, especially Chile. To establish subjective frequencies of Spanish word usage, an extended survey (Experiment 2) was conducted with Chilean participants, categorizing words by usage area: General, Spain, Chile, and Latin America. Consistent with the initial survey, Chilean participants assigned subjective higher ratings to General and Chilean words. In a lexical decision experiment (Experiment 3), participants responded faster and more accurately to words from these categories. Using survey data, simulations with Multilink+ (Experiment 4) revealed that subjective word ratings better predicted Chilean reaction times than frequencies from existing databases. These findings emphasize the need to address Spanish dialectal differences in research, with word ratings offering a more accurate measure of region-specific lexical nuances than current databases.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press
Figure 0

Figure 1. Distribution of Spanish dialects across the world according to Lipski (2012).

Figure 1

Table 1. SPALEX and SUBTLEX-ESP occurrences per million (OPM) for typical words from Spain and Latin America

Figure 2

Table 2. Correlations between dialects for HIGHDIF words

Figure 3

Figure 2. A heatmap of the correlation matrix for all dialect pairs in the survey.

Figure 4

Table 3. Correlations between country ratings and log word frequencies from SPALEX or SUBTLEX-ESP

Figure 5

Figure 3. Scatterplots of current Chilean word ratings versus previous ratings (freq_survey) from (a) Chile (r = .95), (b) Argentina (r = .86), (c) Mexico (r = .70), and (d) Spain (r = .14).

Figure 6

Figure 4. Scatterplots of Chilean survey word ratings versus log word frequencies from (a) SPALEX (r = .55) and (b) SUBTLEX-ESP (r = .70) and of lexical decision reaction times (rt_LD) versus log word frequencies from (c) SPALEX (r = −.54) and (d) SUBTLEX-ESP (r = −.69). All significances at p < .0001.

Figure 7

Figure 5. Scatterplots of lexical decision reaction times (LD RTs) versus (a) SPALEX RTs (r = .62), (b) survey word ratings (r = −.75), (c) cycle times from combined converted frequencies (r = .84), and (d) SUBTLEX-ESP frequencies (r = .73). All significances at p < .0001.

Figure 8

Table 4. Summary statistics for all word categories in Experiment 3 (lexical decision)

Figure 9

Table 5. Correlations between lexical decision reaction times (RTs) and cycle times (CTs) based on different word frequency sources

Supplementary material: File

van Bijnen et al. supplementary material

van Bijnen et al. supplementary material
Download van Bijnen et al. supplementary material(File)
File 188.8 KB