Hostname: page-component-77f85d65b8-g4pgd Total loading time: 0 Render date: 2026-03-29T16:03:09.001Z Has data issue: false hasContentIssue false

Measuring the development of lexical richness of L2 Spanish: A longitudinal learner corpus study

Published online by Cambridge University Press:  31 July 2023

María Díez-Ortega*
Affiliation:
Stanford Language Center, Stanford University, Stanford, CA, USA
Kristopher Kyle
Affiliation:
Department of Linguistics, University of Oregon, Eugene, OR, USA
*
Corresponding author: María Díez-Ortega; Email: merydior@gmail.com
Rights & Permissions [Opens in a new window]

Abstract

Research has indicated that lexical richness is an important indicator of second language (L2) proficiency. However, most research has examined written, cross-sectional English L2 corpora and does not necessarily indicate how spoken lexical use develops over time or whether observed trends are stable across L2s. This study adds to previous research on the development of spoken vocabulary by investigating lexical features of L2 Spanish learners over a 21-month period, using the LANGSNAP corpus. Multiple lexical richness indices used in previous studies were examined including lexical diversity, word frequency, word concreteness, and bigram strength of association. Linear mixed-effects models were run to examine changes over time. The results suggest that although some features of lexical richness (e.g., word frequency) see meaningful change over time, others (e.g., bigram T score) may not be indicative of L2 oral development.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Open Practices
Open data
Copyright
© The Author(s), 2023. Published by Cambridge University Press
Figure 0

Table 1. Collection points, topics, and average words per learner

Figure 1

Table 2. Descriptive statistics for lexical diversity scores across six points

Figure 2

Figure 1. Visualization of group means and individual trajectories.

Figure 3

Table 3. Selected pairwise results for lexical diversity

Figure 4

Table 4. Adjacent pairwise results for lexical diversity

Figure 5

Figure 2. Visualization of pairwise comparisons for lexical diversity. Overlapping red lines indicate that comparisons are not significant (p > .05).

Figure 6

Table 5. Descriptive statistics for content word frequency scores across six points

Figure 7

Figure 3. Visualization of group means and individual trajectories.

Figure 8

Table 6. Selected pairwise results for content word frequency

Figure 9

Table 7. Adjacent pairwise results for content word frequency

Figure 10

Figure 4. Visualization of pairwise comparisons for content word frequency.

Figure 11

Table 8. Descriptive statistics for verb frequency scores across six points

Figure 12

Figure 5. Visualization of group means and individual trajectories.

Figure 13

Table 9. Selected pairwise results for verb frequency

Figure 14

Table 10. Adjacent pairwise results for verb frequency

Figure 15

Figure 6. Visualization of pairwise comparisons for verb frequency.

Figure 16

Table 11. Descriptive statistics for bigram MI scores across six points

Figure 17

Figure 7. Visualization of group means and individual trajectories.

Figure 18

Table 12. Selected pairwise results for bigram MI

Figure 19

Table 13. Adjacent pairwise results for bigram MI

Figure 20

Figure 8. Visualization of pairwise comparisons for bigram MI.

Figure 21

Table 14. Descriptive statistics for bigram T scores across six points

Figure 22

Figure 9. Visualization of group means and individual trajectories.

Figure 23

Table 15. Descriptive statistics for concreteness across six points

Figure 24

Figure 10. Visualization of group means and individual trajectories.

Figure 25

Table 16. Selected pairwise results for concreteness scores

Figure 26

Table 17. Adjacent pairwise results for concreteness scores

Figure 27

Figure 11. Visualization of pairwise comparisons for concreteness scores.

Figure 28

Table 18. Correlation matrix of lexical indices