Hostname: page-component-89b8bd64d-dvtzq Total loading time: 0 Render date: 2026-05-06T08:35:54.513Z Has data issue: false hasContentIssue false

Tracing the diachronic effects of data-driven learning on lexical complexity in EFL learners’ argumentative writing

Published online by Cambridge University Press:  06 May 2026

Yanan Zhao
Affiliation:
Shandong University, China (202220253@mail.sdu.edu.cn)
Jihua Dong*
Affiliation:
Shandong University, China (jdon104@aucklanduni.ac.nz; dongjihua@sdu.edu.cn)
*
Corresponding author: Jihua Dong; Email: jdon104@aucklanduni.ac.nz
Rights & Permissions [Opens in a new window]

Abstract

This study investigates the effectiveness of data-driven learning (DDL) in promoting lexical complexity in Chinese English as a foreign language (EFL) learners’ argumentative writing, tracks developmental trajectories, and examines learners’ perceptions. Adopting a quasi-experimental design, one class (n = 26) received DDL instruction, and the other (n = 22) received non-DDL instruction. Data were collected using triangulation, including argumentative writing samples from five time points, pre- and post-instruction questionnaires and semi-structured interviews. Results showed that learners in the DDL class significantly improved their lexical complexity, while the non-DDL class experienced declines. Across the five time points, nonlinear trajectories were observed in lexical complexity at the individual learner level. Learners reported positive attitudes toward DDL, though some challenges in corpus use remained. These findings provide empirical support for the effectiveness of DDL in promoting lexical complexity development in Chinese EFL learners’ argumentative writing and provide pedagogical implications for corpus-based writing instruction.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of EUROCALL, the European Association for Computer-Assisted Language Learning
Figure 0

Figure 1. The screenshot of the Sketch Engine interface showing a concordance search result “claim”.

Figure 1

Figure 2. The screenshot of the Sketch Engine interface showing collocates of “argue”.

Figure 2

Figure 3. The screenshot of the Sketch Engine interface showing concordances of “strongly argued”.

Figure 3

Table 1. Indices for assessing the lexical complexity

Figure 4

Table 2. Descriptive statistics for overall lexical complexity

Figure 5

Table 3. Results of paired-samples t-tests for overall lexical complexity

Figure 6

Table 4. Results of independent-samples t-tests for overall lexical complexity

Figure 7

Figure 4. Developmental trajectories of the composite z-scores.

Figure 8

Table 5. Results of within-group LME models with fixed factor of time for overall lexical complexity

Figure 9

Table 6. Results of between-group LME models for overall lexical complexity

Figure 10

Table 7. Descriptive statistics for lexical sophistication

Figure 11

Table 8. Results of paired-samples t-tests for lexical sophistication

Figure 12

Table 9. Results of independent-samples t-tests for lexical sophistication

Figure 13

Table 10. Learners’ writing examples from the posttest writing task

Figure 14

Figure 5. Developmental trajectories of lexical sophistication measures.

Figure 15

Table 11. Results of within-group LME models with fixed factor of time for lexical sophistication

Figure 16

Table 12. Results of between-group LME models for lexical sophistication

Figure 17

Table 13. Descriptive statistics for lexical diversity

Figure 18

Table 14. Results of paired-samples t-tests for lexical diversity

Figure 19

Table 15. Results of independent-samples t-tests for lexical diversity

Figure 20

Table 16. Results of within-group LME models with fixed factor of time for lexical diversity

Figure 21

Table 17. Results of between-group LME models for lexical diversity

Figure 22

Figure 6. Developmental trajectories of mattr50_cw.

Figure 23

Figure 7. Results of post-instruction questionnaire.

Figure 24

Table 18. Learner excerpts by theme

Supplementary material: File

Zhao and Dong supplementary material 1

Zhao and Dong supplementary material
Download Zhao and Dong supplementary material 1(File)
File 182 Bytes
Supplementary material: File

Zhao and Dong supplementary material 2

Zhao and Dong supplementary material
Download Zhao and Dong supplementary material 2(File)
File 549.2 KB