Hostname: page-component-89b8bd64d-ktprf Total loading time: 0 Render date: 2026-05-10T08:44:25.804Z Has data issue: false hasContentIssue false

Modeling relationships between learning conditions, processes, and outcomes: An introduction to mediation analysis in SLA research

Published online by Cambridge University Press:  10 July 2025

Ruirui Jia*
Affiliation:
University of Maryland , College Park, MD, USA
Bronson Hui
Affiliation:
University of Maryland , College Park, MD, USA
*
Corresponding author: Ruirui Jia; Email: rjia@umd.edu
Rights & Permissions [Opens in a new window]

Abstract

In the past decade, researchers have been increasingly interested in understanding the process of language learning, in addition to the effect of instructional interventions on L2 performance gains (i.e., learning products). One goal of such investigations is to reveal the interplay between learning conditions, processes, and outcomes where, for example, certain conditions can promote attention to the learning targets, which in turn facilitates learning. However, the statistical modeling approach taken often does not align with the conceptualization of the complex relationships between these variables. Thus, in this paper, we introduce mediation analysis to SLA research. We offer a step-by-step, contextualized tutorial on the practical application of mediation analysis in three different research scenarios, each addressing a different research design using either simulated or open-source datasets. Our overall goal is to promote the use of statistical techniques that are consistent with the theorization of language learning processes as mediators.

Information

Type
Methods Forum
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - SA
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (http://creativecommons.org/licenses/by-nc-sa/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is used to distribute the re-used or adapted article and the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1a. Moderation example 1.

Figure 1

Figure 1b. Moderation example 2.

Figure 2

Figure 1c. Moderation diagrams (Field, 2018).

Figure 3

Figure 2. The mediation model (Field, 2018).

Figure 4

Figure 3. The experiment flow chart of Tytko et al. (2024).

Figure 5

Figure 4. Visualization of the mediation model of Example 1.

Figure 6

Table 1. Sample dataset for Example 1

Figure 7

Table 2a. Descriptive statistics of test scores by feedback types

Figure 8

Table 2b. Descriptive statistics of awareness by feedback types

Figure 9

Figure 5a. Visualization of test scores by feedback types.

Figure 10

Figure 5b. Visualization of awareness by feedback types.Note: NR: No Report; N: Noticing; U: Understanding

Figure 11

Table 3. Summary of the mediation model (Example 1)

Figure 12

Figure 6. Visualization of the mediation model of Example 2.

Figure 13

Figure 7. Procedure of the hypothetical experiment in example 2.

Figure 14

Table 4. Descriptive statistics of cognitive load rating and VOCD by task complexity

Figure 15

Figure 8. Visualizing the descriptive statistics of cognitive load rating and VOCD.

Figure 16

Figure 9. Visualizing the relationship between cognitive load and VOCD by task complexity.

Figure 17

Figure 10. The statistical diagram of the mediation model (Montoya & Hayes, 2017).Note: In this model, the predictor X, which refers to task complexity (simple vs. complex), is modeled as a fixed variable. This is because in a within-subject design, participants experience both treatment conditions, meaning that it does not vary across participants.mdiff” represents the differences in cognitive load rating between the simple and complex tasks, indicating how cognitive load rating changes due to task complexity. “ydiff” refers to the differences in lexical diversity scores indexed by VOCD, representing how lexical diversity scores change due to the changes in cognitive load rating and task complexity. The inclusion of the grand-mean centered covariate is to account for the absolute level of cognitive load rating. In other words, it controls for individual differences in cognitive load rating, as participants with higher levels of cognitive load rating might exhibit different levels of lexical diversity compared to those with lower levels of cognitive load rating. Difference scores, which reflect only within-participant differences, cannot capture between-participant variations. Thus, to account for individual differences, absolute values must be controlled for.

Figure 18

Table 5. Sample dataset for Example 2

Figure 19

Figure 11. The MEMORE output for Example 2.

Figure 20

Figure 12. Path diagram of the eye-tracking mediation model (binary outcome).

Figure 21

Table 6. Sample eye-tracking dataset

Figure 22

Table 7. Descriptive statistics of dwell time percentage on images and comprehension accuracy by reading conditions

Figure 23

Figure 13. Visualization of raw and mean dwell time percentage to images by reading conditions.

Figure 24

Figure 14. Visualization of the proportion of accuracy by reading conditions.

Figure 25

Table 8. Random effects of model 1

Figure 26

Table 9. Model comparison results

Figure 27

Table 10a. Mediation model summary output from the brm() function

Figure 28

Table 10b. Mediation model summary output from the mediation() function