Hostname: page-component-77f85d65b8-2tv5m Total loading time: 0 Render date: 2026-03-30T01:13:48.766Z Has data issue: false hasContentIssue false

Optimizing L2 phonetic learning

Spaced vs. massed training schedule

Published online by Cambridge University Press:  19 December 2025

Kazuya Saito*
Affiliation:
University College London , London, United Kingdom Tohoku University, Sendai, Japan
Siying Chen
Affiliation:
University College London , London, United Kingdom
*
Corresponding author: Kazuya Saito; Email: k.saito@ucl.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

This methodological study investigated how the distribution of training sessions—massed, equal spacing, and expanding spacing—affects L2 phonetic learning, focusing on Mandarin-speaking learners’ perception of the English /ɛ/–/æ/ contrast. While most previous phonetic training studies have used massed schedules, the current quasi-experimental design revealed that both types of spaced practice significantly outperformed massed practice in terms of immediate gains and long-term retention. Effect sizes in the spaced groups were approximately double those of the massed group. No significant differences emerged between equal and expanding spacing. These findings suggest that distributed practice—regardless of spacing type—can enhance both the magnitude and durability of L2 phonetic learning. Crucially, this study makes it possible to revisit past findings based on massed training paradigms and to consider whether adopting alternative timing schedules could unlock greater learning potential—for instance, by doubling the size and durability of training effects through the use of spaced conditions.

Information

Type
Methods Forum
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Overview of the experimental procedure across three conditions: Equal spacing (three-day fixed intervals), Expanding spacing (intervals increasing from 0 to 8 days), and Massed practice (six sessions completed in one day). All participants completed a pre-test, and the two experimental groups received immediate post-tests after session 6, followed by a seven-day retention interval and delayed post-tests. Auditory processing aptitude tests were conducted after the delayed post-tests.

Figure 1

Table 1. Target Word pairs used during training and testing (trained items) vs. testing only (untrained items)

Figure 2

Figure 2. Sample screenshot of the two-alternative forced-choice identification task interface used in the perception test. On each trial, participants listened to a spoken word and selected the corresponding written form from two options (e.g., said vs. sad). Instructions were presented bilingually in English and Chinese (“Click the word you heard.” / 点击您所听到的单词) to ensure clarity for all participants. Responses were submitted by clicking one of the two options, after which the task automatically advanced to the next item.

Figure 3

Figure 3. Example screenshots from the phonetic training session. After selecting one of the two word options, participants received immediate visual feedback indicating whether their response was correct (green tick; 3A) or incorrect (red cross; 3B). This was followed by a feedback screen presenting the correct word along with its pronunciation and a bilingual prompt (“The correct word is” / 正确的单词是; 3C). The two-step feedback procedure was designed to enhance phonetic category learning by encouraging accurate form-meaning mapping and reinforcing correct auditory representations.

Figure 4

Table 2. Descriptive statistics of participants’ vowel perception scores at different time points as per group conditions

Figure 5

Figure 4. Vowel perception performance across groups and time. A visual summary of the three groups’ (Control, Expanding, Equal) vowel perception performance (%) at three different time points (Pre, Immediate, Delayed) overall (4A) and across two lexical conditions (4B). While the control/massed group showed improvement at the immediate post-tests, their performance declined at the delayed post-tests. Both expanding and equal groups maintained their performance up to the delayed post-tests.

Figure 6

Table 3. Results of mixed-effects model analysis for L2 vowel perception

Figure 7

Table 4. Results of multiple comparison analyses

Figure 8

Table 5. Results of multiple regression analyses