Hostname: page-component-89b8bd64d-r6c6k Total loading time: 0 Render date: 2026-05-05T17:38:15.119Z Has data issue: false hasContentIssue false

Effects of targeted perceptual training on L2 suprasegmental weighting strategies

Published online by Cambridge University Press:  21 January 2026

Magdalena Kachlicka*
Affiliation:
School of Psychological Sciences, Birkbeck University of London , UK
Ashley E. Symons
Affiliation:
Department of Psychology, Royal Holloway University of London , UK
Yaoyao Ruan
Affiliation:
Department of Education, University of Oxford , UK
Kazuya Saito
Affiliation:
Institute of Education, University College London , UK
Frederic Dick
Affiliation:
Experimental Psychology, University College London , UK
Adam Tierney
Affiliation:
School of Psychological Sciences, Birkbeck University of London , UK
*
Corresponding author: Magdalena Kachlicka; Email: m.kachlicka@bbk.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

Learning a second language (L2) is challenging partly due to perceptual strategies inherited from learners’ first language. For example, speakers of tone languages like Mandarin over-use pitch in English prosody perception and production. We developed a novel training paradigm to help Mandarin learners adopt more native-like strategies by enhancing their use of duration relative to pitch cues during prosody categorization. After prosody training, participants used duration more during phrase boundary categorization but showed no clear change for contrastive focus and lexical stress, suggesting that cue weighting training is most effective when targeting a feature’s primary cue. The control group, who practiced English vocabulary, relied more on pitch in lexical stress categorization and phrase boundary production after training, suggesting that without targeted instruction, listeners default to existing strategies. Our findings demonstrate that although default strategies in L2 speech perception are difficult to resist, lifelong perceptual habits can be adjusted with training.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - SA
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-ShareAlike licence (http://creativecommons.org/licenses/by-sa/4.0), which permits re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is used to distribute the re-used or adapted article and the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press
Figure 0

Table 1. Summary of basic demographics, musical training and language background information

Figure 1

Figure 1. Schematic of experimental design. All participants completed Part 1, Part 2 and Part 3 of the study. They completed tasks from Parts 1 and 3 on separate days to avoid fatigue. During the training, they were allowed to take up to 2 days break in total.

Figure 2

Table 2. Description and examples of vocabulary training task

Figure 3

Figure 2. Examples from prosody picture description task instruction. During the testing session, the researcher demonstrated these images and performed sample descriptions as examples. (A) Phrase boundary example: “In this picture, I can see an old lady and two kids. I think they are baking something. At some point, the girl, Jenny, says, ‘I’m going to eat, grandma!’ Most probably, she’s hungry. They all look very happy.” (B) Contrastive focus example: “Two girls in this picture are eating something from their bowls. I think it’s a strawberry yoghurt, not a natural yoghurt, because what they have in their bowls is pink. Pink yoghurt is more likely strawberry. I don’t like strawberry yoghurt.” (C) Lexical stress example: “There is a lady in a pink suit who looks troubled. She doesn’t know how to compress wav files into mp3. Her boss will be angry if she doesn’t do it.” If participants did not know the meaning of the word, they were advised to attempt to describe the picture by including that word in their description, for example, by saying: “There is a lady that is thinking about something. She says ‘compress,’ so she most probably needs to compress something, but she doesn’t know what to do” or even “The picture shows a working woman. I don’t know what the word ‘compress’ means, but the lady is saying it. She’s wearing a pink suit and a nice haircut.” This was to encourage the articulation of all the prosodic contrasts even if participants were not familiar with the target words and their meaning.

Figure 4

Table 3. Summary of effects across mixed-effects regression models for prosody and musical beats categorization tasks

Figure 5

Figure 3. Cue weighting patterns in speech prosody and musical beats categorization tasks. The lines represent the proportion of categorization responses across training groups and sessions, with error bars depicting 95% CI. Participants’ performance is plotted as a function of pitch level (left) and duration level (right) for Vocabulary and Prosody groups to visualize the differences in pitch and duration use during categorization before and after the training. For discussion of observed response patterns, see Supplementary Section S12.

Figure 6

Table 4. Summary of effects in mixed-effects regression models for accuracy in prosody perception (Models 5–7) and prosody production (Models 8–9); separate models for phrase boundary, contrastive focus and lexical stress

Figure 7

Figure 4. Performance accuracy for phrase boundary perception before and after the training. Error bars depict 95% CI.

Figure 8

Table 5. Summary of effects in mixed-effects regression model for pitch and duration use in phrase boundary production

Figure 9

Figure 5. Pitch and duration use during phrase boundary production. Error bars represent standard error of the mean.

Supplementary material: File

Kachlicka et al. supplementary material

Kachlicka et al. supplementary material
Download Kachlicka et al. supplementary material(File)
File 1.3 MB