Hostname: page-component-89b8bd64d-9prln Total loading time: 0 Render date: 2026-05-07T15:39:09.679Z Has data issue: false hasContentIssue false

Unattended distributional training can shift phoneme boundaries

Published online by Cambridge University Press:  25 March 2022

Kateřina Chládková*
Affiliation:
Institute of Psychology, Czech Academy of Sciences, Czech Republic Institute of Czech Language and Theory of Communication, Charles University, Prague, Czech Republic Amsterdam Center for Language and Communication, University of Amsterdam, the Netherlands
Paul Boersma
Affiliation:
Amsterdam Center for Language and Communication, University of Amsterdam, the Netherlands
Paola Escudero
Affiliation:
The MARCS Institute for Brain Behaviour and Development, Western Sydney University, Penrith, Australia Australian Research Council Centre of Excellence for the Dynamics of Language, Australian National University, Canberra, Australia
*
Address for correspondence: Kateřina Chládková, Amsterdam Center for Language and Communication, University of Amsterdam, P.O. Box 1642, Amsterdam, NL 1090BB Email: k.chladkova@uva.nl
Rights & Permissions [Opens in a new window]

Abstract

Listeners are sensitive to speech sounds’ probability distributions. Distributional training (DT) studies with adults typically involve conscious activation of phoneme labels. We show that distributional exposure can shift existing phoneme boundaries (Spanish /e/–/i/) pre-attentively. Using a DT paradigm involving two bimodal distributions we assessed listener's neural discrimination across three sounds, showing pre-to-post-test improvement for the two adjacent sounds that fell into different clusters of the trained distribution than for those that fell into one cluster. Upon unattended exposure to an intricate stimulus set, listeners thus relocate native phoneme boundaries. We assessed whether the paradigm also works for category creation (Spanish establishing a duration contrast), where it has methodological advantages over the usual unimodal-versus-bimodal paradigm. DT yielded a greater effect for the /e/–/i/ boundary shift than for duration contrast creation. It seems that second-language phoneme contrasts similar to native ones might be easier to acquire than new contrasts.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2022. Published by Cambridge University Press
Figure 0

Figure 1. (a) Typical unimodal and bimodal training distributions as used in the distributional training literature. The left and middle graphs show “continuously sampled” bimodal and unimodal training distributions such as those reported in Wanrooij and Boersma (2013), where the actual stimuli are represented by the vertical lines. The right graph illustrates the locations of the test stimuli traditionally used to assess the effects of distributional training. (b) Illustration of two bimodal training distributions (left and middle), and three test stimuli (right). The right graph illustrates that under the lowered-boundary distribution (solid curve), test stimuli Y and Z fall within the same peak, whereas test stimulus X falls into the other peak. In contrast, under the raised-boundary distribution (dashed curve), test stimuli X and Y fall within the same peak, whereas test stimulus Z falls into the other peak. (c) The actual training distributions used in the present experiment for the low-boundary (left graph, solid line) and high-boundary (middle graph, dashed line) training groups; for the meaning of D1, S, and D2, see text. The values of peak locations are marked on the x-axis for each training dimension (F1 and duration): top row of marks = F1 values in Hz; bottom row = duration values in ms. The right graph shows the stimuli used in the oddball paradigm in pre- and post-test, and their values in Hz or ms. The x-axis is scaled in ERB for F1 and logarithmically for duration.

Figure 1

Table 1. Predicted discrimination per boundary location, and the division of the 40 participants into the 4 groups, i.e., 2 auditory dimensions times 2 boundary locations.

Figure 2

Figure 2: The grand average ERPs at FCz of Deviant 1 and Deviant 2 in pre- (black dashed line) and posttest (black solid line), and the posttest-pretest difference (red online, grey in print, solid line), plotted for each training boundary location, and dimension. Shading shows the 100-ms window in which we searched for the grand-peak in the MMN analysis.

Figure 3

Figure 3: The absolute difference waves at FCz for each dimension, training boundary location, and stimulus type. The MMR analysis window is shaded. Note that for the duration dimension, two windows are marked, because the analysis window differed between D1 (earlier window) and D2 (later window), see section 2.6.

Figure 4

Table 2: The absolute MMR (means and 95% confidence intervals in μV) elicited by D1 and D2 in low and high boundary training groups, for the F1 dimension and for duration.

Supplementary material: PDF

Chládková et al. supplementary material

Appendix

Download Chládková et al. supplementary material(PDF)
PDF 512.9 KB