Hostname: page-component-6766d58669-l4t7p Total loading time: 0 Render date: 2026-05-14T20:41:07.575Z Has data issue: false hasContentIssue false

Subsegmental representation in child speech production: structured variability of stop consonant voice onset time in American English and Cantonese

Published online by Cambridge University Press:  05 August 2022

Eleanor CHODROFF*
Affiliation:
University of York, Department of Language and Linguistic Science, York, UK
Leah BRADSHAW
Affiliation:
University of Zurich, Institute of Computational Linguistics, Zurich, Switzerland
Vivian LIVESAY
Affiliation:
Mount Holyoke College, Department of Psychology and Education, South Hadley, Massachusetts, USA
*
*Corresponding author. Eleanor Chodroff, University of York, Department of Language and Linguistic Science, York, UK, YO10 5DD. Email: eleanor.chodroff@york.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

Voice onset time (VOT) of aspirated stop consonants is marked by variability and systematicity in adult speech production. The present study investigated variability and systematicity of voiceless aspirated stop VOT from 161 two- to five-year-old talkers of American English and Cantonese. Overall, many aspects of child VOT productions parallel adult patterns, the analysis of which can help inform our understanding of early speech production. For instance, VOT means were comparable between children and adults, despite greater variability. Further, across children in both languages, talker-specific VOT means were strongly correlated between [th] and [kh]. This correlation may reflect a constraint of “target uniformity” that minimizes variation in the phonetic realization of a shared distinctive feature. Therefore findings suggest that target uniformity is not merely a product of a mature grammar, but may instead shape speech production representations in children as young as two years of age.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NC
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial licence (http://creativecommons.org/licenses/by-nc/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use.
Copyright
© The Author(s), 2022. Published by Cambridge University Press
Figure 0

Table 1. Previously reported VOT values in milliseconds from adult talkers of English and Cantonese. M refers to the mean, SD to the standard deviation, RM to the range of talker means, RSD to the range of talker standard deviations, and RI to the range of individual VOT values.

Figure 1

Table 2. Previously reported VOT values in milliseconds from child talkers of English and Cantonese. M refers to the mean, SD to the standard deviation, RM to the range of talker means, RSD to the range of talker standard deviations, and RI to the range of individual VOT values.

Figure 2

Table 3. American English talker-specific means and standard deviations (SD) of [th] VOT in milliseconds presented for each age group and overall. The grand mean, SD, and range of talker means are taken over the distribution of talker-specific VOT means. The mean talker SD and range of talker SDs are taken over the distribution of talker-specific VOT standard deviations. Adult values calculated from the Chodroff and Wilson (2017) isolated speech data are reproduced here for comparison.

Figure 3

Table 4. American English talker-specific means and standard deviations (SD) of [kh] VOT in milliseconds presented for each age group and overall. The grand mean, SD, and range of talker means are taken over the distribution of talker-specific VOT means. The mean talker SD and range of talker SDs are taken over the distribution of talker-specific VOT standard deviations. Adult values calculated from the Chodroff and Wilson (2017) isolated speech data are reported here for comparison.

Figure 4

Table 5. Correlations between American English talker-specific means for [th] and [kh] along with 95% bootstrapped confidence intervals. Adult data calculated from the Chodroff and Wilson (2017) isolated speech study are reported here for comparison. ** reflects p < 0.001.

Figure 5

Figure 1. Variation and covariation of VOT means (ms) across American English child talkers at a) age two, b) age three, c) age four, and d) age five. Each ellipsis is centered on the paired talker-specific means for [th] and [kh], and the diameter reflects the stop-specific standard deviation, scaled by 0.5. Marginal histograms show variation in talker means. The dashed line reflects the line of equality, the solid line the best-fit linear regression line, and the gray shading the local confidence interval around the best-fit linear regression line.

Figure 6

Table 6. Correlations of American English talker-specific means and corresponding standard deviations (SD) along with 95% bootstrapped confidence intervals. Adult values from the Chodroff and Wilson (2017) isolated speech data are reported here for comparison. * reflects p < 0.01. ** reflects p < 0.001.

Figure 7

Table 7. Cantonese means and standard deviations (SD) of [th] VOT in milliseconds presented for each age group and overall. The grand mean, SD, and range of talker means are taken over the distribution of talker-specific VOT means. The mean talker SD and range of talker SDs are taken over the distribution of talker-specific VOT standard deviations. Adult values come from Clumeck et al. (1981) and are reported here for comparison.

Figure 8

Table 8. Cantonese talker-specific means and standard deviations (SD) of [kh] VOT in milliseconds presented for each age group and overall. The grand mean, SD, and range of talker means are taken over the distribution of talker-specific VOT means. The mean talker SD and range of talker SDs are taken over the distribution of talker-specific VOT standard deviations. Adult values come from Clumeck et al. (1981) and are reported here for comparison.

Figure 9

Table 9. Correlations between Cantonese talker-specific means for [th] and [kh] along with 95% bootstrapped confidence intervals. * reflects p < 0.01; ** reflects p < 0.001.

Figure 10

Figure 2. Variation and covariation of VOT means (ms) across Cantonese child talkers at a) age two, b) age three, c) age four, and d) age five. Each ellipsis is centered on the paired talker-specific means for [th] and [kh], and the diameter reflects the stop-specific standard deviation, scaled by 0.5. Marginal histograms show variation in talker means. The dashed line reflects the line of equality, the solid line the best-fit linear regression line, and the gray shading the local confidence interval around the best-fit linear regression line.

Figure 11

Table 10. Correlations of Cantonese talker-specific means and corresponding standard deviations (SD) along with 95% bootstrapped confidence intervals. * reflects p < 0.01; ** reflects p < 0.001.

Figure 12

Table 11. Stimulus list of English words beginning with aspirated stops.

Figure 13

Table 12. Stimulus list of Cantonese words beginning with simple aspirated stops.