Hostname: page-component-6766d58669-bp2c4 Total loading time: 0 Render date: 2026-05-22T12:44:17.578Z Has data issue: false hasContentIssue false

The clarity of word repetitions in American English infant-directed speech

Published online by Cambridge University Press:  15 September 2025

Daniel Swingley*
Affiliation:
Department of Psychology, University of Pennsylvania , Philadelphia, PA, USA
Rights & Permissions [Opens in a new window]

Abstract

Words in infant-directed speech (IDS) are often phonetically reduced. This likely renders words harder for infants to learn and recognize. This difficulty might be mitigated by the repetitive nature of IDS, in particular if reduced instances are often preceded by clear instances (i.e., the first-mention effect). To characterize phonetic clarity in American English word repetitions, words were extracted from the IDS of eight mothers and presented to adults (n = 36) who judged their clarity. First mentions of repeated words were found to be clearer than second mentions, though this effect was small. Clarity was rated as greater for less common words and for utterance-final words. Clarity was also greater for words parents thought their child knew. The results help guide intuitions about the phonetic problem infants face when learning their first words.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - SA
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-ShareAlike licence (http://creativecommons.org/licenses/by-sa/4.0), which permits re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is used to distribute the re-used or adapted article and the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Table 1. Stimulus distribution over Brent dyads. Dyad labels (c1, d1…) are as given in the Brent dataset

Figure 1

Table 2. Stimulus distribution by sentence position. Tokens from one-word utterances are excluded from the other categories in the table

Figure 2

Figure 1. For each word (item), the proportion of judges selecting the first mention as the clearer token. Boxplots show medians and quartiles of the items averaging over subjects. Means are shown as red triangles. Results for each pair are overlaid on the boxplots, with items from the Experiment 1 set as circles and Experiment 2 as x’s. Dyads are listed using the codes given in the Brent corpus.

Figure 3

Table 3. Regression predicting judges’ choice of whether the first-mention or second-mention token was the clearer one. Predictors are named at left. The condition that contrasts from baseline is given in square brackets. exp(coef) indicates the multiplicative change in the odds of choosing the first-mention token when going from the baseline case to the indicated case

Figure 4

Table 4. Logistic regression predicting judges’ choice of whether the first-mention or second-mention token was the clearer one. Predictors are named at left. The condition that contrasts from baseline is given in square brackets

Figure 5

Figure 2. For each item pair, the proportion of judges who chose the first mention as the clearer one, split into facets according to whether the first or second mention was utterance-final. Within each facet, items are arrayed on the x axis by (log) frequency.

Figure 6

Figure 3. For each word pair, the mean rating of the first mention and the second mention. Each Brent corpus dyad is given separately, indicated by the dyad identifier from the corpus. Points above the diagonal line show items that had a first-mention advantage.

Figure 7

Table 5. Analysis of first-mention and second-mention mean difference scores in ratings of word clarity, including CDI result and utterance-final positioning

Figure 8

Table 6. Analysis of first-mention and second-mention mean difference scores in ratings of word clarity. The analysis includes all trials, and tests the interaction between frequency and utterance-final position. Log frequency is mean-centred

Figure 9

Figure 4. For each word pair, the mean rating of the first mention and the second mention, arrayed according to word frequency. The facets indicate which tokens of the pair were utterance-final. The first-mention tokens’ ratings are given in darker, blue points, the second-mention tokens’ ratings in lighter, red points. Lines show linear fits, with the grey shading indicating a 95% confidence interval.

Figure 10

Table 7. Ordinal regression predicting ratings of words. Data were entered at the trial level. Contrasts were treatment coded. [yes] and [known] in brackets refer to the level designated as treatment. All data for which CDI scores were available were included

Figure 11

Table 8. Estimated phonological distances of all word responses from the canonical pronunciation of the corpus word

Figure 12

Table 9. Negative binomial regression predicting the phonological distance of transcriptions from the correct word. For the “mention” predictor, regression coefficients are negative when first-mention effects are stronger. Exponentiated coefficients (exp(coef)) give the multiplicative change in phonological distance expected given a unit change in the predictor. Contrasts are treatment coded. Material in brackets, like [yes], refers to the level designated as treatment. Random effects were listener and item pair with random slopes for mention

Figure 13

Table 10. Negative binomial regression predicting the phonological distance of transcriptions from the correct word, with nonsignificant interactions removed from the formula (cf. Table 9)

Figure 14

Figure 5. For each word pair, the mean rating of the first mention and the second mention (left panel) or the transcriptions’ mean distance from the correct target word (right panel) arrayed according to word frequency. Utterance-final words are shown in red, non-utterance-final words in grey. Coloured lines are linear regressions of rating or distance on log frequency, separated according to sentence position category. Marginal density plots show ratings or distance distributions by utterance position; areas under the curves sum to one. The plots show a very large effect of utterance position on both measures, and a significant tendency for more frequent words to be spoken less clearly.

Figure 15

Figure 6. Over all items, mean rating for the first mention and the second mention of each pair. The first-mention advantage is shown by colour and position. When judges gave higher (better) clarity ratings to the first mention than the second mention, the plot point for that item falls in the upper left portion of the plot.

Supplementary material: File

Swingley supplementary material

Swingley supplementary material
Download Swingley supplementary material(File)
File 29.9 KB