Hostname: page-component-89b8bd64d-j4x9h Total loading time: 0 Render date: 2026-05-13T04:31:34.191Z Has data issue: false hasContentIssue false

Bilingual Toddlers’ Vocabulary Growth Interacts with Existing Knowledge and Cross-Linguistic Similarity

Published online by Cambridge University Press:  04 February 2025

Serene Siow*
Affiliation:
Department of Experimental Psychology, University of Oxford, Oxford, UK Department of English, Linguistics & Theatre Studies, National University of Singapore, Singapore, Singapore
Irina Lepadatu
Affiliation:
Department of Experimental Psychology, University of Oxford, Oxford, UK
Nicola A. Gillen
Affiliation:
Department of Experimental Psychology, University of Oxford, Oxford, UK
Kim Plunkett
Affiliation:
Department of Experimental Psychology, University of Oxford, Oxford, UK
*
Corresponding author: Serene Siow; Email: s.siow@nus.edu.sg
Rights & Permissions [Opens in a new window]

Abstract

We explored whether bilingual toddlers make use of semantic and phonological overlap between their languages to learn new words. We analysed cross-sectional and longitudinal CDI data on the words understood and produced by 1.0 to 3.0-year-old bilingual toddlers with English and one additional language. Cognates were more likely to be understood and produced compared to non-cognates. Cognate effects were modulated by whether the toddler knew the translation equivalent in the other language, highlighting that young learners are sensitive to the similarities across their languages. Additionally, exploratory analyses suggest that children with smaller vocabularies rely more on translation equivalents to support the acquisition of difficult words. Children with larger vocabulary sizes exhibited no preference for translation equivalents in comprehension, and a preference for new concepts in production. The rapid acceleration of vocabulary growth in the second year of life may explain this developmental change in translation equivalent preference.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Diagram of the cascaded activation model, with examples of an identical cognate, a partially overlapping cognate with 1 phoneme match, and a non-cognate with zero matches.

Figure 1

Table 1. Number of participants per language, with mean age at time of response and mean percentage of overall English exposure

Figure 2

Figure 2. Plot visualising model predictions for the generalised linear model of English comprehension, with lines of best fit showing the probability of words being understood depending on word difficulty. Coloured lines represent levels of phonological similarity (PhonSim) and plots are faceted by whether or not the TE is also understood.

Figure 3

Table 2. Model coefficients for the full generalised linear mixed-effect model of English comprehension, with age in months, English exposure, AL TE knowledge, word difficulty and phonological similarity as predictors, an interaction between English exposure and phonological similarity, and an interaction between AL TE knowledge, and phonological similarity

Figure 4

Figure 3. Plot visualising model predictions for the generalised linear model of English production, with lines of best fit showing the probability of words being produced depending on its word difficulty. Coloured lines represent levels of phonological similarity and plots are faceted by whether or not the TE is also produced.

Figure 5

Table 3. Model coefficients for the full generalised linear mixed-effect model of English production, with age in months, English exposure, AL TE knowledge, word difficulty and phonological similarity as predictors, an interaction between AL TE knowledge and phonological similarity, and an interaction between English exposure and phonological similarity

Figure 6

Figure 4. Model predictions for the likelihood of a word to be learned between T(N) and T(N+1), with word difficulty on the x-axis, split by phonological similarity and faceted by whether the word was a singlet or not understood at T(N). Grey bars indicate standard error.

Figure 7

Table 4. Model coefficients for the generalised linear mixed-effect model for comprehension at T(N + 1)

Figure 8

Table 5. Marginal R2 and chi square comparisons on AIC for models of comprehension with increasing complexity

Figure 9

Figure 5. Model predictions for the likelihood of a word to be learned between T(N) and T(N+1), with word difficulty on the x-axis, split by whether the word was a singlet or not understood at T(N), and faceted by child’s vocabulary size. Grey bars indicate standard error.

Figure 10

Figure 6. Model predictions for the likelihood of a word to be produced between T(N) and T(N+1), with word difficulty on the x-axis, split by phonological similarity and faceted by whether the word was a singlet or not produced at T(N). Grey bars indicate standard error.

Figure 11

Table 6. Model coefficients for the generalised linear mixed-effect model for production at T(N + 1)

Figure 12

Figure 7. Model predictions for the likelihood of a word to be produced between T(N) and T(N+1), with word difficulty on the x-axis, split by whether the word was a singlet or not produced at T(N), and faceted by child’s vocabulary size. Grey bars indicate standard error.

Figure 13

Table 7. Marginal R2 and chi square comparisons on AIC for models of production with increasing complexity