Hostname: page-component-89b8bd64d-n8gtw Total loading time: 0 Render date: 2026-05-07T16:15:44.651Z Has data issue: false hasContentIssue false

A broadened estimate of syntactic and lexical ability from the MB-CDI

Published online by Cambridge University Press:  11 May 2021

Trevor K.M. DAY*
Affiliation:
Institute of Child Development, University of Minnesota, Minneapolis, MN, USA
Jed T. ELISON
Affiliation:
Institute of Child Development, University of Minnesota, Minneapolis, MN, USA Department of Pediatrics, University of Minnesota, Minneapolis, MN, USA
*
Address for correspondence: Trevor Day: 51 East River Parkway, Minneapolis, MN 55455. Email: day00096@umn.edu
Rights & Permissions [Opens in a new window]

Abstract

A critical question in the study of language development is to understand lexical and syntactic acquisition, which play different roles in speech to the extent it would be natural to surmise they are acquired differently. As measured through the comprehension and production of closed-class words, syntactic ability emerges at roughly the 400-word mark. However, a significant proportion of the developmental work uses a coarse combination of function and content words on the MacArthur-Bates Communicative Development Inventory (MB-CDI). Using the MB-CDI Wordbank database, we implemented a factor analytic approach to distinguish between lexical and syntactic development from the Words and Sentences (WS) form that involves both function words and the explicit categorizations. Although the Words and Gestures (WG) form did not share the factor structure, common WG/WS elements recapitulate the expected age-related changes. This parsing of the MB-CDI may prove simple, yet fruitful in subsequent investigation.

Information

Type
Brief Research Report
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re- use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2021. Published by Cambridge University Press
Figure 0

Table 1. Demographic variables given in Wordbank for WG and WS. Percentages given as percent of non-missing data.

Figure 1

Table 3. The number of items in the categories shared and distinct between WG and WS. Where the categories are not named identically, the names are given as WG / WS. “Word Forms” is use of correct irregulars and “Word Endings” is use of incorrect overgeneralizations.

Figure 2

Figure 1. Scree plot for a principal axis factoring of the WS data, used to select the number of factors to extract from a subsequent FA. Two factors show the most improvement in variance explained. The dashed red line shows the eigenvalue for simulated random data; factor solutions below this line are considered to not be valid. The line at y = 1 is the Kaiser criterion for selecting factors; however, we also examine the third factor (just below).

Figure 3

Table 2. Descriptive statistics for the EFA/CFA divisions of the WG and WS forms. Percentages may not add to 100% because of rounding. For this analysis, mother's education was converted to years and birth order simplified to first vs. later.

Figure 4

Figure 2. Correlations between lexical subcategories (WS). Note that all correlations are positive.

Figure 5

Table 4. Factor loadings for the 2- and 3-factor EFAs on the Wordbank WS data. Loadings greater than 0.4 bolded “Word Forms” is use of correct irregulars and “Word Endings” is use of incorrect overgeneralizations. Categories with asterisks are easily considered function categories.

Figure 6

Table 6. Model fit indices for WS CFAs. Typical criteria for “good fit” are given in the first row. CFI: comparative fit index; Tucker-Lewis index: TLI; RMSEA: root-mean-square error of approximation; SRMR: standardized root mean square residual. 95% confidence interval given for RMSEA. Chi-square tests are almost always significant (indicating bad fit) in large sample sizes.

Figure 7

Figure 3. (a) Distribution of lexical/syntactic ability scores plotted by age band. The central line represents the mean score, and the bands 1 IQR (inner; 50%) and 3 IQR (outer; 99%). (b) Lexical and syntactic ability scores plotted against each other within-individual. The red dashed line indicates the hypothetical equal-development rate, and the solid black line indicates the second-degree polynomial line of best fit. Age is plotted youngest (purple) to oldest (yellow-green) using viridis (Garnier, 2018).

Figure 8

Figure 4. Estimates of lexical and syntactic ability scores plotted against each other. Each dot represents the scores from one individual. The red dashed line indicates the hypothetical equal-development rate, and the solid black line indicates the second-degree polynomial line of best fit. All 5520 WS individuals are included.

Figure 9

Table 5. Factor loadings for the 2- and 3-factor EFAs on the Wordbank WG data. Loadings greater than 0.4 bolded. Categories with asterisks are easily considered function categories.

Figure 10

Figure 5. Distribution of lexical/syntactic ability scores plotted against one another within individual, in both WS (blue) and WG (red). Ages are obscured for clarity. The red dashed line represents y = 1, the hypothesized relationship if abilities developed at the same rate. All WG and WG individuals included.