Hostname: page-component-77f85d65b8-6c7dr Total loading time: 0 Render date: 2026-04-17T18:04:02.981Z Has data issue: false hasContentIssue false

Multiphase Structured Latent Curve Models for Count Response Data: A Re-Analysis of the Acquisition of Morphology in English

Published online by Cambridge University Press:  18 March 2025

Marian M. Strazzeri*
Affiliation:
Department of Human Development and Quantitative Methodology, University of Maryland, College Park, MD, USA
Jeffrey R. Harring
Affiliation:
Department of Human Development and Quantitative Methodology, University of Maryland, College Park, MD, USA
Nan Bernstein Ratner
Affiliation:
Department of Hearing and Speech Sciences, University of Maryland, College Park, MD, USA
*
Corresponding author: Marian M. Strazzeri; m.m.callaham@gmail.com
Rights & Permissions [Opens in a new window]

Abstract

Structured latent curve models (SLCMs) for continuous repeated measures data have been the subject of considerable recent research activity. In this article, we develop a first-order SLCM for repeated measures count data where the underlying change process is theorized to develop in distinct phases. Parameters of the multiphase or piecewise growth model, including changepoints, are allowed to vary across individuals. Exposure is allowed to vary across both individuals and time. We demonstrate our modeling approach on empirical expressive language data (grammatical morpheme counts) drawn from multiple distinct corpora available in the Child Language Data Exchange System (CHILDES), where the acquisition of grammatical morphology is understood to occur in distinct phases in typically developing children. A multiphase SLCM is fit to summarize individuals’ data as well as the average developmental pattern. Change in time-varying dispersion (unexplained variability in morpheme counts) over the course of early childhood is modeled concurrently to provide additional insights into acquisition. Unique characteristics of count data create modeling, identification, estimation, and diagnostic challenges that are exacerbated by incorporating growth models with nonlinear random effects. These are discussed at length. We provide annotated software code for each of models used in the empirical example.

Information

Type
Application and Case Studies - Original
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Psychometric Society
Figure 0

Table 1 Brown’s (1973) grammatical morphemes (BGMs) in order of acquisition

Figure 1

Figure 1 Cross-sectionally estimated NB2 log expected BGM2 production rate and dispersion parameter by chronological age. Note: BGM2 denotes Brown’s (1973) second grammatical morpheme, “in” (see Table 1). NB2 denotes the Negative Binomial distribution with mean $\mu $, dispersion $\phi $, and quadratic variance function $\mu + \phi \mu ^{2}$. Sample characteristics are provided in Table 3.

Figure 2

Table 2 Probability distributions for a single counting process

Figure 3

Table 3 Sample characteristics

Figure 4

Figure 2 Rate at which the morpheme “in” is produced within an oral language sample by chronological age and corpus. Note: Brown’s (1973) grammatical morphemes are summarized in Table 1. Sample characteristics are provided in Table 3.

Figure 5

Figure 3 Relationship between chronological age and number of sampled utterances. Note: Sample characteristics are provided in Table 3.

Figure 6

Figure 4 Path diagram for a first-order linear–linear latent growth model fit to repeated measurements of a single count response variable that conditionally follows a distribution in the two-parameter exponential family at each measurement occasion. Note: A solid, single-line, black arrow indicates a structural relationship with an identity link function. A solid, single-line, red arrow indicates a structural relationship with a non-identity (e.g., natural log) link function. A dashed black arrow from A to B indicates A gives rise to B directly and/or indirectly. A solid, double-line, black arrow from A to B indicates A generates B.

Figure 7

Figure 5 Model-data fit evaluations conducted to inform measurement model selection.Note: Sample characteristics are provided in Table 3.

Figure 8

Table 4 Population parameter estimates for NB2 first-order linear-linear SLCM fit to longitudinal BGM2 counts

Figure 9

Figure 6 NB2 log expected BGM2 production rate and dispersion by chronological age. Note: BGM2 denotes Brown’s (1973) second grammatical morpheme, "in” (see Table 1). NB2 denotes the Negative Binomial distribution with mean $\mu $, dispersion $\phi $, and quadratic variance function $\mu + \phi \mu ^{2}$. FO-LL-SLCM denotes the first-order linear–linear structured latent curve model fit to the data, yielding the population parameter estimates in Table 4. Sample characteristics are provided in Table 3.

Figure 10

Figure 7 Example individual fitted trajectories.

Figure 11

Figure 8 Path diagram for three-tier formation of a second-order linear–linear latent growth model fit to repeated measurements of multiple count response variables that each conditionally follow a 2PEF distribution at each measurement occasion. Note: Much of the notation used in Figure 8 is the same as the notation used in Figure 4. A solid, single-line, red arrow indicates a structural relationship with a non-identity (e.g., natural log) link function. A dashed black arrow from A to B indicates A gives rise to B directly and/or indirectly. A solid, double-line, black arrow from A to B indicates A generates B.

Figure 12

Figure 9 Path diagram for two-tier formation of a second-order linear–linear latent growth model fit to repeated measurements of multiple count response variables that each conditionally follow a 2PEF distribution at each measurement occasion. Note: Much of the notation used in Figure 9 is the same as the notation used in Figure 4. A solid, single-line, red arrow indicates a structural relationship with a non-identity (e.g., natural log) link function. A dashed black arrow from A to B indicates A gives rise to B directly and/or indirectly. A solid, double-line, black arrow from A to B indicates A generates B.