Hostname: page-component-5db58dd55d-l8wb7 Total loading time: 0 Render date: 2026-05-31T09:10:45.940Z Has data issue: false hasContentIssue false

Bayesian Identification and Estimation of Growth Mixture Models

Published online by Cambridge University Press:  07 April 2025

Xingyao Xiao
Affiliation:
BSE, University of California, Berkeley, CA, USA
Sophia Rabe-Hesketh*
Affiliation:
BSE, University of California, Berkeley, CA, USA
Anders Skrondal
Affiliation:
CEFH, Norwegian Institute of Public Health, Oslo, Norway CREATE & CEMO, University of Oslo, Oslo, Norway
*
Corresponding author: Sophia Rabe-Hesketh; sophiarh@berkeley.edu
Rights & Permissions [Opens in a new window]

Abstract

This article addresses problematic behaviors of Markov chain Monte Carlo (MCMC) methods for finite mixture models due to what we call degenerate nonidentifiability. We discuss the reasons for these behaviors, propose diagnostics to detect them, and show through simulations that using more informative priors than the vague defaults can mitigate the problems in growth mixture models (GMMs). Our motivating example is an application of GMMs to data from the National Longitudinal Survey of Youth (NLSY) to examine heterogeneity in the development of reading skills in children aged 6–14. We also suggest ways of describing and visualizing within-class heterogeneity in GMMs, provide a literature review of likelihood identification and Bayesian identification, propose a viable definition of Bayesian identification for latent variable models based on the marginal likelihood (integrated over the latent variables), and give a brief didactic description of Hamiltonian Monte Carlo (HMC) as implemented in Stan.

Information

Type
Application and Case Studies - Original
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Psychometric Society
Figure 0

Table 1 Information criteria for GMMs with 1–4 classes with D10N50 priors (smallest value for each criterion in italics)

Figure 1

Figure 1 (Color online) Class-specific mean trajectories with shaded 50% mid-range and box-plots of reading scores.Note: Class 1 is “Early Bloomers,” Class 2 is “Rapid Catch-Up Learners,” and Class 3 is “Steady Progressors.”

Figure 2

Table 2 Estimates for preferred three-class GMM with D10N50 priors

Figure 3

Figure 2 (Color online) Traceplot for Chain 6 for D2N500 priors where the Class-1 probability $\lambda ^{(1)}$ is persistently close to 0.

Figure 4

Figure 3 Q-Q plots of mean slope and random-intercept standard deviation parameter draws for Class 1 compared with their priors for Chain 6 with D2N500 priors.

Figure 5

Figure 4 (Color online) Traceplot for Chain 5 with D2C5 priors where $\lambda ^{(1)}$ and $\lambda ^{(2)}$ are close to 0 and the chain is completely stuck.

Figure 6

Figure 5 (Color online) Class-specific mean trajectories for Chain 5 for the D2C5 priors and box-plots of reading scores.

Figure 7

Figure 6 (Color online) Persistently stuck chains for D2C5 priors.Note: Top Panel: Traceplot of $\lambda ^{(1)},\, \lambda ^{(2)}$, and $\lambda ^{(3)}$ Bottom Panel: Moving standard deviation of $\lambda ^{(1)}$.

Figure 8

Figure 7 (Color online) Miniscule-class behavior for D4N100 priors.Note: Top Panel: Traceplot of $\lambda^{(1)},\, \lambda^{(2)}$, and $\lambda^{(3)}$ Mid Panel: Moving average and moving standard deviation of $\lambda^{(1)}$ Bottom Panel: Distinguishability index for all class pairs.

Figure 9

Figure 8 (Color online) Miniscule-class behavior for D2N500 priors.Note: Top Panel: Traceplot of $\lambda^{(1)},\, \lambda^{(2)}$, and $\lambda^{(3)}$ Mid Panel: Moving average and moving standard deviation of $\lambda^{(1)}$ Bottom Panel: Distinguishability index for all class pairs.

Figure 10

Table 3 Percent problematic behavior according to different diagnostics and Stan warnings for different prior combinations

Figure 11

Figure 9 (Color online) Percent problematic behavior for combinations of standard deviation priors (x-axis) and Dirichlet priors (D2, D6, D10 from left to right).Note: Top panels: percentage of 4-chain batches with $\widehat {R}>1.10$ for any parameter (dashed) and percentage of chains with stuck-sequence and/or miniscule class behavior (solid). Bottom panels: percentage of chains with miniscule class (solid), stuck chain (dashed), and stuck sequence (dot-dash).

Supplementary material: File

Xiao et al. supplementary material

Xiao et al. supplementary material
Download Xiao et al. supplementary material(File)
File 1.6 MB