Hostname: page-component-5db58dd55d-8mwbx Total loading time: 0 Render date: 2026-06-01T09:03:42.577Z Has data issue: false hasContentIssue false

Implications of Alternative Parameterizations in Structural Equation Models for Longitudinal Categorical Variables

Published online by Cambridge University Press:  03 January 2025

Silvia Bianconcini
Affiliation:
Department of Statistical Sciences, University of Bologna, Bologna, Italy
Kenneth A. Bollen*
Affiliation:
Department of Psychology and Neuroscience, Department of Sociology, University of North Carolina, Chapel Hill, NC, USA
*
Corresponding author: Kenneth A. Bollen; Email: bollen@unc.edu
Rights & Permissions [Opens in a new window]

Abstract

When analyzing scaling conditions in latent variable structural equation models (SEMs) with continuous observed variables, analysts scaling a latent variable typically set the factor loading of one indicator to one and either set its intercept to zero or the mean of its latent variable to zero. When binary and ordinal observed variables are part of SEMs, the identification and scaling choices are more varied and multifaceted. Longitudinal data further complicate this. In SEM software, such as lavaan and Mplus, fixing the underlying variables’ variances or the error variances to one are two primary scaling conventions. As demonstrated in this paper, choosing between these constraints can significantly impact longitudinal analysis, affecting model fit, degrees of freedom, and assumptions about the dynamic process and error structure. We explore alternative parameterizations and conditions of model equivalence with categorical repeated measures.

Using data from the National Longitudinal Survey of Youth 1997, we empirically explore how different parameterizations lead to varying conclusions in longitudinal categorical analysis. More specifically, we provide insights into the specifications of the autoregressive latent trajectory model and its special cases—the linear growth curve and first-order autoregressive models—for categorical repeated measures. These findings have broader implications for a wide range of longitudinal models.

Information

Type
Application and Case Studies - Original
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Psychometric Society
Figure 0

Table 1 Proportions for each year of the categories related to depressive symptoms, general health status, and illegal drug use

Figure 1

Figure 1 Path diagram depicting the cross-lagged and autoregressive component (right) and multivariate growth part (left) of alternative autoregressive latent trajectory model specifications for illegal drug use (drug), depressive symptoms (depr), and general health status (health). The top panel is based on the standard (theta) parameterization of the auxiliary model linking observed variables to the underlying continuous ones, as adopted by Mplus 8.6 and lavaan 0.6-16. Dashed lines indicate statistically significant effects ($p-value< 0.05$) only under this specification. The middle panel showcases an alternative parameterization proposed by Muthén and Asparouhov (2002). Paths in dark gray signify significance under the alternative parameterizations but not under the standard one. The bottom panel employs the parameterization introduced by Jöreskog (2001). Light gray paths indicate a significant pattern under this specification but not in the others.

Figure 2

Table 2 Alternative sets of identification constraints for the auxiliary model in presence of ordinal data

Figure 3

Table 3 Alternative sets of identification constraints for the auxiliary model in presence of binary data

Figure 4

Table 4 Fit statistics for the alternative(1 and 2) auxiliary models based on threshold invariance estimated for depressive symptoms, general health status, and for all the three variables jointly considered

Figure 5

Table 5 Identification constraints and degrees of freedom for the linear growth model for categorical repeated measures

Figure 6

Table 6 Parameter transformations for the linear growth models based on the parameterization proposed by Muthén and Muthén (1998–2017) (Model A) and the one illustrated by Mehta et al. (2004) (Model B)

Figure 7

Table 7 $\boldsymbol \Sigma $constraints for the linear growth model based on the (standard) delta parameterization (diag($\boldsymbol \Sigma _{\mathbf {Y}^{*}\mathbf {Y}^{*}})=\mathbf {I}$) and the (standard) theta parameterization ($\boldsymbol \Theta _{\boldsymbol \varepsilon }=\mathbf {I}$)

Figure 8

Table 8 Parameter transformations for the linear growth models where all underlying variable variances are free except on the first occasion (Model C) and when all error variances are free except at the first time point (Model D)

Figure 9

Table 9 $\boldsymbol {\Sigma }$and$\boldsymbol \mu $constraints for the linear growth model based on the alternative 2 delta parameterization (diag$(\boldsymbol \Sigma _{\mathbf {Y}^{*}\mathbf {Y}^{*}})$freely estimated) and the alternative 2 theta parameterization$(\boldsymbol \Theta _{\boldsymbol \varepsilon }$freely estimated)

Figure 10

Table 10 Identification constraints and degrees of freedom for the linear growth model for binary data

Figure 11

Table 11 Identification constraints and degrees of freedom for the autoregressive model for categorical repeated measures

Figure 12

Table 12 Parameter transformations for the autoregressive models based on the parameterization proposed by Muthén and Muthén (1998–2017) (Model E) and the one illustrated by Mehta et al. (2004) (Model F)

Figure 13

Table 13 Identification constraints and degrees of freedom for the autoregressive model for binary data

Figure 14

Table 14 Parameter transformations for the autoregressive model of order one for binary data based on the standard parameterization of the auxiliary model (Model G), following the alternative parameterization proposed by Millsap and Yun-Tein (2004) (Model H), that proposed by Muthén and Muthén (1998–2017) (Model I), and the one proposed by Jöreskog (2001) (Model L)

Figure 15

Table 15 Identification constraints and degrees of freedom for the autoregressive latent trajectory model for categorical repeated measures

Figure 16

Table 16 $\boldsymbol \mu $constraints for the different autoregressive latent trajectory model specifications

Figure 17

Table 17 Identification constraints and degrees of freedom for the autoregressive latent trajectory model for binary data

Figure 18

Table 18 (Significant) parameter estimates for different parameterizations of the ALT model.[$(^{*})$: significant at 5% level.$(^{**})$: significant at 1% levelNote: $(^{***})$: significant at 0.1% level.]

Supplementary material: File

Bianconcini and Bollen supplementary material

Bianconcini and Bollen supplementary material
Download Bianconcini and Bollen supplementary material(File)
File 118.3 KB