Hostname: page-component-89b8bd64d-sd5qd Total loading time: 0 Render date: 2026-05-08T00:01:31.831Z Has data issue: false hasContentIssue false

Identification and Scaling of Latent Variables in Ordinal Factor Analysis

Published online by Cambridge University Press:  13 January 2026

Edgar C. Merkle*
Affiliation:
University of Missouri , USA
Sonja D. Winter
Affiliation:
University of Missouri , USA
Ellen Fitzsimmons
Affiliation:
University of Missouri , USA
*
Corresponding author: Edgar C. Merkle; Email: merklee@missouri.edu
Rights & Permissions [Opens in a new window]

Abstract

Social science researchers are generally accustomed to treating ordinal variables as though they are continuous. In this article, we consider how identification constraints in ordinal factor analysis can mimic the treatment of ordinal variables as continuous. We specifically describe model constraints that lead to latent variable predictions equaling the average of ordinal variables. This result leads us to propose minimal identification constraints, which we call integer constraints, that place the latent variables on the scale of the observed, integer-coded ordinal variables. The integer constraints lead to intuitive model parameterizations because researchers are already accustomed to thinking about ordinal variables as though they are continuous. We provide a proof that our proposed integer constraints are indeed minimal identification constraints, as well as illustrations of how integer constraints work with real data. We also provide simulation results indicating that integer constraints are similar to other identification constraints in terms of estimation convergence and admissibility.

Information

Type
Theory and Methods
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of Psychometric Society
Figure 0

Figure 1 Observed averages versus MAP latent variable estimates for $K=5$ and $p=2$ to 10. Each point represents a response pattern. Red points are response patterns that do not include a response of 1 or 5, and blue points are response patterns that do include a response of 1 and/or 5.

Figure 1

Figure 2 Proportion of converged and admissible replications across simulation conditions when all indicators have a balanced, skewed, or middling response distribution.

Figure 2

Table 1 Proportion replications with middling response pattern resulting in identical fit across identification constraint methods

Figure 3

Table 2 Proportion replications with middling response pattern, six indicators, and three response categories resulting in best fit across identification constraint methods

Figure 4

Table 3 Item response frequencies of the attitudes toward science dataset

Figure 5

Table 4 Comparison of loading estimates and SEs under traditional constraints and under integer constraints

Figure 6

Table 5 Comparison of threshold estimates and SEs under traditional constraints and under integer constraints

Figure 7

Figure 3 Average of observed variables versus MAP latent variable predictions for the attitudes toward science dataset.

Figure 8

Table 6 Item parameter estimates for Example 2

Figure 9

Figure 4 Average of observed variables versus MAP latent variable predictions for the social media dataset.

Figure 10

Figure 5 Latent variable values versus expected average score (left panel), with overlaid points of MAP latent variable estimates versus observed average scores (right panel).

Figure 11

Figure B1 Proportion of converged and admissible replications across simulation conditions when all indicators have a balanced, skewed, or middling response distribution, using default starting values.

Figure 12

Table B1 Proportion replications with middling response pattern resulting in identical fit across identification constraint methods, using default starting values

Figure 13

Table B2 Proportion replications with middling response pattern, six indicators, and three response categories resulting in best fit across identification constraint methods, using default starting values

Figure 14

Table B3 Proportion replications with symmetric or skewed response pattern resulting in identical fit across identification constraint methods, using simple starting values

Figure 15

Table B4 Proportion replications with symmetric or skewed response pattern resulting in identical fit across identification constraint methods, using default starting values

Figure 16

Table B5 Proportion replications with middling response pattern and three indicators per factor resulting in best fit across identification constraint methods, using simple starting values

Figure 17

Table B6 Proportion replications with middling response pattern and three indicators per factor resulting in best fit across identification constraint methods, using default starting values

Figure 18

Table B7 Proportion replications with middling response pattern and six indicators per factor resulting in best fit across identification constraint methods, using simple starting values

Figure 19

Table B8 Proportion replications with middling response pattern and six indicators per factor resulting in best fit across identification constraint methods, using simple starting values