Hostname: page-component-89b8bd64d-7zcd7 Total loading time: 0 Render date: 2026-05-08T03:30:39.031Z Has data issue: false hasContentIssue false

Matching Criterion for Identifiability in Sparse Factor Analysis

Published online by Cambridge University Press:  20 January 2026

Nils Sturma*
Affiliation:
Technical University of Munich , Germany
Miriam Kranzlmueller
Affiliation:
Ludwig Maximilian University of Munich , Germany
Irem Portakal
Affiliation:
Max Planck Institute for Mathematics in the Sciences , Germany
Mathias Drton
Affiliation:
Technical University of Munich , Germany
*
Corresponding author: Nils Sturma; Email: nils.sturma@tum.de
Rights & Permissions [Opens in a new window]

Abstract

Factor analysis models explain dependence among observed variables by a smaller number of unobserved factors. A main challenge in confirmatory factor analysis is determining whether the factor loading matrix is identifiable from the observed covariance matrix. The factor loading matrix captures the linear effects of the factors and, if unrestricted, can only be identified up to an orthogonal transformation of the factors. However, in many applications, the factor loadings exhibit an interesting sparsity pattern that may lead to identifiability up to column signs. We study this phenomenon by connecting sparse confirmatory factor analysis models to bipartite graphs and providing sufficient graphical conditions for identifiability of the factor loading matrix up to column signs. In contrast to previous work, our main contribution, the matching criterion, exploits sparsity by operating locally on the graph structure, thereby improving existing conditions. Our criterion is efficiently decidable in time that is polynomial in the size of the graph, when restricting the search steps to sets of bounded size.

Information

Type
Theory and Methods
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of Psychometric Society
Figure 0

Figure 1 Directed graph encoding the sparsity structure in a factor analysis model.

Figure 1

Figure 2 Two factor analysis graphs. Graph (a) satisfies ZUTA while graph (b) does not.

Figure 2

Figure 3 AR-identifiable factor analysis graph.

Figure 3

Figure 4 Two full-ZUTA graphs.

Figure 4

Figure 5 Full-ZUTA graph and a sparse subgraph.

Figure 5

Figure 6 Sparse factor analysis graphs that is not AR-identifiable nor BB-identifiable.

Figure 6

Figure 7 M-identifiable sparse factor analysis graph.

Figure 7

Figure 8 Graph that is certified to be generically sign-identifiable by Theorem 4.15.

Figure 8

Figure 9 Extended M-identifiable sparse factor analysis graph.

Figure 9

Figure 10 Two generically sign-identifiable graphs that are not extended M-identifiable.

Figure 10

Table 1 Counts of unlabeled sparse factor graphs satisfying ZUTA with at most $|V|=7$ observed nodes and $|\mathcal {H}|=3$ latent nodes

Figure 11

Table 2 Counts of unlabeled sparse factor graphs with at most $|V|=9$ observed nodes and $|\mathcal {H}|=4$ latent nodes

Figure 12

Table 3 Status of extended M-identifiability for 5,000 randomly generated sparse factor graphs with different edge probabilities with $|W|\leq k$ for $k=4$

Figure 13

Table 4 Factor loading matrix obtained via maximum likelihood estimation and varimax rotation from the POPPA data set

Figure 14

Figure 11 Graph encoding a nonzero correlation between the latent factors.

Supplementary material: File

Sturma et al. supplementary material

Sturma et al. supplementary material
Download Sturma et al. supplementary material(File)
File 664.4 KB