Hostname: page-component-89b8bd64d-sd5qd Total loading time: 0 Render date: 2026-05-10T13:26:55.380Z Has data issue: false hasContentIssue false

Maximum Softly Penalized Likelihood in Factor Analysis

Published online by Cambridge University Press:  18 February 2026

Philipp Sterzinger*
Affiliation:
Department of Statistics, London School of Economics and Political Science , United Kingdom
Ioannis Kosmidis
Affiliation:
Department of Statistics, University of Warwick , United Kingdom
Irini Moustaki
Affiliation:
Department of Statistics, London School of Economics and Political Science , United Kingdom
*
Corresponding author: Philipp Sterzinger; p.sterzinger@lse.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

Estimation in exploratory factor analysis often yields estimates on the boundary of the parameter space. Such occurrences, called Heywood cases, are characterized by non-positive variance estimates and can cause numerical instability, convergence failures, and misleading inferences. We derive sufficient conditions on the model and a penalty to the log-likelihood function that guarantee the existence of maximum penalized likelihood estimates in the interior of the parameter space, and that the corresponding estimators possess desirable asymptotic properties expected by the maximum likelihood estimator, namely, consistency and asymptotic normality. Consistency and asymptotic normality follow when penalization is soft enough, in a way that adapts to the information accumulation about the model parameters. We formally show, for the first time, that the penalties of Akaike (1987, Psychometrika, 52, 317–332) and Hirose et al. (2011, Journal of Data Science, 9, 243–259) to the log-likelihood of the normal linear factor model satisfy the conditions for existence, and, hence, deal with Heywood cases. Their vanilla versions, though, can result in questionable finite-sample properties in estimation, inference, and model selection. Our maximum softly-penalized likelihood (MSPL) framework ensures that the resulting estimation and inference procedures are asymptotically optimal. Through comprehensive simulation studies and real data analyses, we illustrate the desirable finite-sample properties of the MSPL estimators.

Information

Type
Theory and Methods
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of Psychometric Society
Figure 0

Table 1 Loading matrix settings $A_3$ and $B_3$

Figure 1

Figure 1 Percentage of samples (out of $1,000$) that have been identified as Heywood cases for ML (“None”), MPL with Akaike[n] and Hirose[n] penalties, and MSPL with Akaike[$n^{-1/2}$] and Hirose[$n^{-1/2}$] penalties, $n \in \left \{50,100,400\right \}$, and loading matrix settings $A_3$, $B_3$, $A_5$, $B_5$, $A_8$, and $B_8$.

Figure 2

Figure 2 Violin plots of estimates of $\log (|\mathrm {Bias}|)$ (top panel), $\log (\mathrm {RMSE})$ (middle panel), and probability of underestimation (bottom panel) for the elements of $\boldsymbol {\Lambda }\boldsymbol {\Lambda }^\top $, for each estimator, $n \in \left \{50,100,400\right \}$, and loading matrix settings $A_3$ and $B_3$. The average over all elements for each setting is noted with a dot.

Figure 3

Figure 3 Percentage of times the model with $three$ factors is selected for each estimator, $n \in \{50, 400, 1000, 5000 \}$, and loading matrix settings $A_3$ and $B_3$, using AIC and BIC. The absence of vertical bars pertaining to the Akaike[n]- and Hirose[n]-based model selection procedures indicate that these methods have never selected the correct model.

Figure 4

Table 2 Percentage of times each number of factors has been selected using minimum BIC, for ML and MSPL with Akaike[n], Hirose[n], Akaike[$n^{-1/2}$], and Hirose[$n^{-1/2}$] penalties, under loading matrix setting $B_3$ and $n \in \left \{50,400, 1000, 5000\right \}$

Figure 5

Table 3 Estimated communalities ($\times 10^3$) for the Davis, Emmett, and Maxwell data, using ML, and MSPL with Akaike[$n^{-1/2}$] and Hirose[$n^{-1/2}$] penalties, for $q \in \{1, \ldots , 5\}$, with AIC and BIC values

Supplementary material: File

Sterzinger et al. supplementary material

Sterzinger et al. supplementary material
Download Sterzinger et al. supplementary material(File)
File 765.4 KB