Maximum softly penalised likelihood in factor analysis
Philipp Sterzinger, Ioannis Kosmidis, Irini Moustaki
TL;DR
This paper tackles the prevalence of Heywood cases in exploratory factor analysis by introducing a maximum softly penalised likelihood (MSPL) framework. It derives sufficient conditions under which penalised estimates exist in the interior of the parameter space and the corresponding estimators preserve key properties of maximum likelihood (ML), and shows that the penalties of Akaike (1987) and Hirose et al. (2011) satisfy the existence conditions and, when scaled appropriately, yield softly decaying penalties. The authors establish consistency and, under stronger identification, $\sqrt{n}$-consistency and asymptotic normality for MSPL estimators, demonstrating that soft penalties can avoid improper boundary solutions without compromising ML-type inference. In simulations and real-data applications, MSPL improves finite-sample performance, stabilises factor loading and communality estimates, and yields more reliable model selection than either the unscaled penalties or unpenalised ML. The framework provides a principled way to mitigate Heywood cases while retaining the desirable properties of ML estimation in factor analysis.
Abstract
Estimation in exploratory factor analysis often yields estimates on the boundary of the parameter space. Such occurrences, known as Heywood cases, are characterised by non-positive variance estimates and can cause numerical optimisation issues or convergence failures, which, in turn, can lead to misleading inferences, particularly regarding factor scores and model selection. We derive sufficient conditions on the model and a penalty to the log-likelihood function that i) guarantee the existence of maximum penalised likelihood estimates in the interior of the parameter space, and ii) ensure that the corresponding estimators possess the desirable asymptotic properties expected of the maximum likelihood estimator, namely consistency and asymptotic normality. Consistency and asymptotic normality are achieved when the penalisation is soft enough, in a way that adapts to the information accumulation about the model parameters. We formally show, for the first time, that the penalties of Akaike (1987) and Hirose et al. (2011) to the log-likelihood of the normal linear factor model satisfy the conditions for existence, and, hence, deal with Heywood cases. Their vanilla versions, though, can result in questionable finite-sample properties in estimation, inference, and model selection. The maximum softly penalised likelihood framework we introduce enables the careful scaling of those penalties to ensure that the resulting estimation and inference procedures inherit the maximum likelihood estimator's optimal properties. Through comprehensive simulation studies and the analysis of real data sets, we illustrate the desirable finite-sample properties of the maximum softly penalised likelihood estimators and associated procedures.
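To make the idea of a soft penalty concrete, the following is a minimal sketch rather than the authors' implementation. It assumes the normal linear factor model with implied covariance $\Lambda\Lambda^\top + \Psi$, and an illustrative penalty proportional to $\sum_j s_{jj}/\psi_j$ that diverges as any unique variance $\psi_j$ approaches zero. The penalty form, the $n^{-1/2}$ scaling, the tuning constant `rho`, and all function names are assumptions made for illustration; they are not taken from the paper or from the cited references.

```python
import numpy as np


def factor_loglik(loadings, psi, S, n):
    """Normal factor-model log-likelihood, up to an additive constant."""
    Sigma = loadings @ loadings.T + np.diag(psi)  # implied covariance matrix
    _, logdet = np.linalg.slogdet(Sigma)
    return -0.5 * n * (logdet + np.trace(np.linalg.solve(Sigma, S)))


def soft_penalty(psi, S, n, rho=1.0):
    """Illustrative penalty (assumed form): diverges to -inf as any psi_j -> 0.

    The n**-0.5 scaling is an assumption; it makes the penalty shrink
    relative to the log-likelihood, which grows linearly in n.
    """
    c_n = rho / np.sqrt(n)
    return -0.5 * c_n * np.sum(np.diag(S) / psi)


def penalised_loglik(loadings, psi, S, n, rho=1.0):
    """Log-likelihood plus the illustrative soft penalty."""
    return factor_loglik(loadings, psi, S, n) + soft_penalty(psi, S, n, rho)


# Hypothetical 3-variable, 1-factor example with a made-up sample covariance S.
S = np.array([[1.00, 0.45, 0.45],
              [0.45, 1.00, 0.25],
              [0.45, 0.25, 1.00]])
loadings = np.array([[0.9], [0.5], [0.5]])
n = 200

# Move the first unique variance towards the boundary of the parameter space.
for psi_1 in (0.19, 1e-4, 1e-8):
    psi = np.array([psi_1, 0.75, 0.75])
    print(psi_1,
          factor_loglik(loadings, psi, S, n),
          penalised_loglik(loadings, psi, S, n))
```

In this toy setting the unpenalised log-likelihood stays finite as the first unique variance approaches zero, whereas the penalised objective decreases without bound, so a maximiser of the penalised objective cannot lie on that boundary.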
