Pseudo-Maximum Likelihood Theory for High-Dimensional Rank One Inference
Curtis Grant, Aukosh Jagannath, Justin Ko
TL;DR
The paper develops a universal pseudo-maximum likelihood theory for high-dimensional rank-one inference, introducing four information parameters that govern the limiting behavior across a wide class of models. It establishes a Parisi-variational framework that characterizes the limiting pseudo-likelihood and, via Gaussian equivalence, the performance of PMLEs, including a score-corrected variant to handle ill-scored models. The work shows strong and coarse equivalence between many estimation tasks (e.g., spiked matrix models and stochastic block models) and provides a complete description of the least-squares estimator in these settings, including phase transitions and failure modes. It applies the theory to Gaussian pseudolikelihoods, non-linear transforms of rank-one matrices, and a spectrum of examples (SBM, spiked Wigner, sparse PCA, Poisson-Bernoulli, etc.), offering a unifying lens for understanding statistical limits and algorithmic feasibility in high-dimensional rank-one inference.
Abstract
We develop a pseudo-likelihood theory for rank one matrix estimation problems in the high dimensional limit. We prove a variational principle for the limiting pseudo-maximum likelihood which also characterizes the performance of the corresponding pseudo-maximum likelihood estimator. We show that this variational principle is universal and depends only on four parameters determined by the corresponding null model. Through this universality, we introduce a notion of equivalence for estimation problems of this type and, in particular, show that a broad class of estimation tasks, including community detection, sparse submatrix detection, and non-linear spiked matrix models, are equivalent to spiked matrix models. As an application, we obtain a complete description of the performance of the least-squares (or ``best rank one'') estimator for any rank one matrix estimation problem.
