Computational and statistical lower bounds for low-rank estimation under general inhomogeneous noise
Debsurya De, Dmitriy Kunisky
TL;DR
This work extends the theory of spiked matrix models to inhomogeneous noise with general variance profiles, establishing computational lower bounds via low-degree polynomial analysis and sharp information-theoretic limits that hold beyond block-structured settings. By introducing and exploiting the reciprocal-variance matrix $oldsymbol{ riangle}^{-1}$ and an AMP-inspired spectral transform $oldsymbol{ moldsymbol{H}}(Y)$, the authors identify a general threshold $eta_*$ that governs detectability and low-rank recovery, and show that for $eta<eta_*$ no efficient procedure (in the low-degree sense) can succeed, while for $eta>eta_*$ certain spectral methods achieve separation. They develop tight bounds for block-structured and general variance profiles, including growth of the low-degree $oldsymbol{ extchi^2}$-divergence and channel-universality arguments, and provide numerical experiments for smoothly varying variance profiles that support the potential optimality of the LinAMP-based spectral algorithm. The results suggest a broad computational threshold governing inhomogeneous noise models and offer tools (graph-sum bounds, partition-based analyses) that may be of independent interest in random matrix theory and beyond. Overall, the paper advances our understanding of statistical-to-computational gaps in high-dimensional low-rank estimation under realistic, nonuniform noise.
Abstract
Recent work has generalized several results concerning the well-understood spiked Wigner matrix model of a low-rank signal matrix corrupted by additive i.i.d. Gaussian noise to the inhomogeneous case, where the noise has a variance profile. In particular, for the special case where the variance profile has a block structure, a series of results identified an effective spectral algorithm for detecting and estimating the signal, identified the threshold signal strength required for that algorithm to succeed, and proved information-theoretic lower bounds that, for some special signal distributions, match the above threshold. We complement these results by studying the computational optimality of this spectral algorithm. Namely, we show that, for a much broader range of signal distributions, whenever the spectral algorithm cannot detect a low-rank signal, then neither can any low-degree polynomial algorithm. This gives the first evidence for a computational hardness conjecture of Guionnet, Ko, Krzakala, and Zdeborová (2023). With similar techniques, we also prove sharp information-theoretic lower bounds for a class of signal distributions not treated by prior work. Unlike all of the above results on inhomogeneous models, our results do not assume that the variance profile has a block structure, and suggest that the same spectral algorithm might remain optimal for quite general profiles. We include a numerical study of this claim for an example of a smoothly-varying rather than piecewise-constant profile. Our proofs involve analyzing the graph sums of a matrix, which also appear in free and traffic probability, but we require new bounds on these quantities that are tighter than existing ones for non-negative matrices, which may be of independent interest.
