Sharp Global Guarantees for Nonconvex Low-rank Recovery in the Noisy Overparameterized Regime
Richard Y. Zhang
TL;DR
This work addresses nonconvex low-rank recovery from noisy measurements under overparameterization. It introduces a unified strong duality framework connecting counterexamples and escape directions, and derives sharp global guarantees for both symmetric ($XX^{T}$) and balanced asymmetric ($UV^{T}$) parameterizations in the overparameterized regime. The results show that near-second-order points achieve minimax-optimal recovery bounds, with explicit dependence on the noise level and the overparameterization ratio $r/r^{\star}$, under RIP with constant $\delta$ and sample parameter $k$. The analysis extends to noisy settings and to the symmetric-to-asymmetric transition via a balancing regularizer, providing detailed sufficiency and necessity results and demonstrating the essential role of balancing regularization for the asymmetric case. Collectively, the paper offers a rigorous, sharp, and broadly applicable understanding of when and how overparameterization yields reliable recovery in nonconvex low-rank problems, with implications for algorithm design and theory.
Abstract
Recent work established that rank overparameterization eliminates spurious local minima in nonconvex low-rank matrix recovery under the restricted isometry property (RIP). But this does not fully explain the practical success of overparameterization, because real algorithms can still become trapped at nonstrict saddle points (approximate second-order points with arbitrarily small negative curvature) even when all local minima are global. Moreover, the result does not accommodate for noisy measurements, but it is unclear whether such an extension is even possible, in view of the many discontinuous and unintuitive behaviors already known for the overparameterized regime. In this paper, we introduce a novel proof technique that unifies, simplifies, and strengthens two previously competing approaches -- one based on escape directions and the other based on the inexistence of counterexample -- to provide sharp global guarantees in the noisy overparameterized regime. We show, once local minima have been converted into global minima through slight overparameterization, that near-second-order points achieve the same minimax-optimal recovery bounds (up to small constant factors) as significantly more expensive convex approaches. Our results are sharp with respect to the noise level and the solution accuracy, and hold for both the symmetric parameterization $XX^{T}$, as well as the asymmetric parameterization $UV^{T}$ under a balancing regularizer; we demonstrate that the balancing regularizer is indeed necessary.
