Gauss-Southwell type descent methods for low-rank matrix optimization
Guillaume Olikier, André Uschmajew, Bart Vandereycken
TL;DR
The paper addresses low-rank matrix optimization with a rank constraint by exploiting a factorized representation and two Gauss--Southwell-like descent schemes. It develops a balanced factorized method and an embedded Riemannian method based on tangent-space projections, proving global gradient-type convergence for both and a global $O(1/\sqrt{\ell})$ rate for the Riemannian variant. A local linear convergence result is established when a point has a positive-definite Riemannian Hessian, and the analysis yields novel convergence insights for alternating least squares. Numerical experiments show that the Riemannian-subspace approach is more robust to small singular values and ill-conditioning, with faster convergence and better stability than the balanced factorization, particularly under challenging conditioning and line-search settings.
Abstract
We consider gradient-related methods for low-rank matrix optimization with a smooth cost function. The methods operate on single factors of the low-rank factorization and share aspects of both alternating and Riemannian optimization. Two possible choices for the search directions based on Gauss-Southwell type selection rules are compared: one using the gradient of a factorized non-convex formulation, the other using the Riemannian gradient. While both methods provide gradient convergence guarantees that are similar to the unconstrained case, numerical experiments on a quadratic cost function indicate that the version based on the Riemannian gradient is significantly more robust with respect to small singular values and the condition number of the cost function. As a side result of our approach, we also obtain new convergence results for the alternating least squares method.
