Riemannian multigrid line search for low-rank problems
Marco Sutti, Bart Vandereycken
TL;DR
This work develops Riemannian multigrid line search (RMGLS) for large-scale variational problems that admit low-rank representations. By optimizing directly on the fixed-rank matrix manifold ${\mathcal{M}}_{k}$ and integrating a multilevel full-approximation scheme, the method achieves near mesh-independent convergence with fixed rank throughout iterations. A key contribution is extending the Hager--Zhang line search to the Riemannian setting, enabling machine-precision accuracy despite first-order information. Numerical experiments on linear and nonlinear PDE-inspired problems demonstrate robust performance, effective rank-adaptivity, and clear advantages over Euclidean multilevel and fixed-rank alternatives. The approach offers scalable, accurate low-rank solutions for high-dimensional variational problems arising from PDE discretizations.
Abstract
Large-scale optimization problems arising from the discretization of problems involving PDEs sometimes admit solutions that can be well approximated by low-rank matrices. In this paper, we will exploit this low-rank approximation property by solving the optimization problem directly over the set of low-rank matrices. In particular, we introduce a new multilevel algorithm, where the optimization variable is constrained to the Riemannian manifold of fixed-rank matrices. In contrast to most other multilevel algorithms where the rank is chosen adaptively on each level in order to control the perturbation due to the low-rank truncation, we can keep the ranks (and thus the computational complexity) fixed throughout the iterations. Furthermore, classical implementations of line searches based on Wolfe conditions enable computing a solution where the numerical accuracy is limited to about the square root of the machine epsilon. Here, we propose an extension to Riemannian manifolds of the line search of Hager and Zhang, which uses approximate Wolfe conditions that enable computing a solution on the order of the machine epsilon. Numerical experiments demonstrate the computational efficiency of the proposed framework.
