Gradient-type subspace iteration methods for the symmetric eigenvalue problem
Foivos Alimisis, Yousef Saad, Bart Vandereycken
TL;DR
This work addresses computing the dominant invariant subspace of a real symmetric matrix $A$ by recasting the problem as optimization on the Grassmann manifold Gr$(n,p)$ of $p$-dimensional subspaces. It develops a gradient descent method and a Polak–Ribiere nonlinear conjugate gradient variant, both with an exact line search, and proves global convergence with local linear convergence in a neighborhood governed by the spectral gap $\delta$. The authors also provide practical implementation details to ensure accurate line searches and efficient matvecs, and they compare the methods against subspace iteration (with Chebyshev acceleration) and LOBCG across a suite of matrices, showing favorable performance in terms of matrix-vector products and runtime in many cases. These methods are particularly relevant for applications like SCF iterations in electronic structure calculations where spectrum information is not readily available and robustness to matrix changes is valuable.
Abstract
This paper explores variants of the subspace iteration algorithm for computing approximate invariant subspaces. The standard subspace iteration approach is revisited and new variants that exploit gradient-type techniques combined with a Grassmann manifold viewpoint are developed. A gradient method as well as a nonlinear conjugate gradient technique are described. Convergence of the gradient-based algorithm is analyzed and a few numerical experiments are reported, indicating that the proposed algorithms are sometimes superior to standard algorithms. This includes the Chebyshev-based subspace iteration and the locally optimal block conjugate gradient method, when compared in terms of number of matrix vector products and computational time, resp. The new methods, on the other hand, do not require estimating optimal parameters. An important contribution of this paper to achieve this good performance is the accurate and efficient implementation of an exact line search. In addition, new convergence proofs are presented for the non-accelerated gradient method that includes a locally exponential convergence if started in a $\mathcal{O(\sqrtδ)}$ neighbourhood of the dominant subspace with spectral gap $δ$.
