Sharp error bounds for approximate eigenvalues and singular values from subspace methods
Irina-Beatrice Haas, Yuji Nakatsukasa
TL;DR
The paper develops sharp quadratic error bounds for Ritz eigenvalues derived from subspace methods, showing that the error | \\lambda_i - \\theta_i| scales like the square of the corresponding residual divided by a robust spectral gap, | \\lambda_i - \\theta_i| \\le c \\|E_i\\|_2^2 / \\text{Gap}_i, with c \ ightarrow 1 as the residuals vanish. The approach exploits the structured perturbation inherent in Rayleigh-Ritz and extends to singular values via the Jordan-Wielandt theorem, yielding analogous bounds for SVD components. The results are adapted to well-separated Ritz values as well as clusters, and the asymptotic sharpness is established, demonstrating improvements over classical bounds in practical, large-scale computations. Numerical experiments with Krylov methods (e.g., Lanczos, LOBPCG) and randomized SVD (HMT) validate the bounds and show they are tight and computable from available residual and gap information, supporting their use as reliable error certificates in large-scale eigenvalue and singular value computations.
Abstract
Subspace methods are commonly used for finding approximate eigenvalues and singular values of large-scale matrices. Once a subspace is found, the Rayleigh-Ritz method (for symmetric eigenvalue problems) and Petrov-Galerkin projection (for singular values) are the de facto method for extraction of eigenvalues and singular values. In this work we derive quadratic error bounds for approximate eigenvalues of symmetric matrices obtained via the Rayleigh-Ritz process. Our bounds take advantage of the fact that extremal eigenpairs tend to converge faster than the rest, hence having smaller residuals $\|A\widehat x_i-θ_i\widehat x_i\|_2$, where $(θ_i,\widehat x_i)$ is a Ritz pair (approximate eigenpair). The proof uses the structure of the perturbation matrix underlying the Rayleigh-Ritz method to bound the components of its eigenvectors. In this way, we obtain a bound of the form $c\frac{\|A\widehat x_i-θ_i\widehat x_i\|_2^2}{\mbox{Gap}_i}$, where $\mbox{Gap}_i$ is roughly the gap between the $i$th Ritz value and the eigenvalues that are not approximated by the Ritz process, and $c> 1$ is a modest scalar. Our bound is adapted to each Ritz value and is robust to clustered Ritz values, which is a key improvement over existing results. We further show that the bound is asymptotically sharp, and generalize it to singular values of arbitrary real matrices. Finally, we apply these bounds to several methods for computing eigenvalues and singular values, and illustrate the sharpness of our bounds in a number of computational settings, including Krylov methods and randomized algorithms.
