Matrix perturbation analysis of methods for extracting singular values from approximate singular subspaces
Lorenzo Lazzarino, Hussam Al Daas, Yuji Nakatsukasa
TL;DR
The paper addresses how to quantify the accuracy of leading singular values retrieved from approximate left and right singular subspaces by recasting generalized Nyström as a structured matrix perturbation of the original matrix. It develops a sharp 2x2 block perturbation theory for singular values, extends it to block matrices, and provides an a posteriori computable version of the bounds. These results are applied to generalized Nyström and related extraction methods, enabling direct comparisons among GN, Rayleigh-Ritz, SVD, and HMT, with extensive numerical illustrations under both exponential and algebraic singular value decays. The findings offer practical guidance for selecting among singular-value extraction strategies in streaming and randomized settings and open avenues for improved computability and broader perturbation-based analyses.
Abstract
Given (orthonormal) approximations $\tilde{U}$ and $\tilde{V}$ to the left and right subspaces spanned by the leading singular vectors of a matrix $A$, we discuss methods to approximate the leading singular values of $A$ and study their accuracy. In particular, we focus our analysis on the generalized Nyström approximation, as surprisingly, it is able to obtain significantly better accuracy than classical methods, namely Rayleigh-Ritz and (one-sided) projected SVD. A key idea of the analysis is to view the methods as finding the exact singular values of a perturbation of $A$. In this context, we derive a matrix perturbation result that exploits the structure of such $2\times2$ block matrix perturbation. Furthermore, we extend it to block tridiagonal matrices. We then obtain bounds on the accuracy of the extracted singular values. This leads to sharp bounds that predict well the approximation error trends and explain the difference in the behavior of these methods. Finally, we present an approach to derive an a-posteriori version of those bounds, which are more amenable to computation in practice.
