Does block size matter in randomized block Krylov low-rank approximation?
Tyler Chen, Ethan N. Epperly, Raphael A. Meyer, Christopher Musco, Akash Rao
TL;DR
This paper resolves a long-standing gap between theory and practice for randomized block Krylov iteration (RBKI) in low-rank approximation. It proves a unified, gap-independent bound showing that RBKI with any block size $1\le b\le k$ achieves a $(1+\varepsilon)$-approximation to the top-$k$ components using $\tilde{O}(k/\sqrt{\varepsilon})$ matrix-vector products, with high probability. The key technical contribution is a sharp bound on the minimum singular value of a random block Krylov matrix, built via a Vandermonde-form reduction, anti-concentration (Nie 2022) arguments, and a PV-style decomposition, which may be of independent interest beyond LRA. The results justify using intermediate block sizes in practice, offer gap-dependent refinements, and connect to smoothed-analysis ideas to guarantee well-conditioned behavior in realistic settings, potentially accelerating large-scale matrix computations and related sparse linear-system solvers.
Abstract
We study the problem of computing a rank-$k$ approximation of a matrix using randomized block Krylov iteration. Prior work has shown that, for block size $b = 1$ or $b = k$, a $(1 + \varepsilon)$-factor approximation to the best rank-$k$ approximation can be obtained after $\tilde O(k/\sqrt{\varepsilon})$ matrix-vector products with the target matrix. On the other hand, when $b$ is between $1$ and $k$, the best known bound on the number of matrix-vector products scales with $b(k-b)$, which could be as large as $O(k^2)$. Nevertheless, in practice, the performance of block Krylov methods is often optimized by choosing a block size $1 \ll b \ll k$. We resolve this theory-practice gap by proving that randomized block Krylov iteration produces a $(1 + \varepsilon)$-factor approximate rank-$k$ approximation using $\tilde O(k/\sqrt{\varepsilon})$ matrix-vector products for any block size $1\le b\le k$. Our analysis relies on new bounds for the minimum singular value of a random block Krylov matrix, which may be of independent interest. Similar bounds are central to recent breakthroughs on faster algorithms for sparse linear systems [Peng & Vempala, SODA 2021; Nie, STOC 2022].
