Does block size matter in randomized block Krylov low-rank approximation?

Tyler Chen; Ethan N. Epperly; Raphael A. Meyer; Christopher Musco; Akash Rao

Does block size matter in randomized block Krylov low-rank approximation?

Tyler Chen, Ethan N. Epperly, Raphael A. Meyer, Christopher Musco, Akash Rao

TL;DR

This paper resolves a long-standing gap between theory and practice for randomized block Krylov iteration (RBKI) in low-rank approximation. It proves a unified, gap-independent bound showing that RBKI with any block size $1\le b\le k$ achieves a $(1+\varepsilon)$-approximation to the top-$k$ components using $\tilde{O}(k/\sqrt{\varepsilon})$ matrix-vector products, with high probability. The key technical contribution is a sharp bound on the minimum singular value of a random block Krylov matrix, built via a Vandermonde-form reduction, anti-concentration (Nie 2022) arguments, and a PV-style decomposition, which may be of independent interest beyond LRA. The results justify using intermediate block sizes in practice, offer gap-dependent refinements, and connect to smoothed-analysis ideas to guarantee well-conditioned behavior in realistic settings, potentially accelerating large-scale matrix computations and related sparse linear-system solvers.

Abstract

We study the problem of computing a rank-$k$ approximation of a matrix using randomized block Krylov iteration. Prior work has shown that, for block size $b = 1$ or $b = k$, a $(1 + \varepsilon)$-factor approximation to the best rank-$k$ approximation can be obtained after $\tilde O(k/\sqrt{\varepsilon})$ matrix-vector products with the target matrix. On the other hand, when $b$ is between $1$ and $k$, the best known bound on the number of matrix-vector products scales with $b(k-b)$, which could be as large as $O(k^2)$. Nevertheless, in practice, the performance of block Krylov methods is often optimized by choosing a block size $1 \ll b \ll k$. We resolve this theory-practice gap by proving that randomized block Krylov iteration produces a $(1 + \varepsilon)$-factor approximate rank-$k$ approximation using $\tilde O(k/\sqrt{\varepsilon})$ matrix-vector products for any block size $1\le b\le k$. Our analysis relies on new bounds for the minimum singular value of a random block Krylov matrix, which may be of independent interest. Similar bounds are central to recent breakthroughs on faster algorithms for sparse linear systems [Peng & Vempala, SODA 2021; Nie, STOC 2022].

Does block size matter in randomized block Krylov low-rank approximation?

TL;DR

achieves a

-approximation to the top-

components using

matrix-vector products, with high probability. The key technical contribution is a sharp bound on the minimum singular value of a random block Krylov matrix, built via a Vandermonde-form reduction, anti-concentration (Nie 2022) arguments, and a PV-style decomposition, which may be of independent interest beyond LRA. The results justify using intermediate block sizes in practice, offer gap-dependent refinements, and connect to smoothed-analysis ideas to guarantee well-conditioned behavior in realistic settings, potentially accelerating large-scale matrix computations and related sparse linear-system solvers.

Abstract

We study the problem of computing a rank-

approximation of a matrix using randomized block Krylov iteration. Prior work has shown that, for block size

, a

-factor approximation to the best rank-

approximation can be obtained after

matrix-vector products with the target matrix. On the other hand, when

is between

and

, the best known bound on the number of matrix-vector products scales with

, which could be as large as

. Nevertheless, in practice, the performance of block Krylov methods is often optimized by choosing a block size

. We resolve this theory-practice gap by proving that randomized block Krylov iteration produces a

-factor approximate rank-

approximation using

matrix-vector products for any block size

. Our analysis relies on new bounds for the minimum singular value of a random block Krylov matrix, which may be of independent interest. Similar bounds are central to recent breakthroughs on faster algorithms for sparse linear systems [Peng & Vempala, SODA 2021; Nie, STOC 2022].

Does block size matter in randomized block Krylov low-rank approximation?

TL;DR

Abstract

Does block size matter in randomized block Krylov low-rank approximation?

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (26)