Embrace rejection: Kernel matrix approximation by accelerated randomly pivoted Cholesky
Ethan N. Epperly, Joel A. Tropp, Robert J. Webber
TL;DR
This work tackles the challenge of efficiently constructing low-rank PSD (kernel) approximations under a submatrix access model. It introduces accelerated RP-Chol-esky, which uses block pivots and rejection sampling to mimic the simple RP-Chol-esky pivot distribution, achieving substantial speedups (up to tens of times) without sacrificing approximation quality. Theoretical guarantees are developed via a novel expected residual function and permutation averaging, yielding a main error bound that informs critical parameter choices. Empirical results on a large kernel-matrix testbed and molecular PES problems demonstrate significant practical acceleration and reliable performance, with extensions to accelerated randomly pivoted QR. Overall, the method enables scalable kernel-based preconditioning and low-rank approximations for very large data sets in scientific computing contexts.
Abstract
Randomly pivoted Cholesky (RPCholesky) is an algorithm for constructing a low-rank approximation of a positive-semidefinite matrix using a small number of columns. This paper develops an accelerated version of RPCholesky that employs block matrix computations and rejection sampling to efficiently simulate the execution of the original algorithm. For the task of approximating a kernel matrix, the accelerated algorithm can run over $40\times$ faster. The paper contains implementation details, theoretical guarantees, experiments on benchmark data sets, and an application to computational chemistry.
