Make the most of what you have: Resource-efficient randomized algorithms for matrix computations
Ethan N. Epperly
TL;DR
This work develops resource-efficient randomized algorithms for matrix computations, unifying PSD low-rank approximation, implicit-matrix attribute estimation, and stabilized randomized least-squares. It introduces randomly pivoted Cholesky to achieve near-optimal column Nyström approximations using limited entry access, and leverages the Gram correspondence to connect projection and Nyström frameworks. It also analyzes leave-one-out estimators for implicit matrices, demonstrates backward-stable randomized LS, and extends these ideas to kernel methods and Gaussian processes, including infinite-dimensional extensions via rejection sampling. The combination of theoretical guarantees and practical algorithms offers scalable, accurate tools for kernelized learning, large-scale linear algebra, and related domains, with demonstrated benefits in kernel interpolation, GP regression, and preconditioning for KRR. The work thus provides a cohesive toolkit for deploying randomized matrix methods in practical scientific computing contexts.
Abstract
In recent years, randomized algorithms have established themselves as fundamental tools in computational linear algebra, with applications in scientific computing, machine learning, and quantum information science. Many randomized matrix algorithms proceed by first collecting information about a matrix and then processing that data to perform some computational task. This thesis addresses the following question: How can one design algorithms that use this information as efficiently as possible, reliably achieving the greatest possible speed and accuracy for a limited data budget? The first part of this thesis focuses on low-rank approximation for positive-semidefinite matrices. Here, the goal is to compute an accurate approximation to a matrix after accessing as few entries of the matrix as possible. This part of the thesis explores the randomly pivoted Cholesky (RPCholesky) algorithm for this task, which achieves a level of speed and reliability greater than other methods for the same problem. The second part of this thesis considers the task of estimating attributes of an implicit matrix accessible only by matrix-vector products. This thesis describes the leave-one-out approach to developing matrix attribute estimation algorithms and develops optimized trace, diagonal, and row-norm estimation algorithms. The third part of this thesis considers randomized algorithms for overdetermined linear least squares problems. Randomized algorithms for linear-least squares problems are asymptotically faster than any known deterministic algorithm, but recent work has raised questions about the accuracy of these methods in floating point arithmetic. This thesis shows these issues are resolvable by developing fast randomized least-squares problem achieving backward stability, the gold-standard stability guarantee for a numerical algorithm.
