Even Faster Kernel Matrix Linear Algebra via Density Estimation
Rikhav Shah, Sandeep Silwal, Haike Xu
TL;DR
This work develops subquadratic, KDE-based algorithms for core kernel-matrix tasks by accessing the kernel matrix only through fast KDE structures. The authors introduce a rapid non-negative MVP method with per-coordinate guarantees, and extend this to faster approximate matrix multiplication, top eigenvalue estimation via a refined noisy power method, and efficient computation of the kernel sum $s(K)$. They provide tighter upper bounds that improve over prior work while establishing conditional lower bounds (via SETH and OV reductions) that illuminate the limits of KDE-based approaches, especially for negative vectors or asymmetric kernel matrices. The results collectively advance the practical, theory-backed efficiency of kernel methods in high dimensions and large datasets, with clear implications for kernel alignment, spectral methods, and KDE-assisted linear algebra. The work also clarifies the trade-offs between accuracy, dimensionality, and sample complexity, pointing to open questions about the ultimate limits of subquadratic kernel computations.
Abstract
This paper studies the use of kernel density estimation (KDE) for linear algebraic tasks involving the kernel matrix of a collection of $n$ data points in $\mathbb R^d$. In particular, we improve upon existing algorithms for computing the following up to $(1+\varepsilon)$ relative error: matrix-vector products, matrix-matrix products, the spectral norm, and sum of all entries. The runtimes of our algorithms depend on the dimension $d$, the number of points $n$, and the target error $\varepsilon$. Importantly, the dependence on $n$ in each case is far lower when accessing the kernel matrix through KDE queries as opposed to reading individual entries. Our improvements over existing best algorithms (particularly those of Backurs, Indyk, Musco, and Wagner '21) for these tasks reduce the polynomial dependence on $\varepsilon$, and additionally decreases the dependence on $n$ in the case of computing the sum of all entries of the kernel matrix. We complement our upper bounds with several lower bounds for related problems, which provide (conditional) quadratic time hardness results and additionally hint at the limits of KDE based approaches for the problems we study.
