Adaptive randomized pivoting and volume sampling
Ethan N. Epperly
TL;DR
This work analyzes adaptive randomized pivoting (ARP) for column subset selection and reveals a fundamental link to volume sampling and projection DPPs. By showing ARP's sample is distributed as $\operatorname{VS}_k(\boldsymbol{Q})$, it transfers active linear-regression results to ARP, yielding near-optimal Frobenius-error guarantees and a clean interpretation of ARP as an interpolative decomposition. The authors then introduce fast ARP implementations based on rejection sampling (RejectionRPQR) and an oversampled sketchy variant (SkARP) using OSID, along with block and stability enhancements. They also establish end-to-end guarantees with sparse embeddings and demonstrate substantial practical speedups and competitive accuracy across experiments, highlighting ARP-family methods as fast and accurate tools for row-subset selection in large-scale settings.
Abstract
Adaptive randomized pivoting (ARP) is a recently proposed and highly effective algorithm for column subset selection. This paper reinterprets the ARP algorithm by drawing connections to the volume sampling distribution and active learning algorithms for linear regression. As consequences, this paper presents new analysis for the ARP algorithm and faster implementations using rejection sampling.
