Adaptive randomized pivoting and volume sampling

Ethan N. Epperly

Adaptive randomized pivoting and volume sampling

Ethan N. Epperly

TL;DR

This work analyzes adaptive randomized pivoting (ARP) for column subset selection and reveals a fundamental link to volume sampling and projection DPPs. By showing ARP's sample is distributed as $\operatorname{VS}_k(\boldsymbol{Q})$, it transfers active linear-regression results to ARP, yielding near-optimal Frobenius-error guarantees and a clean interpretation of ARP as an interpolative decomposition. The authors then introduce fast ARP implementations based on rejection sampling (RejectionRPQR) and an oversampled sketchy variant (SkARP) using OSID, along with block and stability enhancements. They also establish end-to-end guarantees with sparse embeddings and demonstrate substantial practical speedups and competitive accuracy across experiments, highlighting ARP-family methods as fast and accurate tools for row-subset selection in large-scale settings.

Abstract

Adaptive randomized pivoting (ARP) is a recently proposed and highly effective algorithm for column subset selection. This paper reinterprets the ARP algorithm by drawing connections to the volume sampling distribution and active learning algorithms for linear regression. As consequences, this paper presents new analysis for the ARP algorithm and faster implementations using rejection sampling.

Adaptive randomized pivoting and volume sampling

TL;DR

This work analyzes adaptive randomized pivoting (ARP) for column subset selection and reveals a fundamental link to volume sampling and projection DPPs. By showing ARP's sample is distributed as

, it transfers active linear-regression results to ARP, yielding near-optimal Frobenius-error guarantees and a clean interpretation of ARP as an interpolative decomposition. The authors then introduce fast ARP implementations based on rejection sampling (RejectionRPQR) and an oversampled sketchy variant (SkARP) using OSID, along with block and stability enhancements. They also establish end-to-end guarantees with sparse embeddings and demonstrate substantial practical speedups and competitive accuracy across experiments, highlighting ARP-family methods as fast and accurate tools for row-subset selection in large-scale settings.

Adaptive randomized pivoting and volume sampling

TL;DR

Abstract

Adaptive randomized pivoting and volume sampling

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (11)