Robust Blockwise Random Pivoting: Fast and Accurate Adaptive Interpolative Decomposition

Yijun Dong; Chao Chen; Per-Gunnar Martinsson; Katherine Pearce

Robust Blockwise Random Pivoting: Fast and Accurate Adaptive Interpolative Decomposition

Yijun Dong, Chao Chen, Per-Gunnar Martinsson, Katherine Pearce

TL;DR

This work tackles the challenge of constructing fast, accurate, and robust interpolative decompositions (ID) by uniting adaptiveness and randomness through a new method called Robust Blockwise Random Pivoting (RBRP). RBRP introduces robust blockwise filtering to achieve hardware-efficient, rank-adaptive, and ID-revealing skeleton selection, addressing adversarial inputs that degrade plain blockwise methods. The authors also develop interpolation-matrix construction techniques, including ID-revealing approaches and oversampled sketchy ID (OSID) to stabilize and accelerate W, with OSID offering gamma-ID-revealing guarantees under randomized embeddings. Extensive experiments across synthetic adversarial data and real-world datasets demonstrate that RBRP delivers competitive skeleton quality and interpolation accuracy, while maintaining practical runtimes on CPU and GPU architectures. Overall, the paper provides a cohesive framework for fast, accurate, and robust ID that scales well and resists adversarial inputs, with direct implications for data selection, PDE solvers, and model compression tasks.

Abstract

The interpolative decomposition (ID) aims to construct a low-rank approximation formed by a basis consisting of row/column skeletons in the original matrix and a corresponding interpolation matrix. This work explores fast and accurate ID algorithms from comprehensive perspectives for empirical performance, including accuracy in both skeleton selection and interpolation matrix construction, efficiency in terms of asymptotic complexity and hardware efficiency, as well as rank adaptiveness. While many algorithms have been developed to optimize some of these aspects, practical ID algorithms proficient in all aspects remain absent. To fill in the gap, we introduce robust blockwise random pivoting (RBRP) that is asymptotically fast, hardware-efficient, and rank-adaptive, providing accurate skeletons and interpolation matrices comparable to the best existing ID algorithms in practice. Through extensive numerical experiments on various synthetic and natural datasets, we demonstrate the appealing empirical performance of RBRP from the aforementioned perspectives, as well as the robustness of RBRP to adversarial inputs.

Robust Blockwise Random Pivoting: Fast and Accurate Adaptive Interpolative Decomposition

TL;DR

Abstract

Paper Structure (18 sections, 3 theorems, 17 equations, 7 figures, 3 tables, 4 algorithms)

This paper contains 18 sections, 3 theorems, 17 equations, 7 figures, 3 tables, 4 algorithms.

Introduction
What are Fast and Accurate ID Algorithms?
How to Combine Adaptiveness and Randomness for ID?
Notations and Roadmap
Adaptiveness vs. Randomness
Adaptiveness via Greedy Squared-norm Pivoting
Randomness via Squared-norm Sampling
Combining Adaptiveness and Randomness
Random Pivoting
Sketchy Pivoting
Robust Blockwise Random Pivoting
Interpolation Matrix Construction
ID-revealing Algorithms
Oversampled Sketchy ID for Sketchy Pivoting
Experiments
...and 3 more sections

Key Result

Proposition 3.1

\newlabelprop:sas_skeleton_complexity0 For any $\epsilon > 0$ and $r \in [n]$, the skeleton subset $S$ selected by algo:pqr_id provides $\mathbb{E}\left[\mathcal{E}_{\mathbf{X}}\left(S\right)\right] \leqslant \left(1 + \epsilon\right) \left\|\mathbf{X} - {\langle\mathbf{X}\rangle}_{r}\right\|_F^2$

Figures (7)

Figure 1: The relative skeletonization error $\mathcal{E}_{\mathbf{X}}\left(S\right)/\|\mathbf{X}\|_F^2$ and skeleton selection time (excluding interpolation matrix construction) of algorithms in \ref{['tab:abbrev_skeleton_selection']} on the adversarial GMM matrix described in \ref{['subsec:data_matrices']} constructed according to \ref{['ex:gmm_pitfall_plain_bas']}. For the sketchy pivoting methods, we only show the results of SkLUPP, as SkCPQR is known to have similar skeleton complexities as SkLUPP but with higher runtimes dong2021simpler.
Figure 1: Relative interpolation error and total runtime of ID algorithms on the GMM adversarial input (\ref{['ex:gmm_pitfall_plain_bas']}). Recall from \ref{['fig:id_gmm_rpgp']} that in lack of adaptiveness, methods like SqNorm and BRP tend to suffer from suboptimal skeleton complexities on GMM.
Figure 2: Relative interpolation error and runtime of ID algorithms on MNIST.
Figure 3: Relative interpolation error and runtime of ID algorithms on CIFAR-10.
Figure 4: Relative interpolation error and runtime of ID algorithms on Gaussian-exp.
...and 2 more figures

Theorems & Definitions (19)

Remark 2.1: Greedy squared-norm pivoting is vulnerable but empirically successful
Remark 2.2: CPQR is inherently sequential but rank-adaptive
Example 2.1: Adversarial input for squared-norm sampling
Proposition 3.1: Sequential random pivoting chen2022randomly
Remark 3.1: Inefficiency of sequential updates
Remark 3.2: Sketchy pivoting v.s. random pivoting
Example 4.1: Pitfall of plain blockwise random pivoting
Remark 4.1: Robust blockwise filtering
Remark 4.2: Blockwise greedy pivoting
Conjecture 4.1: Skeleton complexity of RBRP
...and 9 more

Robust Blockwise Random Pivoting: Fast and Accurate Adaptive Interpolative Decomposition

TL;DR

Abstract

Robust Blockwise Random Pivoting: Fast and Accurate Adaptive Interpolative Decomposition

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (19)