Robust Blockwise Random Pivoting: Fast and Accurate Adaptive Interpolative Decomposition
Yijun Dong, Chao Chen, Per-Gunnar Martinsson, Katherine Pearce
TL;DR
This work tackles the challenge of constructing fast, accurate, and robust interpolative decompositions (ID) by uniting adaptiveness and randomness through a new method called Robust Blockwise Random Pivoting (RBRP). RBRP introduces robust blockwise filtering to achieve hardware-efficient, rank-adaptive, and ID-revealing skeleton selection, addressing adversarial inputs that degrade plain blockwise methods. The authors also develop interpolation-matrix construction techniques, including ID-revealing approaches and oversampled sketchy ID (OSID) to stabilize and accelerate W, with OSID offering gamma-ID-revealing guarantees under randomized embeddings. Extensive experiments across synthetic adversarial data and real-world datasets demonstrate that RBRP delivers competitive skeleton quality and interpolation accuracy, while maintaining practical runtimes on CPU and GPU architectures. Overall, the paper provides a cohesive framework for fast, accurate, and robust ID that scales well and resists adversarial inputs, with direct implications for data selection, PDE solvers, and model compression tasks.
Abstract
The interpolative decomposition (ID) aims to construct a low-rank approximation formed by a basis consisting of row/column skeletons in the original matrix and a corresponding interpolation matrix. This work explores fast and accurate ID algorithms from comprehensive perspectives for empirical performance, including accuracy in both skeleton selection and interpolation matrix construction, efficiency in terms of asymptotic complexity and hardware efficiency, as well as rank adaptiveness. While many algorithms have been developed to optimize some of these aspects, practical ID algorithms proficient in all aspects remain absent. To fill in the gap, we introduce robust blockwise random pivoting (RBRP) that is asymptotically fast, hardware-efficient, and rank-adaptive, providing accurate skeletons and interpolation matrices comparable to the best existing ID algorithms in practice. Through extensive numerical experiments on various synthetic and natural datasets, we demonstrate the appealing empirical performance of RBRP from the aforementioned perspectives, as well as the robustness of RBRP to adversarial inputs.
