Table of Contents
Fetching ...

Adaptive randomized pivoting for column subset selection, DEIM, and low-rank approximation

Alice Cortinovis, Daniel Kressner

TL;DR

This paper introduces Adaptive Randomized Pivoting (ARP), a simple, adaptive leverage-score sampling method for the Column Subset Selection Problem that guarantees, in expectation, an error bound of the form $\\mathbb{E}\\|A - \\Pi_J A\\|_F^2 \\le (r+1) \\|A - A V V^T\\|_F^2$, matching the optimal rank-$r$ approximation up to a constant factor. ARP leverages adaptive sampling based on the row norms of a provided row-space basis $V$, avoiding the need to form $A V$ and enabling efficient applications to DEIM, cross/skeleton approximation, and Nyström for SPSD matrices; a derandomized variant recovers Osinsky’s deterministic algorithm with favorable bounds. The method is implemented stably via Householder reflections, with computational cost $\\mathcal{O}(n r^2)$, and is supported by a thorough analysis using oblique projections and conditional expectations. Numerical experiments validate ARP and its variants across CSSP, DEIM, cross approximation, and Nyström, showing competitive accuracy and clear advantages in simplicity and efficiency. Derandomization further yields deterministic algorithms with strong error guarantees, particularly for SPSD Nyström, making ARP a versatile tool for large-scale matrix approximations.

Abstract

We derive a new adaptive leverage score sampling strategy for solving the Column Subset Selection Problem (CSSP). The resulting algorithm, called Adaptive Randomized Pivoting, can be viewed as a randomization of Osinsky's recently proposed deterministic algorithm for CSSP. It guarantees, in expectation, an approximation error that matches the optimal existence result in the Frobenius norm. Although the same guarantee can be achieved with volume sampling, our sampling strategy is much simpler and less expensive. To show the versatility of Adaptive Randomized Pivoting, we apply it to select indices in the Discrete Empirical Interpolation Method, in cross/skeleton approximation of general matrices, and in the Nystroem approximation of symmetric positive semi-definite matrices. In all these cases, the resulting randomized algorithms are new and they enjoy bounds on the expected error that match -- or improve -- the best known deterministic results. A derandomization of the algorithm for the Nystroem approximation results in a new deterministic algorithm with a rather favorable error bound.

Adaptive randomized pivoting for column subset selection, DEIM, and low-rank approximation

TL;DR

This paper introduces Adaptive Randomized Pivoting (ARP), a simple, adaptive leverage-score sampling method for the Column Subset Selection Problem that guarantees, in expectation, an error bound of the form , matching the optimal rank- approximation up to a constant factor. ARP leverages adaptive sampling based on the row norms of a provided row-space basis , avoiding the need to form and enabling efficient applications to DEIM, cross/skeleton approximation, and Nyström for SPSD matrices; a derandomized variant recovers Osinsky’s deterministic algorithm with favorable bounds. The method is implemented stably via Householder reflections, with computational cost , and is supported by a thorough analysis using oblique projections and conditional expectations. Numerical experiments validate ARP and its variants across CSSP, DEIM, cross approximation, and Nyström, showing competitive accuracy and clear advantages in simplicity and efficiency. Derandomization further yields deterministic algorithms with strong error guarantees, particularly for SPSD Nyström, making ARP a versatile tool for large-scale matrix approximations.

Abstract

We derive a new adaptive leverage score sampling strategy for solving the Column Subset Selection Problem (CSSP). The resulting algorithm, called Adaptive Randomized Pivoting, can be viewed as a randomization of Osinsky's recently proposed deterministic algorithm for CSSP. It guarantees, in expectation, an approximation error that matches the optimal existence result in the Frobenius norm. Although the same guarantee can be achieved with volume sampling, our sampling strategy is much simpler and less expensive. To show the versatility of Adaptive Randomized Pivoting, we apply it to select indices in the Discrete Empirical Interpolation Method, in cross/skeleton approximation of general matrices, and in the Nystroem approximation of symmetric positive semi-definite matrices. In all these cases, the resulting randomized algorithms are new and they enjoy bounds on the expected error that match -- or improve -- the best known deterministic results. A derandomization of the algorithm for the Nystroem approximation results in a new deterministic algorithm with a rather favorable error bound.

Paper Structure

This paper contains 20 sections, 9 theorems, 75 equations, 4 figures, 5 algorithms.

Key Result

Lemma 2.1

\newlabellem:propV0 For the matrices generated by Algorithm alg:ARPnew we have:

Figures (4)

  • Figure 1: Illustration of Algorithm \ref{['alg:ARP']} applied to a matrix $V \in \mathbb{R}^{8 \times 3}$.
  • Figure 1: Comparison of different CSSP strategies; the error is $\|A - A(:,J)A(:,J)^\dagger A\|_F / \|A\|_F$.
  • Figure 2: Numerical results for DEIM (left) and cross approximation (right).
  • Figure 3: Selection of column indices for SPSD matrices: smile (left) and spiral (right); see Section \ref{['sec:example_SPSD']} for details. The figure in the middle illustrates the choice of points made by (one instance of) ARP (red $\times$) and Algorithm \ref{['alg:derandomizedSPSD']} (blue $+$) with rank $30$.

Theorems & Definitions (19)

  • Lemma 2.1
  • Proof 1
  • Remark 2.2
  • Lemma 2.3
  • Proof 2
  • Theorem 2.4
  • Proof 3
  • Corollary 2.5
  • Corollary 2.6
  • Proof 4
  • ...and 9 more