Column and row subset selection using nuclear scores: algorithms and theory for Nyström approximation, CUR decomposition, and graph Laplacian reduction
Mark Fornace, Michael Lindsey
TL;DR
This work develops unified, structure-preserving column/row subset selection methods for Nyström approximation, CUR decomposition, and inverse graph-Laplacian rank reduction via nuclear-score maximization. It introduces deterministic and fully matrix-free algorithms, the latter relying on randomized diagonal estimation with concentration guarantees to bound scores while avoiding explicit formation of large kernel or Laplacian-derived operators. The authors provide extensive error analyses, connecting greedy nuclear maximization to DPP expectations and deriving submodularity-based bounds for the Laplacian case, with strong empirical results across kernel, CUR, and Laplacian tasks. The framework achieves competitive or superior accuracy with favorable computational scaling, supported by open-source implementations and demonstrations on large-scale problems.
Abstract
Column selection is an essential tool for structure-preserving low-rank approximation, with wide-ranging applications across many fields, such as data science, machine learning, and theoretical chemistry. In this work, we develop unified methodologies for fast, efficient, and theoretically guaranteed column selection. First we derive and implement a sparsity-exploiting deterministic algorithm applicable to tasks including kernel approximation and CUR decomposition. Next, we develop a matrix-free formalism relying on a randomization scheme satisfying guaranteed concentration bounds, applying this construction both to CUR decomposition and to the approximation of matrix functions of graph Laplacians. Importantly, the randomization is only relevant for the computation of the scores that we use for column selection, not the selection itself given these scores. For both deterministic and matrix-free algorithms, we bound the performance favorably relative to the expected performance of determinantal point process (DPP) sampling and, in select scenarios, that of exactly optimal subset selection. The general case requires new analysis of the DPP expectation. Finally, we demonstrate strong real-world performance of our algorithms on a diverse set of example approximation tasks.
