Algorithm-agnostic low-rank approximation of operator monotone matrix functions
David Persson, Raphael A. Meyer, Christopher Musco
TL;DR
Given a symmetric positive semidefinite matrix $\mathbf{A}$ and an operator-monotone function $f$ with $f(0)=0$, the paper shows that a near-optimal Nyström approximation $\widehat{\mathbf{A}}$ yields a near-optimal low-rank funNyström approximation $f(\widehat{\mathbf{A}})_{(k)}$ to $f(\mathbf{A})$ across nuclear, Frobenius, and operator norms, as well as eigenvalue estimates. The main advance is a general, construction-agnostic theory: if $\mathbf{A} \succeq \widehat{\mathbf{A}} \succeq 0$ and $\widehat{\mathbf{A}}_{(k)}$ is near-optimal, then $f(\widehat{\mathbf{A}})_{(k)}$ remains near-optimal for any continuous operator-monotone $f$, not just for subspace iteration. The results extend to many common Nyström-generation schemes (Krylov, sampling, and column-selection) by tying their guarantees to projection-based near-optimality, and they include explicit eigenvalue guarantees and a discussion of when the assumptions are necessary. Overall, the work enables reliable, inexpensive low-rank approximations of a broad class of matrix functions without computing $f(\mathbf{A})$ directly, with clear implications for high-dimensional computations and spectral estimation.
Abstract
Low-rank approximation of a matrix function, $f(A)$, is an important task in computational mathematics. Most methods require direct access to $f(A)$, which is often considerably more expensive than accessing $A$. Persson and Kressner (SIMAX 2023) avoid this issue for symmetric positive semidefinite matrices by proposing funNyström, which first constructs a Nyström approximation to $A$ using subspace iteration, and then uses the approximation to directly obtain a low-rank approximation for $f(A)$. They prove that the method yields a near-optimal approximation whenever $f$ is a continuous operator monotone function with $f(0) = 0$. We significantly generalize the results of Persson and Kressner beyond subspace iteration. We show that if $\widehat{A}$ is a near-optimal low-rank Nyström approximation to $A$ then $f(\widehat{A})$ is a near-optimal low-rank approximation to $f(A)$, independently of how $\widehat{A}$ is computed. Further, we show sufficient conditions for a basis $Q$ to produce a near-optimal Nyström approximation $\widehat{A} = AQ(Q^T AQ)^{\dagger} Q^T A$. We use these results to establish that many common low-rank approximation methods produce near-optimal Nyström approximations to $A$ and therefore to $f(A)$.
