Table of Contents
Fetching ...

Tight Sampling Bounds for Eigenvalue Approximation

William Swartworth, David P. Woodruff

Abstract

We consider the problem of estimating the spectrum of a symmetric bounded entry (not necessarily PSD) matrix via entrywise sampling. This problem was introduced by [Bhattacharjee, Dexter, Drineas, Musco, Ray '22], where it was shown that one can obtain an $εn$ additive approximation to all eigenvalues of $A$ by sampling a principal submatrix of dimension $\frac{\text{poly}(\log n)}{ε^3}$. We improve their analysis by showing that it suffices to sample a principal submatrix of dimension $\tilde{O}(\frac{1}{ε^2})$ (with no dependence on $n$). This matches known lower bounds and therefore resolves the sample complexity of this problem up to $\log\frac{1}ε$ factors. Using similar techniques, we give a tight $\tilde{O}(\frac{1}{ε^2})$ bound for obtaining an additive $ε\|A\|_F$ approximation to the spectrum of $A$ via squared row-norm sampling, improving on the previous best $\tilde{O}(\frac{1}{ε^{8}})$ bound. We also address the problem of approximating the top eigenvector for a bounded entry, PSD matrix $A.$ In particular, we show that sampling $O(\frac{1}ε)$ columns of $A$ suffices to produce a unit vector $u$ with $u^T A u \geq λ_1(A) - εn$. This matches what one could achieve via the sampling bound of [Musco, Musco'17] for the special case of approximating the top eigenvector, but does not require adaptivity. As additional applications, we observe that our sampling results can be used to design a faster eigenvalue estimation sketch for dense matrices resolving a question of [Swartworth, Woodruff'23], and can also be combined with [Musco, Musco'17] to achieve $O(1/ε^3)$ (adaptive) sample complexity for approximating the spectrum of a bounded entry PSD matrix to $εn$ additive error.

Tight Sampling Bounds for Eigenvalue Approximation

Abstract

We consider the problem of estimating the spectrum of a symmetric bounded entry (not necessarily PSD) matrix via entrywise sampling. This problem was introduced by [Bhattacharjee, Dexter, Drineas, Musco, Ray '22], where it was shown that one can obtain an additive approximation to all eigenvalues of by sampling a principal submatrix of dimension . We improve their analysis by showing that it suffices to sample a principal submatrix of dimension (with no dependence on ). This matches known lower bounds and therefore resolves the sample complexity of this problem up to factors. Using similar techniques, we give a tight bound for obtaining an additive approximation to the spectrum of via squared row-norm sampling, improving on the previous best bound. We also address the problem of approximating the top eigenvector for a bounded entry, PSD matrix In particular, we show that sampling columns of suffices to produce a unit vector with . This matches what one could achieve via the sampling bound of [Musco, Musco'17] for the special case of approximating the top eigenvector, but does not require adaptivity. As additional applications, we observe that our sampling results can be used to design a faster eigenvalue estimation sketch for dense matrices resolving a question of [Swartworth, Woodruff'23], and can also be combined with [Musco, Musco'17] to achieve (adaptive) sample complexity for approximating the spectrum of a bounded entry PSD matrix to additive error.

Paper Structure

This paper contains 39 sections, 35 theorems, 114 equations, 1 table, 4 algorithms.

Key Result

Theorem 2.2

Let $A$ be a symmetric matrix with all entries bounded by $1$. Then Algorithm alg:uniform_sampling with $s\geq c \frac{1}{\epsilon^2}\log^2\frac{1}{\epsilon}$ (where $c$ is an absolute constant) outputs an additive $\epsilon n$ approximation to the spectrum of $A$, with at least $2/3$ probability.

Theorems & Definitions (67)

  • Definition 2.1
  • Theorem 2.2
  • Theorem 2.3
  • Theorem 2.4
  • Definition 3.1
  • Definition 3.2
  • Lemma 3.3
  • proof
  • Theorem 3.4
  • Lemma 3.5
  • ...and 57 more