Table of Contents
Fetching ...

Optimal Sketching for Residual Error Estimation for Matrix and Vector Norms

Yi Li, Honghao Lin, David P. Woodruff

TL;DR

The paper addresses fast, memory-efficient residual error estimation for matrix and vector norms using oblivious bilinear sketches. It proves a tight $Θ(k^2/ε^4)$ bound on the sketch size for the matrix residual problem and provides matching lower bounds, while enabling sparse, fast-to-update constructions via PCP-based methods. It also extends the framework to $F_p$ residuals with $p>2$ by giving near-optimal space for sparse recovery, including a matching lower bound, and demonstrates practical gains through experiments that show substantial speedups over prior approaches. These results yield a practical toolkit for deciding when to invest in expensive low-rank or sparse recovery computations in large-scale or streaming settings. Overall, the work advances the theory and practice of residual-error-aware sketching for both matrix and vector norms with strong theoretical guarantees and empirical validation.

Abstract

We study the problem of residual error estimation for matrix and vector norms using a linear sketch. Such estimates can be used, for example, to quickly assess how useful a more expensive low-rank approximation computation will be. The matrix case concerns the Frobenius norm and the task is to approximate the $k$-residual $\|A - A_k\|_F$ of the input matrix $A$ within a $(1+ε)$-factor, where $A_k$ is the optimal rank-$k$ approximation. We provide a tight bound of $Θ(k^2/ε^4)$ on the size of bilinear sketches, which have the form of a matrix product $SAT$. This improves the previous $O(k^2/ε^6)$ upper bound in (Andoni et al. SODA 2013) and gives the first non-trivial lower bound, to the best of our knowledge. In our algorithm, our sketching matrices $S$ and $T$ can both be sparse matrices, allowing for a very fast update time. We demonstrate that this gives a substantial advantage empirically, for roughly the same sketch size and accuracy as in previous work. For the vector case, we consider the $\ell_p$-norm for $p>2$, where the task is to approximate the $k$-residual $\|x - x_k\|_p$ up to a constant factor, where $x_k$ is the optimal $k$-sparse approximation to $x$. Such vector norms are frequently studied in the data stream literature and are useful for finding frequent items or so-called heavy hitters. We establish an upper bound of $O(k^{2/p}n^{1-2/p}\operatorname{poly}(\log n))$ for constant $ε$ on the dimension of a linear sketch for this problem. Our algorithm can be extended to the $\ell_p$ sparse recovery problem with the same sketching dimension, which seems to be the first such bound for $p > 2$. We also show an $Ω(k^{2/p}n^{1-2/p})$ lower bound for the sparse recovery problem, which is tight up to a $\mathrm{poly}(\log n)$ factor.

Optimal Sketching for Residual Error Estimation for Matrix and Vector Norms

TL;DR

The paper addresses fast, memory-efficient residual error estimation for matrix and vector norms using oblivious bilinear sketches. It proves a tight bound on the sketch size for the matrix residual problem and provides matching lower bounds, while enabling sparse, fast-to-update constructions via PCP-based methods. It also extends the framework to residuals with by giving near-optimal space for sparse recovery, including a matching lower bound, and demonstrates practical gains through experiments that show substantial speedups over prior approaches. These results yield a practical toolkit for deciding when to invest in expensive low-rank or sparse recovery computations in large-scale or streaming settings. Overall, the work advances the theory and practice of residual-error-aware sketching for both matrix and vector norms with strong theoretical guarantees and empirical validation.

Abstract

We study the problem of residual error estimation for matrix and vector norms using a linear sketch. Such estimates can be used, for example, to quickly assess how useful a more expensive low-rank approximation computation will be. The matrix case concerns the Frobenius norm and the task is to approximate the -residual of the input matrix within a -factor, where is the optimal rank- approximation. We provide a tight bound of on the size of bilinear sketches, which have the form of a matrix product . This improves the previous upper bound in (Andoni et al. SODA 2013) and gives the first non-trivial lower bound, to the best of our knowledge. In our algorithm, our sketching matrices and can both be sparse matrices, allowing for a very fast update time. We demonstrate that this gives a substantial advantage empirically, for roughly the same sketch size and accuracy as in previous work. For the vector case, we consider the -norm for , where the task is to approximate the -residual up to a constant factor, where is the optimal -sparse approximation to . Such vector norms are frequently studied in the data stream literature and are useful for finding frequent items or so-called heavy hitters. We establish an upper bound of for constant on the dimension of a linear sketch for this problem. Our algorithm can be extended to the sparse recovery problem with the same sketching dimension, which seems to be the first such bound for . We also show an lower bound for the sparse recovery problem, which is tight up to a factor.
Paper Structure (9 sections, 18 theorems, 19 equations, 1 table, 1 algorithm)

This paper contains 9 sections, 18 theorems, 19 equations, 1 table, 1 algorithm.

Key Result

Lemma 2.1

Let $G$ be an $m\times n$ ($m\geq n$) Gaussian random matrix of i.i.d. $N(0,1)$ entries. It holds with probability at least $1-\exp(-ct^2)$ that $\sqrt{m} - C\sqrt{n} - t \leq \sigma_{\min}(G)\leq \sigma_{\max}(G)\leq \sqrt{m} + C\sqrt{n} + t$, where $C,c>0$ are absolute constants.

Theorems & Definitions (27)

  • Lemma 2.1: Extreme singular values of Gaussian random matrices vershynin
  • Lemma 2.2: Weyl's inequality HJ12
  • Theorem 3.1
  • Lemma 3.2
  • proof
  • Theorem 3.3
  • proof
  • proof : Proof of Theorem \ref{['thm:lower_bound']}
  • Definition 3.4: CEM+15
  • Lemma 3.5: CEM+15MM20
  • ...and 17 more