Optimal Sketching for Residual Error Estimation for Matrix and Vector Norms

Yi Li; Honghao Lin; David P. Woodruff

Optimal Sketching for Residual Error Estimation for Matrix and Vector Norms

Yi Li, Honghao Lin, David P. Woodruff

TL;DR

The paper addresses fast, memory-efficient residual error estimation for matrix and vector norms using oblivious bilinear sketches. It proves a tight $Θ(k^2/ε^4)$ bound on the sketch size for the matrix residual problem and provides matching lower bounds, while enabling sparse, fast-to-update constructions via PCP-based methods. It also extends the framework to $F_p$ residuals with $p>2$ by giving near-optimal space for sparse recovery, including a matching lower bound, and demonstrates practical gains through experiments that show substantial speedups over prior approaches. These results yield a practical toolkit for deciding when to invest in expensive low-rank or sparse recovery computations in large-scale or streaming settings. Overall, the work advances the theory and practice of residual-error-aware sketching for both matrix and vector norms with strong theoretical guarantees and empirical validation.

Abstract

We study the problem of residual error estimation for matrix and vector norms using a linear sketch. Such estimates can be used, for example, to quickly assess how useful a more expensive low-rank approximation computation will be. The matrix case concerns the Frobenius norm and the task is to approximate the $k$-residual $\|A - A_k\|_F$ of the input matrix $A$ within a $(1+ε)$-factor, where $A_k$ is the optimal rank-$k$ approximation. We provide a tight bound of $Θ(k^2/ε^4)$ on the size of bilinear sketches, which have the form of a matrix product $SAT$. This improves the previous $O(k^2/ε^6)$ upper bound in (Andoni et al. SODA 2013) and gives the first non-trivial lower bound, to the best of our knowledge. In our algorithm, our sketching matrices $S$ and $T$ can both be sparse matrices, allowing for a very fast update time. We demonstrate that this gives a substantial advantage empirically, for roughly the same sketch size and accuracy as in previous work. For the vector case, we consider the $\ell_p$-norm for $p>2$, where the task is to approximate the $k$-residual $\|x - x_k\|_p$ up to a constant factor, where $x_k$ is the optimal $k$-sparse approximation to $x$. Such vector norms are frequently studied in the data stream literature and are useful for finding frequent items or so-called heavy hitters. We establish an upper bound of $O(k^{2/p}n^{1-2/p}\operatorname{poly}(\log n))$ for constant $ε$ on the dimension of a linear sketch for this problem. Our algorithm can be extended to the $\ell_p$ sparse recovery problem with the same sketching dimension, which seems to be the first such bound for $p > 2$. We also show an $Ω(k^{2/p}n^{1-2/p})$ lower bound for the sparse recovery problem, which is tight up to a $\mathrm{poly}(\log n)$ factor.

Optimal Sketching for Residual Error Estimation for Matrix and Vector Norms

TL;DR

The paper addresses fast, memory-efficient residual error estimation for matrix and vector norms using oblivious bilinear sketches. It proves a tight

bound on the sketch size for the matrix residual problem and provides matching lower bounds, while enabling sparse, fast-to-update constructions via PCP-based methods. It also extends the framework to

residuals with

by giving near-optimal space for sparse recovery, including a matching lower bound, and demonstrates practical gains through experiments that show substantial speedups over prior approaches. These results yield a practical toolkit for deciding when to invest in expensive low-rank or sparse recovery computations in large-scale or streaming settings. Overall, the work advances the theory and practice of residual-error-aware sketching for both matrix and vector norms with strong theoretical guarantees and empirical validation.

Abstract

-residual

of the input matrix

within a

-factor, where

is the optimal rank-

approximation. We provide a tight bound of

on the size of bilinear sketches, which have the form of a matrix product

. This improves the previous

upper bound in (Andoni et al. SODA 2013) and gives the first non-trivial lower bound, to the best of our knowledge. In our algorithm, our sketching matrices

and

can both be sparse matrices, allowing for a very fast update time. We demonstrate that this gives a substantial advantage empirically, for roughly the same sketch size and accuracy as in previous work. For the vector case, we consider the

-norm for

, where the task is to approximate the

-residual

up to a constant factor, where

is the optimal

-sparse approximation to

. Such vector norms are frequently studied in the data stream literature and are useful for finding frequent items or so-called heavy hitters. We establish an upper bound of

for constant

on the dimension of a linear sketch for this problem. Our algorithm can be extended to the

sparse recovery problem with the same sketching dimension, which seems to be the first such bound for

. We also show an

lower bound for the sparse recovery problem, which is tight up to a

factor.

Paper Structure (9 sections, 18 theorems, 19 equations, 1 table, 1 algorithm)

This paper contains 9 sections, 18 theorems, 19 equations, 1 table, 1 algorithm.

Introduction
Our Contributions
Preliminaries
Low-Rank Residual Error Estimation
Lower Bound.
Upper Bound.
$F_p$ Residual Error Estimation
$\ell_p$ Sparse Recovery.
Experiments

Key Result

Lemma 2.1

Let $G$ be an $m\times n$ ($m\geq n$) Gaussian random matrix of i.i.d. $N(0,1)$ entries. It holds with probability at least $1-\exp(-ct^2)$ that $\sqrt{m} - C\sqrt{n} - t \leq \sigma_{\min}(G)\leq \sigma_{\max}(G)\leq \sqrt{m} + C\sqrt{n} + t$, where $C,c>0$ are absolute constants.

Theorems & Definitions (27)

Lemma 2.1: Extreme singular values of Gaussian random matrices vershynin
Lemma 2.2: Weyl's inequality HJ12
Theorem 3.1
Lemma 3.2
proof
Theorem 3.3
proof
proof : Proof of Theorem \ref{['thm:lower_bound']}
Definition 3.4: CEM+15
Lemma 3.5: CEM+15MM20
...and 17 more

Optimal Sketching for Residual Error Estimation for Matrix and Vector Norms

TL;DR

Abstract

Optimal Sketching for Residual Error Estimation for Matrix and Vector Norms

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (27)