Lower Bounds on Adaptive Sensing for Matrix Recovery

Praneeth Kacham; David P Woodruff

Lower Bounds on Adaptive Sensing for Matrix Recovery

Praneeth Kacham, David P Woodruff

TL;DR

The paper establishes information-theoretic lower bounds for adaptive sensing in low-rank matrix recovery, showing that even with multiple adaptive rounds, the total measurement budget cannot be reduced below near-reading the entire matrix in many regimes. By embedding a rank-$r$ Gaussian spike into a Gaussian background and leveraging Bayes risk bounds and Gaussian rotational invariance, it proves that the number of rounds $t$ satisfies $t = ilde{ ext{Ω}}ig(rac{ ext{log}(n^2/k)}{ ext{log} ext{log} n}ig)$ for $k$ measurements per round, implying $ ext{Ω}(n^2)$ total measurements when rounds are sublogarithmic. The results extend to tensor recovery and yield a broad rounds-vs-measurements toolbox for numerous numerical linear algebra tasks (spectral/Frobenius/Schatten/Ky-Fan; reduced rank regression; singular vector approximation) under general linear measurements with Gaussian noise. Conceptually, the work shows that adaptivity provides limited leverage over prior non-adaptive bounds in this sensing model, and it connects to existing upper-bound algorithms by demonstrating near-optimal round complexities in many well-conditioned scenarios.

Abstract

We study lower bounds on adaptive sensing algorithms for recovering low rank matrices using linear measurements. Given an $n \times n$ matrix $A$, a general linear measurement $S(A)$, for an $n \times n$ matrix $S$, is just the inner product of $S$ and $A$, each treated as $n^2$-dimensional vectors. By performing as few linear measurements as possible on a rank-$r$ matrix $A$, we hope to construct a matrix $\hat{A}$ that satisfies $\|A - \hat{A}\|_F^2 \le c\|A\|_F^2$, for a small constant $c$. It is commonly assumed that when measuring $A$ with $S$, the response is corrupted with an independent Gaussian random variable of mean $0$ and variance $σ^2$. Candés and Plan study non-adaptive algorithms for low rank matrix recovery using random linear measurements. At a certain noise level, it is known that their non-adaptive algorithms need to perform $Ω(n^2)$ measurements, which amounts to reading the entire matrix. An important question is whether adaptivity helps in decreasing the overall number of measurements. We show that any adaptive algorithm that uses $k$ linear measurements in each round and outputs an approximation to the underlying matrix with probability $\ge 9/10$ must run for $t = Ω(\log(n^2/k)/\log\log n)$ rounds showing that any adaptive algorithm which uses $n^{2-β}$ linear measurements in each round must run for $Ω(\log n/\log\log n)$ rounds to compute a reconstruction with probability $\ge 9/10$. Hence any adaptive algorithm that has $o(\log n/\log\log n)$ rounds must use an overall $Ω(n^2)$ linear measurements. Our techniques also readily extend to obtain lower bounds on adaptive algorithms for tensor recovery and obtain measurement-vs-rounds trade-off for many sensing problems in numerical linear algebra, such as spectral norm low rank approximation, Frobenius norm low rank approximation, singular vector approximation, and more.

Lower Bounds on Adaptive Sensing for Matrix Recovery

TL;DR

Gaussian spike into a Gaussian background and leveraging Bayes risk bounds and Gaussian rotational invariance, it proves that the number of rounds

satisfies

for

measurements per round, implying

total measurements when rounds are sublogarithmic. The results extend to tensor recovery and yield a broad rounds-vs-measurements toolbox for numerous numerical linear algebra tasks (spectral/Frobenius/Schatten/Ky-Fan; reduced rank regression; singular vector approximation) under general linear measurements with Gaussian noise. Conceptually, the work shows that adaptivity provides limited leverage over prior non-adaptive bounds in this sensing model, and it connects to existing upper-bound algorithms by demonstrating near-optimal round complexities in many well-conditioned scenarios.

Abstract

We study lower bounds on adaptive sensing algorithms for recovering low rank matrices using linear measurements. Given an

matrix

, a general linear measurement

, for an

matrix

, is just the inner product of

and

, each treated as

-dimensional vectors. By performing as few linear measurements as possible on a rank-

matrix

, we hope to construct a matrix

that satisfies

, for a small constant

. It is commonly assumed that when measuring

with

, the response is corrupted with an independent Gaussian random variable of mean

and variance

. Candés and Plan study non-adaptive algorithms for low rank matrix recovery using random linear measurements. At a certain noise level, it is known that their non-adaptive algorithms need to perform

measurements, which amounts to reading the entire matrix. An important question is whether adaptivity helps in decreasing the overall number of measurements. We show that any adaptive algorithm that uses

linear measurements in each round and outputs an approximation to the underlying matrix with probability

must run for

rounds showing that any adaptive algorithm which uses

linear measurements in each round must run for

rounds to compute a reconstruction with probability

. Hence any adaptive algorithm that has

rounds must use an overall

linear measurements. Our techniques also readily extend to obtain lower bounds on adaptive algorithms for tensor recovery and obtain measurement-vs-rounds trade-off for many sensing problems in numerical linear algebra, such as spectral norm low rank approximation, Frobenius norm low rank approximation, singular vector approximation, and more.

Paper Structure (31 sections, 24 theorems, 117 equations, 1 table)

This paper contains 31 sections, 24 theorems, 117 equations, 1 table.

Introduction
Our Results
Dependence on noise
Tensor Recovery
Numerical Linear Algebra
Notation
Our Techniques
Related Work
Notation and Preliminaries
Bayes Risk Lower Bounds
$f$-divergence
KL Divergence
Properties of Gaussian random matrices and vectors
Rotational Invariance
Number of Linear Measurements vs Number of Adaptive Rounds
...and 16 more sections

Key Result

Theorem 1.1

There exists a constant $c$ such that any randomized algorithm which makes $k \ge nr$ noisy linear measurements of an arbitrary rank-$r$ matrix $A$ with $\|A\|_{\mathsf{F}}^2 = \Theta(nr)$ in each of $t$ rounds, and outputs an estimate $\hat{A}$ satisfying $\|A - \hat{A}\|_{\mathsf{F}}^2 \le c\|A\|_

Theorems & Definitions (42)

Theorem 1.1
Theorem 2.1
Theorem 2.2
Lemma 2.3
proof
Lemma 2.4: Chain Rule
Lemma 2.5: KL-divergence between Gaussians, folklore
Lemma 3.1
Lemma 3.2
proof
...and 32 more

Lower Bounds on Adaptive Sensing for Matrix Recovery

TL;DR

Abstract

Lower Bounds on Adaptive Sensing for Matrix Recovery

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (42)