Table of Contents
Fetching ...

Lower Bounds on Adaptive Sensing for Matrix Recovery

Praneeth Kacham, David P Woodruff

TL;DR

The paper establishes information-theoretic lower bounds for adaptive sensing in low-rank matrix recovery, showing that even with multiple adaptive rounds, the total measurement budget cannot be reduced below near-reading the entire matrix in many regimes. By embedding a rank-$r$ Gaussian spike into a Gaussian background and leveraging Bayes risk bounds and Gaussian rotational invariance, it proves that the number of rounds $t$ satisfies $t = ilde{ ext{Ω}}ig( rac{ ext{log}(n^2/k)}{ ext{log} ext{log} n}ig)$ for $k$ measurements per round, implying $ ext{Ω}(n^2)$ total measurements when rounds are sublogarithmic. The results extend to tensor recovery and yield a broad rounds-vs-measurements toolbox for numerous numerical linear algebra tasks (spectral/Frobenius/Schatten/Ky-Fan; reduced rank regression; singular vector approximation) under general linear measurements with Gaussian noise. Conceptually, the work shows that adaptivity provides limited leverage over prior non-adaptive bounds in this sensing model, and it connects to existing upper-bound algorithms by demonstrating near-optimal round complexities in many well-conditioned scenarios.

Abstract

We study lower bounds on adaptive sensing algorithms for recovering low rank matrices using linear measurements. Given an $n \times n$ matrix $A$, a general linear measurement $S(A)$, for an $n \times n$ matrix $S$, is just the inner product of $S$ and $A$, each treated as $n^2$-dimensional vectors. By performing as few linear measurements as possible on a rank-$r$ matrix $A$, we hope to construct a matrix $\hat{A}$ that satisfies $\|A - \hat{A}\|_F^2 \le c\|A\|_F^2$, for a small constant $c$. It is commonly assumed that when measuring $A$ with $S$, the response is corrupted with an independent Gaussian random variable of mean $0$ and variance $σ^2$. Candés and Plan study non-adaptive algorithms for low rank matrix recovery using random linear measurements. At a certain noise level, it is known that their non-adaptive algorithms need to perform $Ω(n^2)$ measurements, which amounts to reading the entire matrix. An important question is whether adaptivity helps in decreasing the overall number of measurements. We show that any adaptive algorithm that uses $k$ linear measurements in each round and outputs an approximation to the underlying matrix with probability $\ge 9/10$ must run for $t = Ω(\log(n^2/k)/\log\log n)$ rounds showing that any adaptive algorithm which uses $n^{2-β}$ linear measurements in each round must run for $Ω(\log n/\log\log n)$ rounds to compute a reconstruction with probability $\ge 9/10$. Hence any adaptive algorithm that has $o(\log n/\log\log n)$ rounds must use an overall $Ω(n^2)$ linear measurements. Our techniques also readily extend to obtain lower bounds on adaptive algorithms for tensor recovery and obtain measurement-vs-rounds trade-off for many sensing problems in numerical linear algebra, such as spectral norm low rank approximation, Frobenius norm low rank approximation, singular vector approximation, and more.

Lower Bounds on Adaptive Sensing for Matrix Recovery

TL;DR

The paper establishes information-theoretic lower bounds for adaptive sensing in low-rank matrix recovery, showing that even with multiple adaptive rounds, the total measurement budget cannot be reduced below near-reading the entire matrix in many regimes. By embedding a rank- Gaussian spike into a Gaussian background and leveraging Bayes risk bounds and Gaussian rotational invariance, it proves that the number of rounds satisfies for measurements per round, implying total measurements when rounds are sublogarithmic. The results extend to tensor recovery and yield a broad rounds-vs-measurements toolbox for numerous numerical linear algebra tasks (spectral/Frobenius/Schatten/Ky-Fan; reduced rank regression; singular vector approximation) under general linear measurements with Gaussian noise. Conceptually, the work shows that adaptivity provides limited leverage over prior non-adaptive bounds in this sensing model, and it connects to existing upper-bound algorithms by demonstrating near-optimal round complexities in many well-conditioned scenarios.

Abstract

We study lower bounds on adaptive sensing algorithms for recovering low rank matrices using linear measurements. Given an matrix , a general linear measurement , for an matrix , is just the inner product of and , each treated as -dimensional vectors. By performing as few linear measurements as possible on a rank- matrix , we hope to construct a matrix that satisfies , for a small constant . It is commonly assumed that when measuring with , the response is corrupted with an independent Gaussian random variable of mean and variance . Candés and Plan study non-adaptive algorithms for low rank matrix recovery using random linear measurements. At a certain noise level, it is known that their non-adaptive algorithms need to perform measurements, which amounts to reading the entire matrix. An important question is whether adaptivity helps in decreasing the overall number of measurements. We show that any adaptive algorithm that uses linear measurements in each round and outputs an approximation to the underlying matrix with probability must run for rounds showing that any adaptive algorithm which uses linear measurements in each round must run for rounds to compute a reconstruction with probability . Hence any adaptive algorithm that has rounds must use an overall linear measurements. Our techniques also readily extend to obtain lower bounds on adaptive algorithms for tensor recovery and obtain measurement-vs-rounds trade-off for many sensing problems in numerical linear algebra, such as spectral norm low rank approximation, Frobenius norm low rank approximation, singular vector approximation, and more.
Paper Structure (31 sections, 24 theorems, 117 equations, 1 table)

This paper contains 31 sections, 24 theorems, 117 equations, 1 table.

Key Result

Theorem 1.1

There exists a constant $c$ such that any randomized algorithm which makes $k \ge nr$ noisy linear measurements of an arbitrary rank-$r$ matrix $A$ with $\|A\|_{\mathsf{F}}^2 = \Theta(nr)$ in each of $t$ rounds, and outputs an estimate $\hat{A}$ satisfying $\|A - \hat{A}\|_{\mathsf{F}}^2 \le c\|A\|_

Theorems & Definitions (42)

  • Theorem 1.1
  • Theorem 2.1
  • Theorem 2.2
  • Lemma 2.3
  • proof
  • Lemma 2.4: Chain Rule
  • Lemma 2.5: KL-divergence between Gaussians, folklore
  • Lemma 3.1
  • Lemma 3.2
  • proof
  • ...and 32 more