Sparse recovery from quadratic equations, part II: hardness and incoherence

Augustin Cosse

Sparse recovery from quadratic equations, part II: hardness and incoherence

Augustin Cosse

TL;DR

The work analyzes sparse recovery from quadratic measurements, addressing the square-root bottleneck by showing that, when the incoherence parameter $\mu_0$ is favorable, recovery is possible from $m$ measurements with $m \gtrsim k\mu_0^{-2} \vee \mu_0^{-4}$ (up to log factors) via empirical loss minimization; this extends to phase retrieval with subexponential sensing. It provides a computationally practical initialization and convergence guarantees for SPF and a truncated gradient descent variant, delivering explicit nonasymptotic error bounds in the presence of noise. Beyond algorithmic recovery, the paper develops a topological hardness certificate via the Overlap Gap Property in the fully incoherent and under-sampled regime ($m=o(k^2)$), supported by first/second moment analyses and moderate-deviation tails of chi-squared sums. Collectively, the results illuminate both when efficient algorithms can succeed and when topological barriers emerge, clarifying the role of incoherence, overparametrization, and initialization in quadratic-sensing problems. The findings have implications for structured sparse recovery and phase retrieval tasks in high dimensions, guiding algorithm design and setting expectations for sample complexity in practical settings.

Abstract

We study the square root bottleneck in the recovery of sparse vectors from quadratic equations. It is acknowledged that a sparse vector $ \mathbf x_0\in \mathbb{R}^n$, $\| \mathbf x_0\|_0 = k$ can in theory be recovered from as few as $O(k)$ generic quadratic equations but no polynomial time algorithm is known for this task unless $m = Ω(k^2)$. This bottleneck was in fact shown in previous work to be essentially related to the initialization of descent algorithms. Starting such algorithms sufficiently close to the planted signal is known to imply convergence to this signal. In this paper, we show that as soon as $m\gtrsim μ_0^{-2}k \vee μ_0^{-4}$ (up to log factors) where $μ_0 = \| \mathbf x_0\|_\infty/\| \mathbf x_0\|_2$, it is possible to recover a $k$-sparse vector $ \mathbf x_0\in \mathbb{R}^n$ from $m$ quadratic equations of the form $\langle \mathbf A_i, \mathbf x \mathbf x^\intercal\rangle = \langle \mathbf A_i, \mathbf x_0 \mathbf x_0^\intercal\rangle + \varepsilon_i $ by minimizing the classical empirical loss. The proof idea carries over to the phase retrieval setting for which it provides an original initialization that matches the current optimal sample complexity (see e.g. [Cai 2023]). In the maximally incoherent regime $μ_0^{-2}=k$, and for $m=o(k^2)$ we provide evidence for topological hardness by showing that a property known as the Overlap Gap Property (OGP), which originated in spin glass theory and is conjectured to be indicative of algorithmic intractability when optimizing over random structures, holds for a particular level of overparametrization. The key ingredient of the proof is a lower bound on the tail of chi-squared random variables which follows from the theory of moderate deviations.

Sparse recovery from quadratic equations, part II: hardness and incoherence

TL;DR

The work analyzes sparse recovery from quadratic measurements, addressing the square-root bottleneck by showing that, when the incoherence parameter

is favorable, recovery is possible from

measurements with

(up to log factors) via empirical loss minimization; this extends to phase retrieval with subexponential sensing. It provides a computationally practical initialization and convergence guarantees for SPF and a truncated gradient descent variant, delivering explicit nonasymptotic error bounds in the presence of noise. Beyond algorithmic recovery, the paper develops a topological hardness certificate via the Overlap Gap Property in the fully incoherent and under-sampled regime (

), supported by first/second moment analyses and moderate-deviation tails of chi-squared sums. Collectively, the results illuminate both when efficient algorithms can succeed and when topological barriers emerge, clarifying the role of incoherence, overparametrization, and initialization in quadratic-sensing problems. The findings have implications for structured sparse recovery and phase retrieval tasks in high dimensions, guiding algorithm design and setting expectations for sample complexity in practical settings.

Abstract

We study the square root bottleneck in the recovery of sparse vectors from quadratic equations. It is acknowledged that a sparse vector

can in theory be recovered from as few as

generic quadratic equations but no polynomial time algorithm is known for this task unless

. This bottleneck was in fact shown in previous work to be essentially related to the initialization of descent algorithms. Starting such algorithms sufficiently close to the planted signal is known to imply convergence to this signal. In this paper, we show that as soon as

(up to log factors) where

, it is possible to recover a

-sparse vector

from

quadratic equations of the form

by minimizing the classical empirical loss. The proof idea carries over to the phase retrieval setting for which it provides an original initialization that matches the current optimal sample complexity (see e.g. [Cai 2023]). In the maximally incoherent regime

, and for

we provide evidence for topological hardness by showing that a property known as the Overlap Gap Property (OGP), which originated in spin glass theory and is conjectured to be indicative of algorithmic intractability when optimizing over random structures, holds for a particular level of overparametrization. The key ingredient of the proof is a lower bound on the tail of chi-squared random variables which follows from the theory of moderate deviations.

Paper Structure (15 sections, 26 theorems, 303 equations, 1 figure)

This paper contains 15 sections, 26 theorems, 303 equations, 1 figure.

Introduction
Hardness and incoherence
Topological Complexity Barriers
Proof of Propositions \ref{['propositionIncoherenceInitializationAndRecovery']} and \ref{['propositionTGD']}
Initialization
Convergence of \ref{['SPF']}
Convergence of \ref{['TGD']} (Non asymptotic)
Proof of Proposition \ref{['propositionIncoherenceInitializationAndPhaseRetrieval']}
Initialization
Overlap Gap Property
Lower bound (first moments method)
Proof of Proposition \ref{['analysisFirstMomentsCurve']}. Analysis of the first moments curve
Overparametrization
Proof of Proposition \ref{['upperBoundSecondMoments']} (Second moments method)
Auxilliary results

Key Result

Proposition 1

Let $\boldsymbol x_0$ denote any $k$-sparse vector from $\mathbb{S}^n$ with incoherence defined as $\mu_0 = \|\boldsymbol x_0\|_{\infty}/\|\boldsymbol x_0\|_2$ and $b_i = \langle \boldsymbol A_i, \boldsymbol x_0\boldsymbol x_0^\intercal\rangle +\varepsilon_i$ be $m$ quadratic samples in which the se

Figures (1)

Figure 1: Representation of the various (conjectured) regimes associated to Problems \ref{['generalProblem00']} and \ref{['generalProblemPR00']}. The improvement of the paper for problem \ref{['generalProblem00']} is shown in light blue.

Theorems & Definitions (30)

Proposition 1
Proposition 2
Proposition 3
Definition 1: $k'$-OGP
Theorem 1
Definition 2
Proposition 4
Proposition 5
Proposition 6
Proposition 7: see Proposition 4.6 in arous2023free
...and 20 more

Sparse recovery from quadratic equations, part II: hardness and incoherence

TL;DR

Abstract

Sparse recovery from quadratic equations, part II: hardness and incoherence

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (30)