Efficient reductions from a Gaussian source with applications to statistical-computational tradeoffs

Mengqi Lou; Guy Bresler; Ashwin Pananjady

Efficient reductions from a Gaussian source with applications to statistical-computational tradeoffs

Mengqi Lou, Guy Bresler, Ashwin Pananjady

TL;DR

This work develops a general framework to convert a single Gaussian observation into a target distribution via polynomial-time reductions, quantified by TV deficiency bounds that hold uniformly over a parameter set. Central to the method is a signed-kernel construction and a rejection-sampling step that yields a Markov kernel mapping the Gaussian source to broad target classes, including non-Gaussian locations and Gaussian means transformed by a nonlinear link. The reductions enable universal hardness transfers, proving, among other results, that Tensor PCA hardness persists under non-Gaussian noise, that a k^2 computational gap arises for even-link sparse GLMs, and that Rank1SubMat and PlantSubMat share a pointwise-hardness-preserving reduction. Collectively, the results establish robust connections across canonical high-dimensional problems, linking statistical-to-computational barriers through a unified Gaussian-source reduction technique. The framework offers a path to transferring hardness results beyond Gaussian noise and to broader model classes, with polylogarithmic inflation of SNR in the target and polynomial-time computability, broadening the scope of universality in computational barriers for statistical problems.

Abstract

Given a single observation from a Gaussian distribution with unknown mean $θ$, we design computationally efficient procedures that can approximately generate an observation from a different target distribution $Q_θ$ uniformly for all $θ$ in a parameter set. We leverage our technique to establish reduction-based computational lower bounds for several canonical high-dimensional statistical models under widely-believed conjectures in average-case complexity. In particular, we cover cases in which: 1. $Q_θ$ is a general location model with non-Gaussian distribution, including both light-tailed examples (e.g., generalized normal distributions) and heavy-tailed ones (e.g., Student's $t$-distributions). As a consequence, we show that computational lower bounds proved for spiked tensor PCA with Gaussian noise are universal, in that they extend to other non-Gaussian noise distributions within our class. 2. $Q_θ$ is a normal distribution with mean $f(θ)$ for a general, smooth, and nonlinear link function $f:\mathbb{R} \rightarrow \mathbb{R}$. Using this reduction, we construct a reduction from symmetric mixtures of linear regressions to generalized linear models with link function $f$, and establish computational lower bounds for solving the $k$-sparse generalized linear model when $f$ is an even function. This result constitutes the first reduction-based confirmation of a $k$-to-$k^2$ statistical-to-computational gap in $k$-sparse phase retrieval, resolving a conjecture posed by Cai et al. (2016). As a second application, we construct a reduction from the sparse rank-1 submatrix model to the planted submatrix model, establishing a pointwise correspondence between the phase diagrams of the two models that faithfully preserves regions of computational hardness and tractability.

Efficient reductions from a Gaussian source with applications to statistical-computational tradeoffs

TL;DR

Abstract

Given a single observation from a Gaussian distribution with unknown mean

, we design computationally efficient procedures that can approximately generate an observation from a different target distribution

uniformly for all

in a parameter set. We leverage our technique to establish reduction-based computational lower bounds for several canonical high-dimensional statistical models under widely-believed conjectures in average-case complexity. In particular, we cover cases in which: 1.

is a general location model with non-Gaussian distribution, including both light-tailed examples (e.g., generalized normal distributions) and heavy-tailed ones (e.g., Student's

-distributions). As a consequence, we show that computational lower bounds proved for spiked tensor PCA with Gaussian noise are universal, in that they extend to other non-Gaussian noise distributions within our class. 2.

is a normal distribution with mean

for a general, smooth, and nonlinear link function

. Using this reduction, we construct a reduction from symmetric mixtures of linear regressions to generalized linear models with link function

, and establish computational lower bounds for solving the

-sparse generalized linear model when

is an even function. This result constitutes the first reduction-based confirmation of a

-to-

statistical-to-computational gap in

-sparse phase retrieval, resolving a conjecture posed by Cai et al. (2016). As a second application, we construct a reduction from the sparse rank-1 submatrix model to the planted submatrix model, establishing a pointwise correspondence between the phase diagrams of the two models that faithfully preserves regions of computational hardness and tractability.

Efficient reductions from a Gaussian source with applications to statistical-computational tradeoffs

TL;DR

Abstract

Efficient reductions from a Gaussian source with applications to statistical-computational tradeoffs

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (37)