List-Decodable Sparse Mean Estimation via Difference-of-Pairs Filtering

Ilias Diakonikolas; Daniel M. Kane; Sushrut Karmalkar; Ankit Pensia; Thanasis Pittas

List-Decodable Sparse Mean Estimation via Difference-of-Pairs Filtering

Ilias Diakonikolas, Daniel M. Kane, Sushrut Karmalkar, Ankit Pensia, Thanasis Pittas

TL;DR

This work addresses list-decodable sparse mean estimation under adversarial outliers, where an $(1-α)$-corrupted dataset contains an unknown $k$-sparse mean $μ$ with $loor{αm}$ inliers. It introduces a conceptually simple difference-of-pairs filtering framework, combined with Sum-of-Squares certified moment bounds in $k$-sparse directions, to obtain computationally efficient algorithms. The main result shows that with $m=(t k \log n)^{O(t)} \max(1,M^{-2})/α$ samples, the algorithm runs in time $\mathrm{poly}(m n^t)$ and returns $\hat{μ}$ with error $\|\hat{μ}-μ\|_2 = O_t\left( M^{1/t}/α^{O(1)/t} \right)$ with probability $\Omega(α)$; for Gaussian inliers, it achieves the information-theoretic rate $Θ(\sqrt{\log(1/α)})$ with quasi-polynomial resources. The approach extends to provide SQ and low-degree polynomial lower bounds, illustrating an intrinsic information-computation gap in this setting. Overall, the method yields a simpler, faster route to robust sparse mean estimation and informs the limits of efficient algorithms for list-decodable problems with structured sparsity.

Abstract

We study the problem of list-decodable sparse mean estimation. Specifically, for a parameter $α\in (0, 1/2)$, we are given $m$ points in $\mathbb{R}^n$, $\lfloor αm \rfloor$ of which are i.i.d. samples from a distribution $D$ with unknown $k$-sparse mean $μ$. No assumptions are made on the remaining points, which form the majority of the dataset. The goal is to return a small list of candidates containing a vector $\widehat μ$ such that $\| \widehat μ- μ\|_2$ is small. Prior work had studied the problem of list-decodable mean estimation in the dense setting. In this work, we develop a novel, conceptually simpler technique for list-decodable mean estimation. As the main application of our approach, we provide the first sample and computationally efficient algorithm for list-decodable sparse mean estimation. In particular, for distributions with "certifiably bounded" $t$-th moments in $k$-sparse directions and sufficiently light tails, our algorithm achieves error of $(1/α)^{O(1/t)}$ with sample complexity $m = (k\log(n))^{O(t)}/α$ and running time $\mathrm{poly}(mn^t)$. For the special case of Gaussian inliers, our algorithm achieves the optimal error guarantee of $Θ(\sqrt{\log(1/α)})$ with quasi-polynomial sample and computational complexity. We complement our upper bounds with nearly-matching statistical query and low-degree polynomial testing lower bounds.

List-Decodable Sparse Mean Estimation via Difference-of-Pairs Filtering

TL;DR

This work addresses list-decodable sparse mean estimation under adversarial outliers, where an

-corrupted dataset contains an unknown

-sparse mean

with

inliers. It introduces a conceptually simple difference-of-pairs filtering framework, combined with Sum-of-Squares certified moment bounds in

-sparse directions, to obtain computationally efficient algorithms. The main result shows that with

samples, the algorithm runs in time

and returns

with error

with probability

; for Gaussian inliers, it achieves the information-theoretic rate

with quasi-polynomial resources. The approach extends to provide SQ and low-degree polynomial lower bounds, illustrating an intrinsic information-computation gap in this setting. Overall, the method yields a simpler, faster route to robust sparse mean estimation and informs the limits of efficient algorithms for list-decodable problems with structured sparsity.

Abstract

We study the problem of list-decodable sparse mean estimation. Specifically, for a parameter

, we are given

points in

of which are i.i.d. samples from a distribution

with unknown

-sparse mean

. No assumptions are made on the remaining points, which form the majority of the dataset. The goal is to return a small list of candidates containing a vector

such that

is small. Prior work had studied the problem of list-decodable mean estimation in the dense setting. In this work, we develop a novel, conceptually simpler technique for list-decodable mean estimation. As the main application of our approach, we provide the first sample and computationally efficient algorithm for list-decodable sparse mean estimation. In particular, for distributions with "certifiably bounded"

-th moments in

-sparse directions and sufficiently light tails, our algorithm achieves error of

with sample complexity

and running time

. For the special case of Gaussian inliers, our algorithm achieves the optimal error guarantee of

with quasi-polynomial sample and computational complexity. We complement our upper bounds with nearly-matching statistical query and low-degree polynomial testing lower bounds.

Paper Structure (20 sections, 15 theorems, 22 equations, 4 algorithms)

This paper contains 20 sections, 15 theorems, 22 equations, 4 algorithms.

Introduction
Related Work
Our Results
Overview of Techniques
Prior Work
Novel List-Decoding Mean Estimation Algorithm
Preliminaries
Basic Notation
Linear Algebra Notation
Probability Notation
SoS Preliminaries
Certifiably Bounded Moments in $k$-Sparse Directions
Main Result: Proof of \ref{['thm:LDL-sparse']}
The SoS-based Filter
Identifying a Subset of Samples with Bounded Moments
...and 5 more sections

Key Result

Theorem 1.2

Let $t$ be an integer power of two. Let $D$ be a distribution over $\mathbb{R}^n$ with $k$-sparse mean $\mu$. Suppose that $D$ has $t$-th moments $d$-certifiably bounded in $k$-sparse directions by $M$ for some $d=O(t)$ (cf. def:bounded-moments-k-sparse) and subexponential tails in the standard basi

Theorems & Definitions (38)

Definition 1.1: List-Decodable Learning
Theorem 1.2: List-Decodable Sparse Mean Estimation
Theorem 1.3: SQ Lower Bound, Informal
Definition 2.0: (2, $k$)-norm
Definition 2.1: Symbolic Polynomial
Definition 2.2: SoS Proof
Definition 2.3: Pseudoexpectation
Theorem 2.4: The SoS Algorithm Sho87lasserre2001newnesterov2000squaredbomze1998standard
Definition 2.8: $(M,t,d)$-Certifiably Bounded Moments in $k$-Sparse Directions
Lemma 2.8: DKKPP22
...and 28 more

List-Decodable Sparse Mean Estimation via Difference-of-Pairs Filtering

TL;DR

Abstract

List-Decodable Sparse Mean Estimation via Difference-of-Pairs Filtering

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (38)