Table of Contents
Fetching ...

Efficient Certificates of Anti-Concentration Beyond Gaussians

Ainesh Bakshi, Pravesh Kothari, Goutham Rajendran, Madhur Tulsiani, Aravindan Vijayaraghavan

TL;DR

This work presents a new (and arguably the most natural) formulation for anti- concentration that gives quasi-polynomial time verifiable sum-of-squares certificates of anti-concentration that hold for a wide class of non-Gaussian distributions including anti-concentrated bounded product distributions and uniform distributions over L_{p}$ balls (and their affine transformations).

Abstract

A set of high dimensional points $X=\{x_1, x_2,\ldots, x_n\} \subset R^d$ in isotropic position is said to be $δ$-anti concentrated if for every direction $v$, the fraction of points in $X$ satisfying $|\langle x_i,v \rangle |\leq δ$ is at most $O(δ)$. Motivated by applications to list-decodable learning and clustering, recent works have considered the problem of constructing efficient certificates of anti-concentration in the average case, when the set of points $X$ corresponds to samples from a Gaussian distribution. Their certificates played a crucial role in several subsequent works in algorithmic robust statistics on list-decodable learning and settling the robust learnability of arbitrary Gaussian mixtures, yet remain limited to rotationally invariant distributions. This work presents a new (and arguably the most natural) formulation for anti-concentration. Using this formulation, we give quasi-polynomial time verifiable sum-of-squares certificates of anti-concentration that hold for a wide class of non-Gaussian distributions including anti-concentrated bounded product distributions and uniform distributions over $L_p$ balls (and their affine transformations). Consequently, our method upgrades and extends results in algorithmic robust statistics e.g., list-decodable learning and clustering, to such distributions. Our approach constructs a canonical integer program for anti-concentration and analysis a sum-of-squares relaxation of it, independent of the intended application. We rely on duality and analyze a pseudo-expectation on large subsets of the input points that take a small value in some direction. Our analysis uses the method of polynomial reweightings to reduce the problem to analyzing only analytically dense or sparse directions.

Efficient Certificates of Anti-Concentration Beyond Gaussians

TL;DR

This work presents a new (and arguably the most natural) formulation for anti- concentration that gives quasi-polynomial time verifiable sum-of-squares certificates of anti-concentration that hold for a wide class of non-Gaussian distributions including anti-concentrated bounded product distributions and uniform distributions over L_{p}$ balls (and their affine transformations).

Abstract

A set of high dimensional points in isotropic position is said to be -anti concentrated if for every direction , the fraction of points in satisfying is at most . Motivated by applications to list-decodable learning and clustering, recent works have considered the problem of constructing efficient certificates of anti-concentration in the average case, when the set of points corresponds to samples from a Gaussian distribution. Their certificates played a crucial role in several subsequent works in algorithmic robust statistics on list-decodable learning and settling the robust learnability of arbitrary Gaussian mixtures, yet remain limited to rotationally invariant distributions. This work presents a new (and arguably the most natural) formulation for anti-concentration. Using this formulation, we give quasi-polynomial time verifiable sum-of-squares certificates of anti-concentration that hold for a wide class of non-Gaussian distributions including anti-concentrated bounded product distributions and uniform distributions over balls (and their affine transformations). Consequently, our method upgrades and extends results in algorithmic robust statistics e.g., list-decodable learning and clustering, to such distributions. Our approach constructs a canonical integer program for anti-concentration and analysis a sum-of-squares relaxation of it, independent of the intended application. We rely on duality and analyze a pseudo-expectation on large subsets of the input points that take a small value in some direction. Our analysis uses the method of polynomial reweightings to reduce the problem to analyzing only analytically dense or sparse directions.
Paper Structure (44 sections, 52 theorems, 224 equations, 3 algorithms)

This paper contains 44 sections, 52 theorems, 224 equations, 3 algorithms.

Key Result

Theorem 1.2

Given a point set $X = \Set{x_i }_{i\in [n]}$ in $\mathbbm R^d$ such that $X$ is drawn iid from an unknown affine transformation of a reasonably anti-concentrated distribution $\mathcal{D}$ and $0<\delta <1$, there exists an algorithm that with high probability over the samples, certifies $\max_{w,

Theorems & Definitions (109)

  • Definition 1.1: Reasonably anti-concentrated distributions
  • Theorem 1.2: Efficient Anti-Concentration certificate, Informal version of \ref{['thm:main-anti-concentration-thm']}
  • Remark 1.3: Comparison to prior work
  • Theorem 1.4: Spectrally Separated Mixture, Informal version of \ref{['thm:clusterin-2-spectrally-separated']}
  • Theorem 1.5: Clustering Mixtures, Informal version of \ref{['thm:main-clusterin-theorem']}
  • Remark 1.6: Outlier-robust algorithms
  • Theorem 1.7: List-Decodable Regression, Informal version of \ref{['thm:list-decodable-regression']}
  • Definition 2.1: Pseudo-distribution
  • Definition 2.2: Pseudo-expectation
  • Definition 2.3: Constrained pseudo-distributions
  • ...and 99 more