Table of Contents
Fetching ...

Uniform mean estimation via generic chaining

Daniel Bartl, Shahar Mendelson

TL;DR

The paper tackles uniform estimation of the mean $\mathbb{E}\,u(f(X))$ over a rich class $F$ of potentially heavy-tailed functions. It introduces an optimal uniform mean estimator $\Psi$ that combines an optimal univariate mean estimator with Talagrand’s generic chaining, producing a subgaussian-type error bound $\sup_{f\in F} |\Psi - \mathbb{E}\,u(f)| \lesssim R(F) \frac{\mathbb{E} \sup_{f\in F} G_f}{\sqrt{N}}$ under minimal assumptions. Key contributions include the precise assumptions and definitions (isomorphic distance $\mathbbm{\rho}$, $R(F)$, $d_F$, and $D^*(F)$), a complete proof framework, and two major applications: a geometric setting for isotropic log-concave measures and a robust covariance-estimation scenario with adversarial corruption. The results yield optimal uniform mean estimation beyond light-tailed settings and extend to corrupted data, with explicit bounds and practical implications for high-dimensional probability and statistics.

Abstract

We introduce an empirical functional $Ψ$ that is an optimal uniform mean estimator: Let $F\subset L_2(μ)$ be a class of mean zero functions, $u$ is a real valued function, and $X_1,\dots,X_N$ are independent, distributed according to $μ$. We show that under minimal assumptions, with $μ^{\otimes N}$ exponentially high probability, \[ \sup_{f\in F} |Ψ(X_1,\dots,X_N,f) - \mathbb{E} u(f(X))| \leq c R(F) \frac{ \mathbb{E} \sup_{f\in F } |G_f| }{\sqrt N}, \] where $(G_f)_{f\in F}$ is the gaussian processes indexed by $F$ and $R(F)$ is an appropriate notion of `diameter' of the class $\{u(f(X)) : f\in F\}$. The fact that such a bound is possible is surprising, and it leads to the solution of various key problems in high dimensional probability and high dimensional statistics. The construction is based on combining Talagrand's generic chaining mechanism with optimal mean estimation procedures for a single real-valued random variable.

Uniform mean estimation via generic chaining

TL;DR

The paper tackles uniform estimation of the mean over a rich class of potentially heavy-tailed functions. It introduces an optimal uniform mean estimator that combines an optimal univariate mean estimator with Talagrand’s generic chaining, producing a subgaussian-type error bound under minimal assumptions. Key contributions include the precise assumptions and definitions (isomorphic distance , , , and ), a complete proof framework, and two major applications: a geometric setting for isotropic log-concave measures and a robust covariance-estimation scenario with adversarial corruption. The results yield optimal uniform mean estimation beyond light-tailed settings and extend to corrupted data, with explicit bounds and practical implications for high-dimensional probability and statistics.

Abstract

We introduce an empirical functional that is an optimal uniform mean estimator: Let be a class of mean zero functions, is a real valued function, and are independent, distributed according to . We show that under minimal assumptions, with exponentially high probability, where is the gaussian processes indexed by and is an appropriate notion of `diameter' of the class . The fact that such a bound is possible is surprising, and it leads to the solution of various key problems in high dimensional probability and high dimensional statistics. The construction is based on combining Talagrand's generic chaining mechanism with optimal mean estimation procedures for a single real-valued random variable.

Paper Structure

This paper contains 8 sections, 8 theorems, 79 equations.

Key Result

Theorem 1.8

If Assumptions ass:iso-distance-orace and ass:other are satisfied, there is an absolute constant $c_1$ and constants $c_2,c_3$ that depend on $\kappa$ and $L$ for which the following holds. For every $\delta>\exp(-c_1N)$, there is a procedure $\Psi_\delta$ which satisfies that with probability at le In particular, with probability at least $1-\exp(-c_1\min\{D^*(F),N\})$,

Theorems & Definitions (20)

  • Remark 1.4
  • Remark 1.6
  • Remark 1.7
  • Theorem 1.8
  • Example 1.9
  • Remark 1.10
  • Theorem 2.1
  • Remark 2.2
  • Definition 2.3
  • Theorem 2.4
  • ...and 10 more