Robust Sparse Estimation for Gaussians with Optimal Error under Huber Contamination
Ilias Diakonikolas, Daniel M. Kane, Sushrut Karmalkar, Ankit Pensia, Thanasis Pittas
TL;DR
This work delivers the first computationally efficient procedures for Gaussian sparse estimation under Huber contamination with optimal $O(\epsilon)$ error across mean estimation, sparse PCA, and sparse linear regression. The authors introduce a novel sparse-oriented multidimensional filtering framework that iteratively identifies sparse directions, down-weights outliers, and decomposes the problem into a small coordinate set handled densely, enabling polynomial-time, sample-efficient recovery. They also establish a rigorous certificate-based analysis tying covariance control to tight mean guarantees, and provide reductions that connect robust sparse PCA and regression to robust sparse mean estimation. These results close gaps from previous suboptimal-accuracy methods and set a new baseline for practical robust sparse estimation in high dimensions, with clear avenues for unknown-covariance extensions. Overall, the paper advances the theory and practice of robust statistics in sparse, high-dimensional Gaussian settings, delivering near-optimal error rates with feasible computation.
Abstract
We study Gaussian sparse estimation tasks in Huber's contamination model with a focus on mean estimation, PCA, and linear regression. For each of these tasks, we give the first sample and computationally efficient robust estimators with optimal error guarantees, within constant factors. All prior efficient algorithms for these tasks incur quantitatively suboptimal error. Concretely, for Gaussian robust $k$-sparse mean estimation on $\mathbb{R}^d$ with corruption rate $ε>0$, our algorithm has sample complexity $(k^2/ε^2)\mathrm{polylog}(d/ε)$, runs in sample polynomial time, and approximates the target mean within $\ell_2$-error $O(ε)$. Previous efficient algorithms inherently incur error $Ω(ε\sqrt{\log(1/ε)})$. At the technical level, we develop a novel multidimensional filtering method in the sparse regime that may find other applications.
