Entangled Mean Estimation in High-Dimensions
Ilias Diakonikolas, Daniel M. Kane, Sihan Liu, Thanasis Pittas
TL;DR
This work resolves the challenging problem of high-dimensional entangled mean estimation under the subset-of-signals model by delivering a computationally efficient algorithm that nearly matches the information-theoretic limit. It combines a warm-start tournament, rejection sampling to filter noisy samples, and a recursive dimensionality-reduction scheme that identifies low-variance subspaces and progressively refines the mean estimate. The main result shows that the estimation error decomposes into a one-dimensional term $f(\alpha,N)$ and a sub-Gaussian term $\sqrt{D/(\alpha N)}$, up to polylogarithmic factors, and that the algorithm runs in polynomial time in $D$ and $N$ provided $N \ge \widetilde{\Omega}(D/\alpha)$. This advances our understanding of multivariate entangled mean estimation by matching the 1D lower bounds up to polylog factors and paves the way for robust, scalable estimation in heterogeneous Gaussian settings with bounded subset covariances.
Abstract
We study the task of high-dimensional entangled mean estimation in the subset-of-signals model. Specifically, given $N$ independent random points $x_1,\ldots,x_N$ in $\mathbb{R}^D$ and a parameter $α\in (0, 1)$ such that each $x_i$ is drawn from a Gaussian with mean $μ$ and unknown covariance, and an unknown $α$-fraction of the points have identity-bounded covariances, the goal is to estimate the common mean $μ$. The one-dimensional version of this task has received significant attention in theoretical computer science and statistics over the past decades. Recent work [LY20; CV24] has given near-optimal upper and lower bounds for the one-dimensional setting. On the other hand, our understanding of even the information-theoretic aspects of the multivariate setting has remained limited. In this work, we design a computationally efficient algorithm achieving an information-theoretically near-optimal error. Specifically, we show that the optimal error (up to polylogarithmic factors) is $f(α,N) + \sqrt{D/(αN)}$, where the term $f(α,N)$ is the error of the one-dimensional problem and the second term is the sub-Gaussian error rate. Our algorithmic approach employs an iterative refinement strategy, whereby we progressively learn more accurate approximations $\hat μ$ to $μ$. This is achieved via a novel rejection sampling procedure that removes points significantly deviating from $\hat μ$, as an attempt to filter out unusually noisy samples. A complication that arises is that rejection sampling introduces bias in the distribution of the remaining points. To address this issue, we perform a careful analysis of the bias, develop an iterative dimension-reduction strategy, and employ a novel subroutine inspired by list-decodable learning that leverages the one-dimensional result.
