Robust Estimation in metric spaces: Achieving Exponential Concentration with a Fréchet Median
Jakwang Kim, Jiyoung Park, Anirban Bhattacharya
TL;DR
This work extends robust Fréchet-median-based estimation to CAT($\kappa$) spaces, enabling exponential concentration under heavy tails. By introducing the Fréchet median of estimators (FMoE), the authors show how weakly concentrating estimators can be boosted to strong, exponential concentration with a bound of the form $\mathbb{P}[d(\hat{\theta}_{FMoE},\theta) > C_{\alpha}\epsilon] \le \exp(-k\psi(\alpha,p))$, where $C_{\alpha}$ and $\psi(\alpha,p)$ are explicitly defined. The framework is applied to Fréchet mean estimation in NPC spaces (via FMoM) and to covariance estimation on SPD manifolds under $d_{AI}$ and $d_{BW}$, with detailed concentration analyses and practical implementation guidance. Empirical results on metric trees and SPD examples corroborate the theoretical gains, illustrating substantial improvements in heavy-tailed settings and offering a tractable path for robust estimation in general metric spaces.
Abstract
There is growing interest in developing statistical estimators that achieve exponential concentration around a population target even when the data distribution has heavier than exponential tails. More recent activity has focused on extending such ideas beyond Euclidean spaces to Hilbert spaces and Riemannian manifolds. In this work, we show that such exponential concentration in presence of heavy tails can be achieved over a broader class of parameter spaces called CAT($κ$) spaces, a very general metric space equipped with the minimal essential geometric structure for our purpose, while being sufficiently broad to encompass most typical examples encountered in statistics and machine learning. The key technique is to develop and exploit a general concentration bound for the Fréchet median in CAT($κ$) spaces. We illustrate our theory through a number of examples, and provide empirical support through simulation studies.
