Table of Contents
Fetching ...

On the distance between mean and geometric median in high dimensions

Richard Schwank, Mathias Drton

TL;DR

This work analyzes the distance between the geometric median and the mean in high-dimensional settings under $M$-dependent component structure. It proves a dimension-dependent contraction: the Euclidean distance $\|\mathbf{m}_p - \boldsymbol{\mu}\|$ decays as $O(p^{-1/2})$, and provides a first-order, componentwise expansion $m_{p,i} = \mu_i - \frac{1}{2\bar{\sigma}^2 p} \sum_{j\in\mathcal{E}(i)} \mathbb{E}[(X_i - \mu_i)(X_j - \mu_j)^2] + o(1/p)$, with $\mathcal{E}(i)$ the neighboring index set. The results rely on the distribution class $\mathcal{D}(M,q,c,\sigma_{\min},C)$ and require $q>3$ for the expansion, along with a finite limit of the average variance $\bar{\sigma}^2$. Simulations across exponential, MA(2), and Pareto data confirm the rates, while highlighting slower convergence under heavy tails; the online Robbins-Monro algorithm is used to approximate the median efficiently. These findings illuminate the bias-robustness trade-off in high dimensions and connect to classical asymptotic equivalences for isotropic Gaussian settings.

Abstract

The geometric median, a notion of center for multivariate distributions, has gained recent attention in robust statistics and machine learning. Although conceptually distinct from the mean (i.e., expectation), we demonstrate that both are very close in high dimensions when the dependence between the distribution components is suitably controlled. Concretely, we find an upper bound on the distance that vanishes with the dimension asymptotically, and derive a rate-matching first order expansion of the geometric median components. Simulations illustrate and confirm our results.

On the distance between mean and geometric median in high dimensions

TL;DR

This work analyzes the distance between the geometric median and the mean in high-dimensional settings under -dependent component structure. It proves a dimension-dependent contraction: the Euclidean distance decays as , and provides a first-order, componentwise expansion , with the neighboring index set. The results rely on the distribution class and require for the expansion, along with a finite limit of the average variance . Simulations across exponential, MA(2), and Pareto data confirm the rates, while highlighting slower convergence under heavy tails; the online Robbins-Monro algorithm is used to approximate the median efficiently. These findings illuminate the bias-robustness trade-off in high dimensions and connect to classical asymptotic equivalences for isotropic Gaussian settings.

Abstract

The geometric median, a notion of center for multivariate distributions, has gained recent attention in robust statistics and machine learning. Although conceptually distinct from the mean (i.e., expectation), we demonstrate that both are very close in high dimensions when the dependence between the distribution components is suitably controlled. Concretely, we find an upper bound on the distance that vanishes with the dimension asymptotically, and derive a rate-matching first order expansion of the geometric median components. Simulations illustrate and confirm our results.

Paper Structure

This paper contains 7 sections, 16 theorems, 83 equations, 2 figures, 1 table.

Key Result

Lemma 2.0

When $(X_i)_i\in \mathcal{D}(M, q, c, \sigma_{\min}, C)$, then $\|\mathbf{m}_p - \boldsymbol{\mu}\|\leq \sqrt{Cp}$.

Figures (2)

  • Figure 1: (Left) Euclidean distance $\|\hat{\mathbf{m}}_p - \boldsymbol{\mu}\|$. (Right) Residuals from Theorem \ref{['theoremComponentExpansion']} for $i=1$; see equation \ref{['eq:sim_residuals']}. Black reference lines are $\mathcal{O}(\tfrac{1}{\sqrt{p}})$ (left) and $\mathcal{O}(\tfrac{1}{p})$ (right).
  • Figure 2: A closer look at the Pareto(3.1) simulations. Left: $\|\hat{\mathbf{m}}_p - \boldsymbol{\mu}\|$ (same as left side of Figure \ref{['fig:Simulation']}) in pink versus $f(p):=\sqrt{p}\cdot|\tfrac{-19.52}{p} + \tfrac{20.16}{p^{1.03}}|$ in black. Indeed, $f$ explains the decay of $\|\hat{\mathbf{m}}_p - \boldsymbol{\mu}\|$ well. Right: Plot $f(p)$ from $100$ to $10^5$ in black and compare with $\mathcal{O}(\tfrac{1}{\sqrt{p}})$ in blue. Even for $p$ so large, $f$ is still not fully parallel to $\mathcal{O}(\tfrac{1}{\sqrt{p}})$.

Theorems & Definitions (29)

  • Definition 2.0
  • Lemma 2.0
  • Lemma 2.0
  • Lemma 2.0
  • Theorem 2.1
  • Theorem 2.2
  • Lemma A.1
  • proof
  • Lemma A.2
  • proof
  • ...and 19 more