Finite sample bounds for barycenter estimation in geodesic spaces
Victor-Emmanuel Brunel, Jordan Serres
TL;DR
The paper develops non-asymptotic, finite-sample guarantees for estimating barycenters of distributions in geodesic spaces with an upper curvature bound. By extending concentration tools via Laplace transforms to CAT$(\kappa)$ spaces, it derives dimension-free bounds for empirical and iterated barycenters, including both expectation and high-probability results that generalize Hoeffding- and Bernstein-type inequalities. It then presents two algorithmic applications: a fast stochastic approximation in CAT$(0)$ spaces and a parallelized barycenter estimation scheme in symmetric spaces, each with PAC guarantees. Finally, the Riemannian case is treated to yield refined, geometry-aware bounds under sub-Gaussian conditions on tangent-space projections, highlighting the practical impact for non-Euclidean data analysis and statistical inference in curved spaces.
Abstract
We study the problem of estimating the barycenter of a distribution given i.i.d. data in a geodesic space. Assuming an upper curvature bound in Alexandrov's sense and a support condition ensuring the strong geodesic convexity of the barycenter problem, we establish finite-sample error bounds in expectation and with high probability. Our results generalize Hoeffding- and Bernstein-type concentration inequalities from Euclidean to geodesic spaces. Building on these concentration inequalities, we derive statistical guarantees for two efficient algorithms for the computation of barycenters.
