Computing Wasserstein Barycenters through Gradient Flows
Eduardo Fernandes Montesuma, Yassir Bendou, Mike Gartrell
TL;DR
The paper advances Wasserstein barycenters by casting them as gradient flows in Wasserstein space, enabling mini-batch sampling and regularization through internal, potential, and interaction energies. It provides two practical instantiations— Empirical Flow for particle-based barycenters and Gaussian Mixture Flow for mixtures—along with convergence guarantees under a Polyak-Łojasiewicz condition and error bounds for empirical approximations. The approach supports joint measures with label-aware ground costs, yielding a decomposition of the barycenter problem into feature and label components and achieving robust performance in toy tests and challenging multi-source domain adaptation benchmarks. Overall, the work delivers a scalable, theoretically grounded framework that outperforms prior discrete and neural barycenter methods, particularly when leveraging label information during domain adaptation.
Abstract
Wasserstein barycenters provide a powerful tool for aggregating probability measures, while leveraging the geometry of their ambient space. Existing discrete methods suffer from poor scalability, as they require access to the complete set of samples from input measures. We address this issue by recasting the original barycenter problem as a gradient flow in the Wasserstein space. Our approach offers two advantages. First, we achieve scalability by sampling mini-batches from the input measures. Second, we incorporate functionals over probability measures, which regularize the barycenter problem through internal, potential, and interaction energies. We present two algorithms for empirical and Gaussian mixture measures, providing convergence guarantees under the Polyak-Łojasiewicz inequality. Experimental validation on toy datasets and domain adaptation benchmarks show that our methods outperform previous discrete and neural net-based methods for computing Wasserstein barycenters.
