Minimax Distribution Estimation in Wasserstein Distance
Shashank Singh, Barnabás Póczos
TL;DR
This work establishes minimax rates for estimating a distribution under Wasserstein loss using only moment constraints and metric-entropy structure of the support. It provides a tight upper bound for the empirical distribution via a multi-resolution partitioning scheme and two complementary lower bounds based on packing radius and heavy-tailed moments, showing empirical distribution is often minimax-optimal. The results extend beyond totally bounded spaces to unbounded settings and connect to practical scenarios including Euclidean spaces, manifolds, and latent-variable models, with direct implications for Monte Carlo integration and Wasserstein-based learning. Overall, the paper clarifies how metric complexity and moment conditions jointly govern the speed of distribution estimation in Wasserstein distance and highlights the empirical estimator’s strong performance under broad conditions.
Abstract
The Wasserstein metric is an important measure of distance between probability distributions, with applications in machine learning, statistics, probability theory, and data analysis. This paper provides upper and lower bounds on statistical minimax rates for the problem of estimating a probability distribution under Wasserstein loss, using only metric properties, such as covering and packing numbers, of the sample space, and weak moment assumptions on the probability distributions.
