Table of Contents
Fetching ...

Sharp Convergence Rates for Empirical Optimal Transport with Smooth Costs

Tudor Manole, Jonathan Niles-Weed

Abstract

We revisit the question of characterizing the convergence rate of plug-in estimators of optimal transport costs. It is well known that an empirical measure comprising independent samples from an absolutely continuous distribution on $\mathbb{R}^d$ converges to that distribution at the rate $n^{-1/d}$ in Wasserstein distance, which can be used to prove that plug-in estimators of many optimal transport costs converge at this same rate. However, we show that when the cost is smooth, this analysis is loose: plug-in estimators based on empirical measures converge quadratically faster, at the rate $n^{-2/d}$. As a corollary, we show that the Wasserstein distance between two distributions is significantly easier to estimate when the measures are well-separated. We also prove lower bounds, showing not only that our analysis of the plug-in estimator is tight, but also that no other estimator can enjoy significantly faster rates of convergence uniformly over all pairs of measures. Our proofs rely on empirical process theory arguments based on tight control of $L^2$ covering numbers for locally Lipschitz and semi-concave functions. As a byproduct of our proofs, we derive $L^\infty$ estimates on the displacement induced by the optimal coupling between any two measures satisfying suitable concentration and anticoncentration conditions, for a wide range of cost functions.

Sharp Convergence Rates for Empirical Optimal Transport with Smooth Costs

Abstract

We revisit the question of characterizing the convergence rate of plug-in estimators of optimal transport costs. It is well known that an empirical measure comprising independent samples from an absolutely continuous distribution on converges to that distribution at the rate in Wasserstein distance, which can be used to prove that plug-in estimators of many optimal transport costs converge at this same rate. However, we show that when the cost is smooth, this analysis is loose: plug-in estimators based on empirical measures converge quadratically faster, at the rate . As a corollary, we show that the Wasserstein distance between two distributions is significantly easier to estimate when the measures are well-separated. We also prove lower bounds, showing not only that our analysis of the plug-in estimator is tight, but also that no other estimator can enjoy significantly faster rates of convergence uniformly over all pairs of measures. Our proofs rely on empirical process theory arguments based on tight control of covering numbers for locally Lipschitz and semi-concave functions. As a byproduct of our proofs, we derive estimates on the displacement induced by the optimal coupling between any two measures satisfying suitable concentration and anticoncentration conditions, for a wide range of cost functions.

Paper Structure

This paper contains 34 sections, 25 theorems, 199 equations.

Key Result

Theorem 1

Let $\mu,\nu \in \calP(\bbR^d)$ and $p > 1$. Let $\nu$ be a $\sigma^2$-sub-Gaussian measure boucheron2013 and $\mu$ have finite $p$-th moment, and assume there exist constants $c_1,c_2 > 0$ such that $\mu(B_{x,1}) \geq c_1 \exp(-c_2 \norm x^2)$ for all $x \in \bbR^d$. Then, for any optimal coupling In particular, if there exists an optimal transport map $T$ from $\mu$ to $\nu$ with respect to $c_

Theorems & Definitions (30)

  • Theorem : Informal
  • Lemma 1
  • Theorem 2
  • Corollary 3: Powers of $\ell_r$ Norms
  • Corollary 4: Wasserstein Distances
  • proof
  • Lemma 5
  • Lemma 6: vonluxburg2004
  • Lemma 7: bronshtein1976
  • Lemma 8
  • ...and 20 more