Sample Complexity of Quadratically Regularized Optimal Transport
Alberto González-Sanz, Eustasio del Barrio, Marcel Nutz
TL;DR
This work shows that quadratically regularized OT (QOT) admits parametric, $n^{-1/2}$-scale fluctuations for its dual potentials, optimal costs, and couplings, despite the dual lack of strong concavity and the potentials being non-smooth. The authors introduce a novel Lipschitz regularity result for the population transport sections and a VC-theory framework to bound the statistical complexity of the optimal support, enabling central limit theorems without requiring smoothness. They develop a Z-estimation style proof relying on Fréchet differentiability and a Fredholm-alternative-invertible derivative, complemented by a VC-tool that bounds empirical-process terms. The resulting CLTs for potentials, costs, and couplings provide precise finite-sample fluctuation descriptions and gradient estimates, supporting the use of QOT in high-dimensional statistical applications with sparse optimal transport plans. Overall, the paper establishes theoretically that QOT achieves parametric sample complexity while preserving sparsity and numerical stability advantages over EOT.
Abstract
It is well known that optimal transport suffers from the curse of dimensionality: when the prescribed marginals are approximated by i.i.d. samples, the convergence of the empirical optimal transport problem to the population counterpart slows exponentially with increasing dimension. Entropically regularized optimal transport (EOT) has become the standard bearer in many statistical applications as it avoids this curse. Indeed, EOT has parametric sample complexity, as has been shown in a series of works based on the smoothness of the EOT potentials or the strong concavity of the dual EOT problem. However, EOT produces full-support approximations to the (sparse) OT problem, leading to overspreading in applications, and is computationally unstable for small regularization parameters. The most popular alternative is quadratically regularized optimal transport (QOT), which penalizes couplings by $L^2$ norm instead of relative entropy. QOT produces sparse approximations of OT and is computationally stable. However, its potentials are not smooth (do not belong to a Donsker class) and its dual problem is not strongly concave, hence QOT is often assumed to suffer from the curse of dimensionality. In this paper, we show that QOT nevertheless has parametric sample complexity. More precisely, we establish central limit theorems for its dual potentials, optimal couplings, and optimal costs. Our analysis is based on novel arguments that focus on the regularity of the support of the optimal QOT coupling. Specifically, we establish a Lipschitz property of its sections and leverage VC theory to bound its statistical complexity. Our analysis also leads to gradient estimates of independent interest, including $C^{1,1}$ regularity of the population potentials.
