Estimation of Stochastic Optimal Transport Maps
Sloan Nietert, Ziv Goldfeld
TL;DR
This paper introduces a general framework for estimating stochastic optimal transport maps by defining a new Ep error metric that does not require existence or uniqueness of a deterministic Brenier map. It develops efficient estimators (entropic-kernel and rounding) with near-optimal finite-sample guarantees and robustness to adversarial contamination, extending OT map theory to stochastic kernels. Under Hölder continuity of the optimal kernel, it provides improved guarantees via WDRO and discusses plug-in and wavelet-based approaches, achieving minimax-like rates under weaker assumptions than traditional Brenier-based analysis. The results are supported by experiments on irregular OT maps, illustrating both theoretical and practical advantages of the Ep framework for real-world, non-deterministic transport problems. This work thus offers a foundational, broadly applicable theory for stochastic OT map estimation with implications for domains where mass splitting is intrinsic.
Abstract
The optimal transport (OT) map is a geometry-driven transformation between high-dimensional probability distributions which underpins a wide range of tasks in statistics, applied probability, and machine learning. However, existing statistical theory for OT map estimation is quite restricted, hinging on Brenier's theorem (quadratic cost, absolutely continuous source) to guarantee existence and uniqueness of a deterministic OT map, on which various additional regularity assumptions are imposed to obtain quantitative error bounds. In many real-world problems these conditions fail or cannot be certified, in which case optimal transportation is possible only via stochastic maps that can split mass. To broaden the scope of map estimation theory to such settings, this work introduces a novel metric for evaluating the transportation quality of stochastic maps. Under this metric, we develop computationally efficient map estimators with near-optimal finite-sample risk bounds, subject to easy-to-verify minimal assumptions. Our analysis further accommodates common forms of adversarial sample contamination, yielding estimators with robust estimation guarantees. Empirical experiments are provided which validate our theory and demonstrate the utility of the proposed framework in settings where existing theory fails. These contributions constitute the first general-purpose theory for map estimation, compatible with a wide spectrum of real-world applications where optimal transport may be intrinsically stochastic.
