Semidiscrete optimal transport with unknown costs
Yinchu Zhu, Ilya O. Ryzhov
TL;DR
This work extends semidiscrete optimal transport to settings where the cost function is unknown and must be learned online. It introduces a semi-myopic learning algorithm that couples stochastic approximation for the dual variables with sequential, cost-wise exploration, enabling learning without discretization or smoothing. Theoretical guarantees show canonical $O(1/n)$ convergence rates for known costs and similar rates, plus a $PICS$-type bound of $O(\sqrt{n})$ for unknown costs, validating efficient online learning under probabilistic targets. Numerical experiments in synthetic and geographical-partitioning tasks demonstrate robustness to noise and highlight practical applicability to online decision problems with guaranteed contracts and adaptively learned costs.
Abstract
Semidiscrete optimal transport is a challenging generalization of the classical transportation problem in linear programming. The goal is to design a joint distribution for two random variables (one continuous, one discrete) with fixed marginals, in a way that minimizes expected cost. We formulate a novel variant of this problem in which the cost functions are unknown, but can be learned through noisy observations; however, only one function can be sampled at a time. We develop a semi-myopic algorithm that couples online learning with stochastic approximation, and prove that it achieves optimal convergence rates, despite the non-smoothness of the stochastic gradient and the lack of strong concavity in the objective function.
