Nash Equilibria, Regularization and Computation in Optimal Transport-Based Distributionally Robust Optimization
Soroosh Shafiee, Liviu Aolaritei, Florian Dörfler, Daniel Kuhn
TL;DR
The paper advances distributionally robust optimization by integrating optimal transport-based ambiguity sets with Nash equilibrium analysis, showing that robustification induces both higher-order variation and Lipschitz regularization even when the transport cost is non-metric. It establishes conditions for the existence and computability of Nash equilibria, and demonstrates that the dual problem often reduces to finite convex programs under discrete reference distributions, enabling construction of least-favorable distributions. By connecting the c-transform to classical envelopes (Pasch-Hausdorff and Moreau), it provides dual perspectives and algorithmic paths for solving nonconvex DROs via gradient-based methods. The theoretical results are complemented by numerical experiments on DRO-SVMs and distributionally robust portfolio optimization, illustrating transferable adversarial samples and tangible performance improvements. Overall, the work unifies regularization and robustification in OT-based DRO, offering practical, scalable solutions for high-stakes decisions under distributional ambiguity.
Abstract
We study optimal transport-based distributionally robust optimization problems where a fictitious adversary, often envisioned as nature, can choose the distribution of the uncertain problem parameters by reshaping a prescribed reference distribution at a finite transportation cost. In this framework, we show that robustification is intimately related to various forms of variation and Lipschitz regularization even if the transportation cost function fails to be (some power of) a metric. We also derive conditions for the existence and the computability of a Nash equilibrium between the decision-maker and nature, and we demonstrate numerically that nature's Nash strategy can be viewed as a distribution that is supported on remarkably deceptive adversarial samples. Finally, we identify practically relevant classes of optimal transport-based distributionally robust optimization problems that can be addressed with efficient gradient descent algorithms even if the loss function or the transportation cost function are nonconvex (but not both at the same time).
