Wasserstein Distributionally Robust Regret Optimization
Lukas-Benedikt Fiechtner, Jose Blanchet
TL;DR
The paper studies ex-ante Wasserstein Distributionally Robust Regret Optimization (DRRO) as an alternative to DRO to mitigate over-conservatism while hedging model misspecification. It establishes that, under smoothness and unique-optimum conditions, DRRO behaves like ERM up to first order, and is exactly equivalent to ERM for quadratic losses, with explicit regret characterizations. It proves NP-hardness for general max-affine loss classes, motivating a convex relaxation that provides strong empirical performance and bounds the ex-ante regret. The authors develop exact and relaxation-based algorithms, including finite convex reformulations and hill-climbing initialization to accelerate solving, and demonstrate via Newsvendor experiments that DRRO can achieve robust yet growth-friendly decisions beyond traditional DRO. Overall, the work provides theoretical insights, computational strategies, and practical evidence that DRRO can achieve better trade-offs between robustness and upside potential than DRO alone.
Abstract
Distributionally Robust Optimization (DRO) is a popular framework for decision-making under uncertainty, but its adversarial nature can lead to overly conservative solutions. To address this, we study ex-ante Distributionally Robust Regret Optimization (DRRO), focusing on Wasserstein-based ambiguity sets which are popular due to their links to regularization and machine learning. We provide a systematic analysis of Wasserstein DRRO, paralleling known results for Wasserstein DRO. Under smoothness and regularity conditions, we show that Wasserstein DRRO coincides with Empirical Risk Minimization (ERM) up to first-order terms, and exactly so in convex quadratic settings. We revisit the Wasserstein DRRO newsvendor problem, where the loss is the maximum of two linear functions of demand and decision. Extending [25], we show that the regret can be computed by maximizing two one-dimensional concave functions. For more general loss functions involving the maximum of multiple linear terms in multivariate random variables and decision vectors, we prove that computing the regret and thus also the DRRO policy is NP-hard. We then propose a convex relaxation for these more general Wasserstein DRRO problems and demonstrate its strong empirical performance. Finally, we provide an upper bound on the optimality gap of our relaxation and show it improves over recent alternatives.
