Distributionally Robust Optimization via Generative Ambiguity Modeling
Jiaqi Wen, Jianyi Yang
TL;DR
The paper tackles the challenge of constructing ambiguity sets for Distributionally Robust Optimization that are both expressive and tractable. It introduces Generative Ambiguity Sets (GAS) and a practical GAS-DRO algorithm that optimizes over a parameterized space of likelihood-based generative models (e.g., diffusion models and VAEs) while constraining reconstruction loss via $J(\theta,P_0)\le \epsilon$, ensuring consistency with the nominal distribution $P_0$. The inner maximization is tackled with dual learning and policy optimization (PPO), enabling tractable optimization in a finite parameter space, and the authors prove stationary convergence of GAS-DRO along with bounds on the inner-max error and a KL-divergence constraint. Empirically, GAS-DRO demonstrates superior Out-of-Distribution generalization on time-series and image classification tasks, outperforming Wasserstein- and KL-based DRO methods and prior generative-DRO approaches. The work offers a principled, scalable DRO framework that leverages the representational power of generative models to balance realism and robustness in distribution shifts.
Abstract
This paper studies Distributionally Robust Optimization (DRO), a fundamental framework for enhancing the robustness and generalization of statistical learning and optimization. An effective ambiguity set for DRO must involve distributions that remain consistent to the nominal distribution while being diverse enough to account for a variety of potential scenarios. Moreover, it should lead to tractable DRO solutions. To this end, we propose generative model-based ambiguity sets that capture various adversarial distributions beyond the nominal support space while maintaining consistency with the nominal distribution. Building on this generative ambiguity modeling, we propose DRO with Generative Ambiguity Set (GAS-DRO), a tractable DRO algorithm that solves the inner maximization over the parameterized generative model space. We formally establish the stationary convergence performance of GAS-DRO. We implement GAS-DRO with a diffusion model and empirically demonstrate its superior Out-of-Distribution (OOD) generalization performance in ML tasks.
