Provable Robust Overfitting Mitigation in Wasserstein Distributionally Robust Optimization
Shuang Liu, Yihan Wang, Yifan Zhu, Yibo Miao, Xiao-Shan Gao
TL;DR
This work tackles robust overfitting in Wasserstein distributionally robust optimization by introducing Statistically Robust WDRO (SR-WDRO), which augments Wasserstein-based ambiguity with a KL-divergence constraint to account for statistical error from finite data. It provides a rigorous generalization bound showing adversarial test loss is controlled by the statistically robust training loss, and establishes the existence of Stackelberg and Nash equilibria under reasonable conditions. The authors derive a computationally tractable dual reformulation and adapt it to classification with a sample-shift cost, along with a practical training algorithm that preserves computational efficiency. Empirically, SR-WDRO significantly reduces robust overfitting and improves adversarial robustness on CIFAR-10/100 and related architectures, with a manageable increase in training time. Overall, SR-WDRO offers a theoretically grounded, scalable approach to robust distributional learning with practical benefits for unseen distribution shifts.
Abstract
Wasserstein distributionally robust optimization (WDRO) optimizes against worst-case distributional shifts within a specified uncertainty set, leading to enhanced generalization on unseen adversarial examples, compared to standard adversarial training which focuses on pointwise adversarial perturbations. However, WDRO still suffers fundamentally from the robust overfitting problem, as it does not consider statistical error. We address this gap by proposing a novel robust optimization framework under a new uncertainty set for adversarial noise via Wasserstein distance and statistical error via Kullback-Leibler divergence, called the Statistically Robust WDRO. We establish a robust generalization bound for the new optimization framework, implying that out-of-distribution adversarial performance is at least as good as the statistically robust training loss with high probability. Furthermore, we derive conditions under which Stackelberg and Nash equilibria exist between the learner and the adversary, giving an optimal robust model in certain sense. Finally, through extensive experiments, we demonstrate that our method significantly mitigates robust overfitting and enhances robustness within the framework of WDRO.
