Near-Optimal Algorithms for Group Distributionally Robust Optimization and Beyond
Tasuku Soma, Khashayar Gatmiry, Sharut Gupta, Stefanie Jegelka
TL;DR
This paper unifies a broad class of distributionally robust optimization problems under generalized group DRO and provides near-optimal stochastic algorithms. By formulating DRO as a two-player zero-sum game and applying online gradient methods for the model and online mirror/descent for the group-weighting, the authors derive GDRO-EXP3P and GDRO-TINF with substantially improved convergence rates over prior work, plus a matching information-theoretic lower bound for group DRO. They extend the framework to weighted ranking via permutahedra, achieving comparable rates and efficiency, and demonstrate strong empirical gains on both convex benchmarks and deep learning tasks. The results offer a principled, scalable approach to fairness and robustness across multiple subpopulation settings, with practical implications for robust ML deployment.
Abstract
Distributionally robust optimization (DRO) can improve the robustness and fairness of learning methods. In this paper, we devise stochastic algorithms for a class of DRO problems including group DRO, subpopulation fairness, and empirical conditional value at risk (CVaR) optimization. Our new algorithms achieve faster convergence rates than existing algorithms for multiple DRO settings. We also provide a new information-theoretic lower bound that implies our bounds are tight for group DRO. Empirically, too, our algorithms outperform known methods.
