Group Distributionally Robust Optimization with Flexible Sample Queries
Haomin Bai, Dingzhi Yu, Shuai Li, Haipeng Luo, Lijun Zhang
TL;DR
This work extends group distributionally robust optimization (GDRO) by introducing flexible, time-varying sample queries. By recasting GDRO as a two-player game between a w-player (non-oblivious online convex optimization) and a q-player (non-oblivious prediction with limited advice), the authors develop novel PLA algorithms and integrate them with Follow-The-Regularized-Leader updates to achieve anytime operation. They establish high-probability, time-uniform regret and optimization error bounds that scale as $O\left(\frac{1}{t}\sqrt{\sum_{j=1}^t \frac{m}{r_j}\log m}\right)$, with fixed-sample-size consequences $O\left(\frac{m\log m}{\epsilon^2}\right)$ for $r\in[m]$ and improvements over prior bounds for $r=1$ or $m$. Empirical results on synthetic and real multi-class data validate faster convergence under dynamic sampling and demonstrate the practical viability of the proposed flexible-sampling GDRO framework.
Abstract
Group distributionally robust optimization (GDRO) aims to develop models that perform well across $m$ distributions simultaneously. Existing GDRO algorithms can only process a fixed number of samples per iteration, either 1 or $m$, and therefore can not support scenarios where the sample size varies dynamically. To address this limitation, we investigate GDRO with flexible sample queries and cast it as a two-player game: one player solves an online convex optimization problem, while the other tackles a prediction with limited advice (PLA) problem. Within such a game, we propose a novel PLA algorithm, constructing appropriate loss estimators for cases where the sample size is either 1 or not, and updating the decision using follow-the-regularized-leader. Then, we establish the first high-probability regret bound for non-oblivious PLA. Building upon the above approach, we develop a GDRO algorithm that allows an arbitrary and varying sample size per round, achieving a high-probability optimization error bound of $O\left(\frac{1}{t}\sqrt{\sum_{j=1}^t \frac{m}{r_j}\log m}\right)$, where $r_t$ denotes the sample size at round $t$. This result demonstrates that the optimization error decreases as the number of samples increases and implies a consistent sample complexity of $O(m\log (m)/ε^2)$ for any fixed sample size $r\in[m]$, aligning with existing bounds for cases of $r=1$ or $m$. We validate our approach on synthetic binary and real-world multi-class datasets.
