Zeroth-Order Stochastic Mirror Descent Algorithms for Minimax Excess Risk Optimization
Zhihao Gu, Zi Xu
TL;DR
This work tackles MERO, a distributionally robust minimax excess risk problem, by reformulating it as a stochastic convex-concave saddle-point with φ(w,q) = ∑ q_i [R_i(w) - R_i^*]. It introduces a zeroth-order stochastic mirror descent (ZO-SMD) algorithm that leverages UniGE-based gradient estimators for smooth losses and a non-smooth variant, enabling gradient-free optimization across m distributions. The authors prove optimal convergence rates: the excess-risk estimates converge at O(1/√t) and the saddle-point error at O(1/√t) for both smooth and non-smooth MERO, with an overall complexity of O(1/t). This provides the first zeroth-order guarantees for MERO and demonstrates the practical viability of gradient-free approaches in distributionally robust minimax settings, with potential extensions to nonconvex regimes and broader stochastic saddle-point problems.
Abstract
The minimax excess risk optimization (MERO) problem is a new variation of the traditional distributionally robust optimization (DRO) problem, which achieves uniformly low regret across all test distributions under suitable conditions. In this paper, we propose a zeroth-order stochastic mirror descent (ZO-SMD) algorithm available for both smooth and non-smooth MERO to estimate the minimal risk of each distrbution, and finally solve MERO as (non-)smooth stochastic convex-concave (linear) minimax optimization problems. The proposed algorithm is proved to converge at optimal convergence rates of $\mathcal{O}\left(1/\sqrt{t}\right)$ on the estimate of $R_i^*$ and $\mathcal{O}\left(1/\sqrt{t}\right)$ on the optimization error of both smooth and non-smooth MERO. Numerical results show the efficiency of the proposed algorithm.
