Distributed Generalized Nash Equilibria Learning for Online Stochastic Aggregative Games
Kaixin Du, Min Meng
TL;DR
This work tackles online stochastic aggregative games with local decision-set constraints and time-varying coupled inequalities. It introduces a distributed online stochastic primal-dual push-sum algorithm that enables partial-information players to learn a variational GNE while tracking the aggregative variable over unbalanced, time-varying networks. The authors prove high-probability sublinear bounds on per-player regret R_i(T) and constraint violation R_g(T), and establish almost-sure convergence to the vGNE in time-invariant, strongly monotone cases, with sublinear rates for time-averaged iterates. The method is validated via simulations in an electricity-market-like scenario, demonstrating consensus, feasibility progress, and superior performance over baselines, indicating practical impact for real-time, distributed decision-making under uncertainty.
Abstract
This paper investigates online stochastic aggregative games subject to local set constraints and time-varying coupled inequality constraints, where each player possesses a time-varying expectation-valued cost function relying on not only its own decision variable but also an aggregation of all the players' variables. Each player can only access its local individual cost function and constraints, necessitating partial information exchanges with neighboring players through time-varying unbalanced networks. Additionally, local cost functions and constraint functions are not prior knowledge and only revealed gradually. To learn generalized Nash equilibria of such games, a novel distributed online stochastic algorithm is devised based on push-sum and primal-dual strategies. Through rigorous analysis, high probability bounds on the regret and constraint violation are provided by appropriately selecting decreasing stepsizes. Moreover, for a time-invariant stochastic strongly monotone game, it is shown that the generated sequence by the designed algorithm converges to its variational generalized Nash equilibrium (GNE) almost surely, and the time-averaged sequence converges sublinearly with high probability. Finally, the derived theoretical results are illustrated by numerical simulations.
