Distributed Online Convex Optimization with Nonseparable Costs and Constraints
Zhaoye Pan, Haozhe Lei, Fan Zuo, Zilin Bian, Tao Li
TL;DR
This work addresses distributed online convex optimization with time-varying, globally coupled (nonseparable) costs and constraints. It introduces a belief-consensus-based distributed online primal-dual algorithm (dopbc) in which agents maintain and exchange beliefs about joint actions and multipliers, enabling decentralized gradient-like updates. Theoretical results show sublinear performance: static regret and cumulative constraint violation both scale as $O(\\sqrt{T})$ under a fixed step size by selecting $c=1/2$ in $\alpha_t=T^{-c}$, breaking the prior $O(T^{3/4})$ CCV barrier. The key insight is that belief sharing decouples primal consensus disagreement from dual constraint violation, achieving optimal online learning efficiency at the cost of additional communication.
Abstract
This paper studies distributed online convex optimization with time-varying coupled constraints, motivated by distributed online control in network systems. Most prior work assumes a separability condition: the global objective and coupled constraint functions are sums of local costs and individual constraints. In contrast, we study a group of agents, networked via a communication graph, that collectively select actions to minimize a sequence of nonseparable global cost functions and to stratify nonseparable long-term constraints based on full-information feedback and intra-agent communication. We propose a distributed online primal-dual belief consensus algorithm, where each agent maintains and updates a local belief of the global collective decisions, which are repeatedly exchanged with neighboring agents. Unlike the previous consensus primal-dual algorithms under separability that ask agents to only communicate their local decisions, our belief-sharing protocol eliminates coupling between the primal consensus disagreement and the dual constraint violation, yielding sublinear regret and cumulative constraint violation (CCV) bounds, both in $O({T}^{1/2})$, where $T$ denotes the time horizon. Such a result breaks the long-standing $O(T^{3/4})$ barrier for CCV and matches the lower bound of online constrained convex optimization, indicating the online learning efficiency at the cost of communication overhead.
