Compressed Momentum-based Single-Point Zero-Order Algorithm for Stochastic Distributed Nonconvex Optimization
Linjing Chen, Antai Xie, Xinlei Yi, Xiaoqiang Ren, Xiaofan Wang
TL;DR
This work addresses stochastic distributed nonconvex optimization when explicit gradients are unavailable and communication is bandwidth-limited. It introduces the Compressed Momentum-based Single-Point Zeroth-Order (CMSPZO) algorithm, which blends momentum, one-point zeroth-order gradient estimation, and compressed inter-agent communication over a connected graph with weight matrix $\mathbf{W}$ and spectral gap $\delta$. Under appropriate assumptions and parameter choices, CMSPZO achieves convergence to the exact solution with diminishing step sizes at rate $O(1/\sqrt[4]{T})$ and convergence to a neighborhood of a stationary point with fixed steps at rate $O(1/\sqrt{T})$, while controlling consensus and compression errors. Empirical results on a distributed nonconvex logistic regression task demonstrate reduced communication than baselines while maintaining competitive convergence, highlighting practical efficiency in bandwidth-limited networks.
Abstract
This paper studies a compressed momentum-based single-point zeroth-order algorithm for stochastic distributed nonconvex optimization, aiming to alleviate communication overhead and address the unavailability of explicit gradient information. In the developed framework, each agent has access only to stochastic zeroth-order information of its local objective function, performs local stochastic updates with momentum, and exchanges compressed updates with its neighbors. We theoretically prove that the proposed algorithm can achieve the exact solution with diminishing step sizes and can achieve a sublinear convergence rate towards a neighborhood of the stationary point with fixed step sizes. Numerical experiments validate the effectiveness and communication efficiency of the proposed algorithm.
