Quantized Decentralized Stochastic Learning over Directed Graphs
Hossein Taheri, Aryan Mokhtari, Hamed Hassani, Ramtin Pedarsani
TL;DR
This work tackles the communication bottleneck in decentralized learning over directed graphs by introducing quantized push-sum schemes for both gossip and decentralized stochastic optimization. The proposed methods compress exchanged information while preserving the convergence guarantees of exact-communication push-sum, achieving the same $O\left(\frac{1}{\sqrt{nT}}\right)$ rates for convex problems and the corresponding stationary-point rates for non-convex problems. The analysis shows quantization noise decays at a linear spectral rate, enabling vanishing error and matching performance with significantly reduced communication. Numerical experiments on directed graphs demonstrate substantial reductions in transmitted bits (up to 5–10x) with negligible loss in convergence speed or final accuracy, highlighting practical benefits for large-scale distributed learning.
Abstract
We consider a decentralized stochastic learning problem where data points are distributed among computing nodes communicating over a directed graph. As the model size gets large, decentralized learning faces a major bottleneck that is the heavy communication load due to each node transmitting large messages (model updates) to its neighbors. To tackle this bottleneck, we propose the quantized decentralized stochastic learning algorithm over directed graphs that is based on the push-sum algorithm in decentralized consensus optimization. More importantly, we prove that our algorithm achieves the same convergence rates of the decentralized stochastic learning algorithm with exact-communication for both convex and non-convex losses. Numerical evaluations corroborate our main theoretical results and illustrate significant speed-up compared to the exact-communication methods.
