Energy-efficient Decentralized Learning via Graph Sparsification
Xusheng Zhang, Cho-Chun Chiu, Ting He
TL;DR
This work tackles energy efficiency in decentralized learning by jointly optimizing the mixing matrix and communication topology. It formulates a bi-level problem where the lower-level uses graph sparsification to design link activations that accelerate convergence, and the upper-level selects per-node budgets to minimize total workload. The authors provide a guaranteed-performance Ramanujan-graph design for a fully connected base topology and a scalable greedy heuristic for general graphs, achieving substantial busiest-node energy savings (54%–76%) without sacrificing model quality on realistic datasets and topologies. These results demonstrate a practical path to greener distributed learning systems by balancing communication load through principled spectral-design of the mixing matrix.
Abstract
This work aims at improving the energy efficiency of decentralized learning by optimizing the mixing matrix, which controls the communication demands during the learning process. Through rigorous analysis based on a state-of-the-art decentralized learning algorithm, the problem is formulated as a bi-level optimization, with the lower level solved by graph sparsification. A solution with guaranteed performance is proposed for the special case of fully-connected base topology and a greedy heuristic is proposed for the general case. Simulations based on real topology and dataset show that the proposed solution can lower the energy consumption at the busiest node by 54%-76% while maintaining the quality of the trained model.
