Distributed Event-Based Learning via ADMM
Guner Dilsad Er, Sebastian Trimpe, Michael Muehlebach
TL;DR
This work tackles distributed learning over networks with non-i.i.d. data and limited communication. It introduces an event-triggered, over-relaxed ADMM scheme that exchanges information only when local updates exceed a threshold $Δ$, and analyzes the resulting algorithm as a dynamical system to establish convergence. Under strong convexity, the method achieves linear (and accelerated) convergence; in nonconvex settings, it yields sublinear guarantees, and a reset mechanism provides robustness to communication failures. Empirical results on MNIST and CIFAR-10 demonstrate substantial communication savings (≥35%) and superior accuracy-communication trade-offs over baselines like FedAvg, FedProx, SCAFFOLD, and FedADMM, highlighting its practical impact for large-scale, heterogeneous distributed learning.
Abstract
We consider a distributed learning problem, where agents minimize a global objective function by exchanging information over a network. Our approach has two distinct features: (i) It substantially reduces communication by triggering communication only when necessary, and (ii) it is agnostic to the data-distribution among the different agents. We therefore guarantee convergence even if the local data-distributions of the agents are arbitrarily distinct. We analyze the convergence rate of the algorithm both in convex and nonconvex settings and derive accelerated convergence rates for the convex case. We also characterize the effect of communication failures and demonstrate that our algorithm is robust to these. The article concludes by presenting numerical results from distributed learning tasks on the MNIST and CIFAR-10 datasets. The experiments underline communication savings of 35% or more due to the event-based communication strategy, show resilience towards heterogeneous data-distributions, and highlight that our approach outperforms common baselines such as FedAvg, FedProx, SCAFFOLD and FedADMM.
