Resource-Constrained Decentralized Federated Learning via Personalized Event-Triggering
Shahryar Zehtabi, Seyyedali Hosseinalipour, Christopher G. Brinton
TL;DR
The paper addresses resource-constrained, fully decentralized FL over time-varying D2D networks without a central server. It introduces EF-HC, an asynchronous, event-triggered, bandwidth-aware triggering framework with personalized thresholds to balance local updates and communications. The authors prove last-iterate convergence at a $O(\ln{k}/\sqrt{k})$ rate under a $B$-connected information-flow graph and show that information can propagate despite sporadic communications, complemented by extensive simulations demonstrating communication savings and faster convergence relative to baselines. The work has practical implications for edge/fog networks with heterogeneous devices and varying connectivity.
Abstract
Federated learning (FL) is a popular technique for distributing machine learning (ML) across a set of edge devices. In this paper, we study fully decentralized FL, where in addition to devices conducting training locally, they carry out model aggregations via cooperative consensus formation over device-to-device (D2D) networks. We introduce asynchronous, event-triggered communications among the devices to handle settings where access to a central server is not feasible. To account for the inherent resource heterogeneity and statistical diversity challenges in FL, we define personalized communication triggering conditions at each device that weigh the change in local model parameters against the available local network resources. We theoretically recover the $O(\ln{k} / \sqrt{k})$ convergence rate to the globally optimal model of decentralized gradient descent (DGD) methods in the setup of our methodology. We provide our convergence guarantees for the last iterates of models, under relaxed graph connectivity and data heterogeneity assumptions compared with the existing literature. To do so, we demonstrate a $B$-connected information flow guarantee in the presence of sporadic communications over the time-varying D2D graph. Our subsequent numerical evaluations demonstrate that our methodology obtains substantial improvements in convergence speed and/or communication savings compared to existing decentralized FL baselines.
