Communication Efficient ConFederated Learning: An Event-Triggered SAGA Approach
Bin Wang, Jun Fang, Hongbin Li, Yonina C. Eldar
TL;DR
This work tackles the high communication cost of federated learning across multiple edge servers by introducing Confederated Learning (CFL) and a gradient-tracking-based method, CFL-SAGA, that employs a conditionally-triggered user selection (CTUS) mechanism. CTUS selectively uploads variance-reduced gradients from a small subset of users, balancing information gain with communication overhead, while maintaining a linear convergence rate. The authors establish convergence guarantees under standard smoothness and strong convexity assumptions, deriving a rate bound with a carefully chosen stepsize, and provide a theoretical analysis showing CTUS can prune non-informative uploads. Empirical results across different server topologies demonstrate substantial communication efficiency gains over state-of-the-art methods, validating the practical impact of the approach for scalable, privacy-preserving distributed learning.
Abstract
Federated learning (FL) is a machine learning paradigm that targets model training without gathering the local data dispersed over various data sources. Standard FL, which employs a single server, can only support a limited number of users, leading to degraded learning capability. In this work, we consider a multi-server FL framework, referred to as \emph{Confederated Learning} (CFL), in order to accommodate a larger number of users. A CFL system is composed of multiple networked edge servers, with each server connected to an individual set of users. Decentralized collaboration among servers is leveraged to harness all users' data for model training. Due to the potentially massive number of users involved, it is crucial to reduce the communication overhead of the CFL system. We propose a stochastic gradient method for distributed learning in the CFL framework. The proposed method incorporates a conditionally-triggered user selection (CTUS) mechanism as the central component to effectively reduce communication overhead. Relying on a delicately designed triggering condition, the CTUS mechanism allows each server to select only a small number of users to upload their gradients, without significantly jeopardizing the convergence performance of the algorithm. Our theoretical analysis reveals that the proposed algorithm enjoys a linear convergence rate. Simulation results show that it achieves substantial improvement over state-of-the-art algorithms in terms of communication efficiency.
