Communication Efficient ConFederated Learning: An Event-Triggered SAGA Approach

Bin Wang; Jun Fang; Hongbin Li; Yonina C. Eldar

Communication Efficient ConFederated Learning: An Event-Triggered SAGA Approach

Bin Wang, Jun Fang, Hongbin Li, Yonina C. Eldar

TL;DR

This work tackles the high communication cost of federated learning across multiple edge servers by introducing Confederated Learning (CFL) and a gradient-tracking-based method, CFL-SAGA, that employs a conditionally-triggered user selection (CTUS) mechanism. CTUS selectively uploads variance-reduced gradients from a small subset of users, balancing information gain with communication overhead, while maintaining a linear convergence rate. The authors establish convergence guarantees under standard smoothness and strong convexity assumptions, deriving a rate bound with a carefully chosen stepsize, and provide a theoretical analysis showing CTUS can prune non-informative uploads. Empirical results across different server topologies demonstrate substantial communication efficiency gains over state-of-the-art methods, validating the practical impact of the approach for scalable, privacy-preserving distributed learning.

Abstract

Federated learning (FL) is a machine learning paradigm that targets model training without gathering the local data dispersed over various data sources. Standard FL, which employs a single server, can only support a limited number of users, leading to degraded learning capability. In this work, we consider a multi-server FL framework, referred to as \emph{Confederated Learning} (CFL), in order to accommodate a larger number of users. A CFL system is composed of multiple networked edge servers, with each server connected to an individual set of users. Decentralized collaboration among servers is leveraged to harness all users' data for model training. Due to the potentially massive number of users involved, it is crucial to reduce the communication overhead of the CFL system. We propose a stochastic gradient method for distributed learning in the CFL framework. The proposed method incorporates a conditionally-triggered user selection (CTUS) mechanism as the central component to effectively reduce communication overhead. Relying on a delicately designed triggering condition, the CTUS mechanism allows each server to select only a small number of users to upload their gradients, without significantly jeopardizing the convergence performance of the algorithm. Our theoretical analysis reveals that the proposed algorithm enjoys a linear convergence rate. Simulation results show that it achieves substantial improvement over state-of-the-art algorithms in terms of communication efficiency.

Communication Efficient ConFederated Learning: An Event-Triggered SAGA Approach

TL;DR

Abstract

Paper Structure (33 sections, 11 theorems, 82 equations, 7 figures, 3 algorithms)

This paper contains 33 sections, 11 theorems, 82 equations, 7 figures, 3 algorithms.

Introduction
Problem Formulation
CFL Framework
Function and Server-Network Assumptions
Overview of GT and GT-SAGA
Gradient Tracking
Gradient Tracking with Variance Reduction
Adapting GT-SAGA for CFL
Proposed Algorithm
Summary of Algorithm
Rationale Behind The CTUS Mechanism
Discussions
Convergence Results
Proof of Theorem \ref{['theorem-1']}
Finding $\boldsymbol{\psi}$
...and 18 more sections

Key Result

Theorem 1

Let $\boldsymbol{x}^*$ denote the optimal solution to the CFL problem (problem-cfl). Assume that the objective function (resp. server network) satisfies the assumptions made in Section sec-obj-assumption. Define where $X^k\triangleq \|\boldsymbol{x}^{k}- \bar{\boldsymbol{W}}_{\infty}\boldsymbol{x}^{k}\|_2^2$, $\bar{X}^k \triangleq\|\bar{\boldsymbol{x}}^{k}-\boldsymbol{x}^*\|_2^2$, $Y^k\triangleq\

Figures (7)

Figure 1: The CFL framework with multiple servers.
Figure 2: Topology of the server network
Figure 3: Results on $\ell_2$-regularized logistic regression. First row: Optimality gap vs. number of iterations on different server networks; Second row: Optimality gap vs. communication overhead on different server networks
Figure 4: Comparisons on $\ell_2$-regularized logistic regression. First row: Optimality gap vs. number of iterations on different server networks; Second row: Optimality gap vs. communication overhead on different server networks. SR is short for 'Sampling rate'.
Figure 5: Runtime vs. number of iterations on the random graph
...and 2 more figures

Theorems & Definitions (19)

Definition 1
Definition 2
Theorem 1
Proposition 1
proof
Lemma 1: Lemma 10 in QuLi17
Lemma 2: Lemma 2 in PuShi20
Lemma 3: Lemma 1 in XinKhan18
Lemma 4: Corollary 8.1.29 in HornJohnson12
Lemma 5: Lemma 4 in XinKhan20
...and 9 more

Communication Efficient ConFederated Learning: An Event-Triggered SAGA Approach

TL;DR

Abstract

Communication Efficient ConFederated Learning: An Event-Triggered SAGA Approach

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (19)