Accelerating Hybrid Federated Learning Convergence under Partial Participation

Jieming Bian; Lei Wang; Kun Yang; Cong Shen; Jie Xu

Accelerating Hybrid Federated Learning Convergence under Partial Participation

Jieming Bian, Lei Wang, Kun Yang, Cong Shen, Jie Xu

TL;DR

This work addresses convergence in hybrid federated learning where the server holds a small, representative dataset and clients participate partially with non-IID data. It first analyzes CLG-SGD under non-IID and partial participation, revealing that server training helps but partial participation remains a bottleneck. The authors then introduce FedCLG, with two variants (FedCLG-C and FedCLG-S) that use server-side gradients to correct variance either during client updates or server aggregation, respectively. Theoretical convergence results show accelerated rates when leveraging server data, and experiments on MNIST, CIFAR-10, and CIFAR-100 demonstrate that FedCLG outperforms state-of-the-art baselines under realistic non-IID, partially participating settings. Overall, FedCLG provides a practical and theoretically grounded approach to speeding up hybrid FL by effectively exploiting server data and variance correction mechanisms.

Abstract

Over the past few years, Federated Learning (FL) has become a popular distributed machine learning paradigm. FL involves a group of clients with decentralized data who collaborate to learn a common model under the coordination of a centralized server, with the goal of protecting clients' privacy by ensuring that local datasets never leave the clients and that the server only performs model aggregation. However, in realistic scenarios, the server may be able to collect a small amount of data that approximately mimics the population distribution and has stronger computational ability to perform the learning process. To address this, we focus on the hybrid FL framework in this paper. While previous hybrid FL work has shown that the alternative training of clients and server can increase convergence speed, it has focused on the scenario where clients fully participate and ignores the negative effect of partial participation. In this paper, we provide theoretical analysis of hybrid FL under clients' partial participation to validate that partial participation is the key constraint on convergence speed. We then propose a new algorithm called FedCLG, which investigates the two-fold role of the server in hybrid FL. Firstly, the server needs to process the training steps using its small amount of local datasets. Secondly, the server's calculated gradient needs to guide the participated clients' training and the server's aggregation. We validate our theoretical findings through numerical experiments, which show that our proposed method FedCLG outperforms state-of-the-art methods.

Accelerating Hybrid Federated Learning Convergence under Partial Participation

TL;DR

Abstract

Paper Structure (24 sections, 6 theorems, 80 equations, 9 figures, 1 table, 1 algorithm)

This paper contains 24 sections, 6 theorems, 80 equations, 9 figures, 1 table, 1 algorithm.

Introduction
Related Works
Federated Learning
Hybrid Federated Learning
Variance Reduction
Problem Formulation and CLG-SGD
Problem Formulation
Novel Convergence Analysis of CLG-SGD 10001832
FedCLG
FedCLG-C
FedCLG-S
Comparison with FL Variance Reduction Methods
Convergence Analysis of FedCLG
Experiments
Setup
...and 9 more sections

Key Result

Theorem 1

Suppose that client local learning rate $\eta$, global learning rate $\eta_g$, and server local learning rate $\gamma$ are chosen such that $\eta \leq \frac{1}{3KL}$, $\eta \eta_g \leq \frac{1}{27KL}$, and $\gamma \leq \frac{1}{6EL}$. Under Assumptions assm:smooth, assm:unbiased-local, l_variance, a

Figures (9)

Figure 1: Illustration of a Communication Round in Hybrid FL with Non-IID Client Data and Partial Participation.
Figure 2: The key distinction between FedCLG-S and FedCLG-C lies in the timing and location of the correction step. In FedCLG-S, the corrections occur during the server aggregation step, whereas in FedCLG-C, they take place during each client's local training step.
Figure 3: Convergence performances on MNIST
Figure 4: Convergence performances on CIFAR-10
Figure 5: Convergence performances on CIFAR-100
...and 4 more figures

Theorems & Definitions (19)

Theorem 1
proof
Remark 1
Remark 2
Corollary 1
Remark 3
Remark 4
Remark 5
Theorem 2
proof
...and 9 more

Accelerating Hybrid Federated Learning Convergence under Partial Participation

TL;DR

Abstract

Accelerating Hybrid Federated Learning Convergence under Partial Participation

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (9)

Theorems & Definitions (19)