Novel clustered federated learning based on local loss

Endong Gu; Yongxin Chen; Hao Wen; Xingju Cai; Deren Han

Novel clustered federated learning based on local loss

Endong Gu, Yongxin Chen, Hao Wen, Xingju Cai, Deren Han

TL;DR

This work tackles clustering in federated learning under strict privacy constraints, addressing non-IID data by introducing a loss-based clustering metric LCFL that does not rely on sharing gradients or raw data. The framework defines a distance between clients based on local losses, provides theoretical bounds linking this distance to distributional discrepancy, and supports flexible clustering methods with a warm-up phase that precedes FL training. Empirical results on FEMNIST, Rotated MNIST, and Rotated CIFAR10 show LCFL outperforms gradient- and parameter-based clustering approaches and standard FedAvg, especially when client data exhibit clustering structure. The approach preserves privacy, accommodates non-convex models, and offers practical improvements for personalized, distributed learning in real-world FL deployments.

Abstract

This paper proposes LCFL, a novel clustering metric for evaluating clients' data distributions in federated learning. LCFL aligns with federated learning requirements, accurately assessing client-to-client variations in data distribution. It offers advantages over existing clustered federated learning methods, addressing privacy concerns, improving applicability to non-convex models, and providing more accurate classification results. LCFL does not require prior knowledge of clients' data distributions. We provide a rigorous mathematical analysis, demonstrating the correctness and feasibility of our framework. Numerical experiments with neural network instances highlight the superior performance of LCFL over baselines on several clustered federated learning benchmarks.

Novel clustered federated learning based on local loss

TL;DR

Abstract

Paper Structure (13 sections, 2 theorems, 42 equations, 5 figures, 3 tables, 1 algorithm)

This paper contains 13 sections, 2 theorems, 42 equations, 5 figures, 3 tables, 1 algorithm.

Introduction
Contributions
Notation
Clustered federated learning based on local loss
Algorithm and Main Results
Implementation considerations
Examples and mathematical analysis
Drawbacks of the model parameter metric
Numerical experiments
Performances of the metrics
Performances of algorithms
Conclusion
Biography Section

Key Result

Theorem 1

For any $\delta > 0,$ with probability at least $(1-\delta)^4$ over the draw of $i.i.d$ samples $X_i, X_j$ of sizes $m_i, m_j$ respectively, the following holds:

Figures (5)

Figure 1: FL three-step protocol illustration
Figure 2: Performances of different metrics on FEMNIST
Figure 3: Performances of different metrics on Rotated MNIST
Figure 4: Test Accuracy comparison on FEMNIST
Figure 5: Test Accuracy comparison on Rotated CIFAR10

Theorems & Definitions (7)

Definition 1: KL divergence kullback1951information
Definition 2: Label-discrepancy mohri2012new
Definition 3: Empirical Rademacher complexity mohri2018foundations
Definition 4: Rademacher complexity mohri2018foundations
Theorem 1
Theorem 2: Hoeffding's Inequality
proof

Novel clustered federated learning based on local loss

TL;DR

Abstract

Novel clustered federated learning based on local loss

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (7)