Entropy-driven Fair and Effective Federated Learning

Lin Wang; Zhichao Wang; Ye Shi; Sai Praneeth Karimireddy; Xiaoying Tang

Entropy-driven Fair and Effective Federated Learning

Lin Wang, Zhichao Wang, Ye Shi, Sai Praneeth Karimireddy, Xiaoying Tang

TL;DR

This work tackles fairness in Federated Learning under heterogeneous client data by balancing client performance without sacrificing global accuracy. It introduces FedEBA+, a bi-level optimization framework combining entropy-based fair aggregation with model and gradient alignment, yielding an analytic inner solution for aggregation weights and adaptive outer-loop updates. Theoretical results establish convergence in nonconvex FL and fairness improvements across generalized linear and strongly convex settings, while empirical results show reduced performance variance and higher global accuracy on diverse datasets; a communication-efficient variant (Prac-FedEBA+) maintains these gains with FedAvg-like costs. The approach also demonstrates robustness to noisy labels and compatibility with differential privacy, though Byzantine-robust extensions remain for future work, underscoring its practical impact for fair, scalable FL deployment.

Abstract

Federated Learning (FL) enables collaborative model training across distributed devices while preserving data privacy. Nonetheless, the heterogeneity of edge devices often leads to inconsistent performance of the globally trained models, resulting in unfair outcomes among users. Existing federated fairness algorithms strive to enhance fairness but often fall short in maintaining the overall performance of the global model, typically measured by the average accuracy across all clients. To address this issue, we propose a novel algorithm that leverages entropy-based aggregation combined with model and gradient alignments to simultaneously optimize fairness and global model performance. Our method employs a bi-level optimization framework, where we derive an analytic solution to the aggregation probability in the inner loop, making the optimization process computationally efficient. Additionally, we introduce an innovative alignment update and an adaptive strategy in the outer loop to further balance global model's performance and fairness. Theoretical analysis indicates that our approach guarantees convergence even in non-convex FL settings and demonstrates significant fairness improvements in generalized regression and strongly convex models. Empirically, our approach surpasses state-of-the-art federated fairness algorithms, ensuring consistent performance among clients while improving the overall performance of the global model.

Entropy-driven Fair and Effective Federated Learning

TL;DR

Abstract

Paper Structure (51 sections, 12 theorems, 103 equations, 13 figures, 24 tables, 3 algorithms)

This paper contains 51 sections, 12 theorems, 103 equations, 13 figures, 24 tables, 3 algorithms.

Introduction
Related Work
Preliminaries and Metrics
FedEBA+: An Effective Fair Algorithm
Fair Aggregation: EBA
Bi-level optimization formulation and alignment update
Adaptive Balance between Fairness and Global Performance Improvement
Model Alignment for Improving Global Accuracy.
Gradient Alignment for Improving Fairness.
Practical gradient alignment to reduce communication.
Analysis of Convergence and Fairness
Convergence Analysis of FedEBA+
Fairness Analysis of FedEBA+
Variance analysis.
Numerical Results
...and 36 more sections

Key Result

Proposition 4.1

By solving the constrained maximum entropy problem, we propose an aggregation strategy called EBA to enhance fairness in FL, expressed as follows: where $\tau>0$ is the temperature, and the derivation of $\tau$ is related to $\tilde{f}(x)$.

Figures (13)

Figure 1: Performance of algorithms on (a) left: variance and accuracy on MNIST, (a) right: variance and accuracy on CIFAR-10, (b) left: convergence on MNIST, (b) right: convergence on CIFAR-10.
Figure 2: Performance of all the methods in terms of Fairness (Var.).
Figure 3: Ablation study for hyperparameters
Figure 4: The maximum and minimum 5% performance of all baselines and FedEBA+ on CIFAR-10.
Figure 5: The maximum and minimum 5% performance of all baselines and FedEBA+ on FashionMNSIT.
...and 8 more figures

Theorems & Definitions (25)

Definition 3.1: Fairness via variance
Proposition 4.1
Remark 4.2: The effectiveness of $\tau$ on fairness
Proposition 4.3
Theorem 5.1
Remark 5.2
Remark 5.3
Theorem 5.4
proof
proof
...and 15 more

Entropy-driven Fair and Effective Federated Learning

TL;DR

Abstract

Entropy-driven Fair and Effective Federated Learning

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (13)

Theorems & Definitions (25)