Table of Contents
Fetching ...

Entropy-driven Fair and Effective Federated Learning

Lin Wang, Zhichao Wang, Ye Shi, Sai Praneeth Karimireddy, Xiaoying Tang

TL;DR

This work tackles fairness in Federated Learning under heterogeneous client data by balancing client performance without sacrificing global accuracy. It introduces FedEBA+, a bi-level optimization framework combining entropy-based fair aggregation with model and gradient alignment, yielding an analytic inner solution for aggregation weights and adaptive outer-loop updates. Theoretical results establish convergence in nonconvex FL and fairness improvements across generalized linear and strongly convex settings, while empirical results show reduced performance variance and higher global accuracy on diverse datasets; a communication-efficient variant (Prac-FedEBA+) maintains these gains with FedAvg-like costs. The approach also demonstrates robustness to noisy labels and compatibility with differential privacy, though Byzantine-robust extensions remain for future work, underscoring its practical impact for fair, scalable FL deployment.

Abstract

Federated Learning (FL) enables collaborative model training across distributed devices while preserving data privacy. Nonetheless, the heterogeneity of edge devices often leads to inconsistent performance of the globally trained models, resulting in unfair outcomes among users. Existing federated fairness algorithms strive to enhance fairness but often fall short in maintaining the overall performance of the global model, typically measured by the average accuracy across all clients. To address this issue, we propose a novel algorithm that leverages entropy-based aggregation combined with model and gradient alignments to simultaneously optimize fairness and global model performance. Our method employs a bi-level optimization framework, where we derive an analytic solution to the aggregation probability in the inner loop, making the optimization process computationally efficient. Additionally, we introduce an innovative alignment update and an adaptive strategy in the outer loop to further balance global model's performance and fairness. Theoretical analysis indicates that our approach guarantees convergence even in non-convex FL settings and demonstrates significant fairness improvements in generalized regression and strongly convex models. Empirically, our approach surpasses state-of-the-art federated fairness algorithms, ensuring consistent performance among clients while improving the overall performance of the global model.

Entropy-driven Fair and Effective Federated Learning

TL;DR

This work tackles fairness in Federated Learning under heterogeneous client data by balancing client performance without sacrificing global accuracy. It introduces FedEBA+, a bi-level optimization framework combining entropy-based fair aggregation with model and gradient alignment, yielding an analytic inner solution for aggregation weights and adaptive outer-loop updates. Theoretical results establish convergence in nonconvex FL and fairness improvements across generalized linear and strongly convex settings, while empirical results show reduced performance variance and higher global accuracy on diverse datasets; a communication-efficient variant (Prac-FedEBA+) maintains these gains with FedAvg-like costs. The approach also demonstrates robustness to noisy labels and compatibility with differential privacy, though Byzantine-robust extensions remain for future work, underscoring its practical impact for fair, scalable FL deployment.

Abstract

Federated Learning (FL) enables collaborative model training across distributed devices while preserving data privacy. Nonetheless, the heterogeneity of edge devices often leads to inconsistent performance of the globally trained models, resulting in unfair outcomes among users. Existing federated fairness algorithms strive to enhance fairness but often fall short in maintaining the overall performance of the global model, typically measured by the average accuracy across all clients. To address this issue, we propose a novel algorithm that leverages entropy-based aggregation combined with model and gradient alignments to simultaneously optimize fairness and global model performance. Our method employs a bi-level optimization framework, where we derive an analytic solution to the aggregation probability in the inner loop, making the optimization process computationally efficient. Additionally, we introduce an innovative alignment update and an adaptive strategy in the outer loop to further balance global model's performance and fairness. Theoretical analysis indicates that our approach guarantees convergence even in non-convex FL settings and demonstrates significant fairness improvements in generalized regression and strongly convex models. Empirically, our approach surpasses state-of-the-art federated fairness algorithms, ensuring consistent performance among clients while improving the overall performance of the global model.
Paper Structure (51 sections, 12 theorems, 103 equations, 13 figures, 24 tables, 3 algorithms)

This paper contains 51 sections, 12 theorems, 103 equations, 13 figures, 24 tables, 3 algorithms.

Key Result

Proposition 4.1

By solving the constrained maximum entropy problem, we propose an aggregation strategy called EBA to enhance fairness in FL, expressed as follows: where $\tau>0$ is the temperature, and the derivation of $\tau$ is related to $\tilde{f}(x)$.

Figures (13)

  • Figure 1: Performance of algorithms on (a) left: variance and accuracy on MNIST, (a) right: variance and accuracy on CIFAR-10, (b) left: convergence on MNIST, (b) right: convergence on CIFAR-10.
  • Figure 2: Performance of all the methods in terms of Fairness (Var.).
  • Figure 3: Ablation study for hyperparameters
  • Figure 4: The maximum and minimum 5% performance of all baselines and FedEBA+ on CIFAR-10.
  • Figure 5: The maximum and minimum 5% performance of all baselines and FedEBA+ on FashionMNSIT.
  • ...and 8 more figures

Theorems & Definitions (25)

  • Definition 3.1: Fairness via variance
  • Proposition 4.1
  • Remark 4.2: The effectiveness of $\tau$ on fairness
  • Proposition 4.3
  • Theorem 5.1
  • Remark 5.2
  • Remark 5.3
  • Theorem 5.4
  • proof
  • proof
  • ...and 15 more