Towards Trustworthy Federated Learning
Alina Basharat, Yijun Bian, Ping Xu, Zhi Tian
TL;DR
This work tackles the challenge of trustworthy federated learning by integrating three mechanisms: a fairness-driven objective via $q$-fair FL, a differential privacy scheme to obscure local data, and a two-sided norm-based screening (TNBS) to mitigate Byzantine attacks. The framework defines $H(\theta) = \sum_{i=1}^{M} \frac{p_i}{q+1} F_i^{q+1}(\theta)$ and employs DP-perturbed gradients $\tilde{g}_i = g_i + n$ with $n \sim \mathcal{N}(0,\sigma^2)$, while TNBS keeps the middle $p$-fraction of gradients based on their norms before aggregating to update $\theta$. The authors establish a resilience bound $\|G - \nabla H\| \leq \frac{2\alpha}{1-p} \|\nabla H\| + \max_{i \in M} \|g_i - \nabla H\| + \sigma$ and prove convergence guarantees for both nonconvex and convex cases, showing that the method converges to a stationary point (nonconvex) or achieves a $1/T$-rate (convex) despite DP noise and Byzantine threats. Empirically, the method outperforms baselines under various attacks on MNIST and Spam datasets, while maintaining fairness across clients and demonstrating the expected privacy-accuracy trade-off induced by DP.
Abstract
This paper develops a comprehensive framework to address three critical trustworthy challenges in federated learning (FL): robustness against Byzantine attacks, fairness, and privacy preservation. To improve the system's defense against Byzantine attacks that send malicious information to bias the system's performance, we develop a Two-sided Norm Based Screening (TNBS) mechanism, which allows the central server to crop the gradients that have the l lowest norms and h highest norms. TNBS functions as a screening tool to filter out potential malicious participants whose gradients are far from the honest ones. To promote egalitarian fairness, we adopt the q-fair federated learning (q-FFL). Furthermore, we adopt a differential privacy-based scheme to prevent raw data at local clients from being inferred by curious parties. Convergence guarantees are provided for the proposed framework under different scenarios. Experimental results on real datasets demonstrate that the proposed framework effectively improves robustness and fairness while managing the trade-off between privacy and accuracy. This work appears to be the first study that experimentally and theoretically addresses fairness, privacy, and robustness in trustworthy FL.
