SaFL: Sybil-aware Federated Learning with Application to Face Recognition
Mahdi Ghafourian, Julian Fierrez, Ruben Vera-Rodriguez, Ruben Tolosana, Aythami Morales
TL;DR
Federated Learning enables collaborative training without sharing raw data, but is vulnerable to Sybil-based targeted poisoning under non-IID conditions. The authors propose SaFL, a Sybil-aware, time-variant aggregation that groups updates by cosine similarity and replaces each group with a median representative, reducing the influence of malicious updates. Through experiments on face recognition with non-IID data, SaFL is compared against Multi-Krum and FoolsGold, showing strong protection with minimal degradation in learning performance; a decaying similarity threshold further enhances robustness. This work offers a practical defense for secure FL deployments, particularly in privacy-sensitive, large-scale face recognition settings.
Abstract
Federated Learning (FL) is a machine learning paradigm to conduct collaborative learning among clients on a joint model. The primary goal is to share clients' local training parameters with an integrating server while preserving their privacy. This method permits to exploit the potential of massive mobile users' data for the benefit of machine learning models' performance while keeping sensitive data on local devices. On the downside, FL raises security and privacy concerns that have just started to be studied. To address some of the key threats in FL, researchers have proposed to use secure aggregation methods (e.g. homomorphic encryption, secure multiparty computation, etc.). These solutions improve some security and privacy metrics, but at the same time bring about other serious threats such as poisoning attacks, backdoor attacks, and free running attacks. This paper proposes a new defense method against poisoning attacks in FL called SaFL (Sybil-aware Federated Learning) that minimizes the effect of sybils with a novel time-variant aggregation scheme.
