FedDiv: Collaborative Noise Filtering for Federated Learning with Noisy Labels
Jichang Li, Guanbin Li, Hui Cheng, Zicheng Liao, Yizhou Yu
TL;DR
FedDiv tackles Federated Learning with Noisy Labels (F-LNL) by introducing a one-stage framework that jointly learns a global Federated Noise Filter (FNF) and a Predictive Consistency based Sampler (PCS). The FNF aggregates per-client Gaussian Mixture Models fitted to per-sample losses to model the global noise distribution without sharing raw data, enabling robust identification of clean vs. noisy samples. PCS enforces agreement between local and global predictions and applies counterfactual debiasing to reselect high-quality labeled data for local training, reducing noise memorization and stabilizing optimization. Training employs MixUp with a class-prior regularization to further enhance robustness, yielding state-of-the-art results on CIFAR-10, CIFAR-100, and Clothing1M under both IID and non-IID partitions. Overall, FedDiv demonstrates that collaborative noise filtering, coupled with predictive-consistency data selection, can significantly improve FL performance in the presence of noisy labels while preserving data privacy.
Abstract
Federated learning with noisy labels (F-LNL) aims at seeking an optimal server model via collaborative distributed learning by aggregating multiple client models trained with local noisy or clean samples. On the basis of a federated learning framework, recent advances primarily adopt label noise filtering to separate clean samples from noisy ones on each client, thereby mitigating the negative impact of label noise. However, these prior methods do not learn noise filters by exploiting knowledge across all clients, leading to sub-optimal and inferior noise filtering performance and thus damaging training stability. In this paper, we present FedDiv to tackle the challenges of F-LNL. Specifically, we propose a global noise filter called Federated Noise Filter for effectively identifying samples with noisy labels on every client, thereby raising stability during local training sessions. Without sacrificing data privacy, this is achieved by modeling the global distribution of label noise across all clients. Then, in an effort to make the global model achieve higher performance, we introduce a Predictive Consistency based Sampler to identify more credible local data for local model training, thus preventing noise memorization and further boosting the training stability. Extensive experiments on CIFAR-10, CIFAR-100, and Clothing1M demonstrate that \texttt{FedDiv} achieves superior performance over state-of-the-art F-LNL methods under different label noise settings for both IID and non-IID data partitions. Source code is publicly available at https://github.com/lijichang/FLNL-FedDiv.
