Local K-Similarity Constraint for Federated Learning with Label Noise

Sanskar Amgain; Prashant Shrestha; Bidur Khanal; Alina Devkota; Yash Raj Shrestha; Seungryul Baek; Prashnna Gyawali; Binod Bhattarai

Local K-Similarity Constraint for Federated Learning with Label Noise

Sanskar Amgain, Prashant Shrestha, Bidur Khanal, Alina Devkota, Yash Raj Shrestha, Seungryul Baek, Prashnna Gyawali, Binod Bhattarai

TL;DR

The paper tackles federated learning under substantial label noise by introducing a local K-similarity constraint that regresses client representations toward the local SSL neighborhood. A fixed SSL encoder provides pseudo-ground-truth neighborhoods, and an InfoNCE-based regularizer enforces locality in the classifier space, yielding a per-sample objective that combines standard CE with a neighborhood-based penalty. Empirically, the method surpasses state-of-the-art FNLL baselines across computer vision and medical imaging benchmarks, including real-world noisy datasets, and remains effective with different SSL backbones in architecture-agnostic settings. The approach offers practical robustness without requiring shared SSL and classifier architectures or global denoising, making it suitable for diverse federated deployments with noisy client data.

Abstract

Federated learning on clients with noisy labels is a challenging problem, as such clients can infiltrate the global model, impacting the overall generalizability of the system. Existing methods proposed to handle noisy clients assume that a sufficient number of clients with clean labels are available, which can be leveraged to learn a robust global model while dampening the impact of noisy clients. This assumption fails when a high number of heterogeneous clients contain noisy labels, making the existing approaches ineffective. In such scenarios, it is important to locally regularize the clients before communication with the global model, to ensure the global model isn't corrupted by noisy clients. While pre-trained self-supervised models can be effective for local regularization, existing centralized approaches relying on pretrained initialization are impractical in a federated setting due to the potentially large size of these models, which increases communication costs. In that line, we propose a regularization objective for client models that decouples the pre-trained and classification models by enforcing similarity between close data points within the client. We leverage the representation space of a self-supervised pretrained model to evaluate the closeness among examples. This regularization, when applied with the standard objective function for the downstream task in standard noisy federated settings, significantly improves performance, outperforming existing state-of-the-art federated methods in multiple computer vision and medical image classification benchmarks. Unlike other techniques that rely on self-supervised pretrained initialization, our method does not require the pretrained model and classifier backbone to share the same architecture, making it architecture-agnostic.

Local K-Similarity Constraint for Federated Learning with Label Noise

TL;DR

Abstract

Local K-Similarity Constraint for Federated Learning with Label Noise

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (14)