FedRC: Tackling Diverse Distribution Shifts Challenge in Federated Learning by Robust Clustering
Yongxin Guo, Xiaoying Tang, Tao Lin
TL;DR
This work addresses the challenge of diverse and simultaneous distribution shifts in Federated Learning by proposing FedRC, a soft-clustering framework built on RobustCluster. RobustCluster casts clustering as a bi-level optimization over cluster parameters $\boldsymbol{\Theta}$ and per-source weights $\boldsymbol{\Omega}$, utilizing a robust objective $\mathcal{L}(\boldsymbol{\Theta},\boldsymbol{\Omega})$ that promotes separation of concept shifts while accommodating feature and label shifts via a mixture model. FedRC adapts this centralized clustering approach to FL, enabling cluster-specific global models through an iterative E-step / M-step style optimization and FedAvg aggregation, with theoretical convergence guarantees under standard assumptions. Empirical results on FashionMNIST, CIFAR10/100, and Tiny-ImageNet across CNNs and MobileNetV2/ResNet18 demonstrate that FedRC outperforms state-of-the-art clustered FL baselines, remains robust to cluster imbalances and varying concept numbers, and can exploit adaptive enhancements (FedRC-Adam) to accelerate convergence and even infer the number of concepts. The work provides a principled framework for robust clustering in privacy-preserving, heterogeneous FL systems and lays groundwork for future extensions like adaptive concept-counting and test-time adaptation.
Abstract
Federated Learning (FL) is a machine learning paradigm that safeguards privacy by retaining client data on edge devices. However, optimizing FL in practice can be challenging due to the diverse and heterogeneous nature of the learning system. Though recent research has focused on improving the optimization of FL when distribution shifts occur among clients, ensuring global performance when multiple types of distribution shifts occur simultaneously among clients -- such as feature distribution shift, label distribution shift, and concept shift -- remain under-explored. In this paper, we identify the learning challenges posed by the simultaneous occurrence of diverse distribution shifts and propose a clustering principle to overcome these challenges. Through our research, we find that existing methods fail to address the clustering principle. Therefore, we propose a novel clustering algorithm framework, dubbed as FedRC, which adheres to our proposed clustering principle by incorporating a bi-level optimization problem and a novel objective function. Extensive experiments demonstrate that FedRC significantly outperforms other SOTA cluster-based FL methods. Our code is available at \url{https://github.com/LINs-lab/FedRC}.
