A Simple Data Augmentation for Feature Distribution Skewed Federated Learning
Yunlu Yan, Huazhu Fu, Yuexiang Li, Jinheng Xie, Jun Ma, Guang Yang, Lei Zhu
TL;DR
This work addresses feature distribution skew in federated learning by proposing FedRDN, a simple, plug-and-play input-level augmentation that injects federation-wide statistical information into local data during training. By randomly pairing local samples with statistics from other clients, FedRDN broadens the effective data distribution and enhances local feature generalization, reducing distribution bias without altering network architectures or incurring significant overhead. Extensive experiments across image classification and medical image segmentation demonstrate consistent improvements for a range of FL methods, with notably low additional communication costs and improved cross-site generalization. The approach has practical impact for deploying FL in heterogeneous environments where feature distributions vary across clients, providing a privacy-preserving, easily adoptable enhancement to existing pipelines.
Abstract
Federated Learning (FL) facilitates collaborative learning among multiple clients in a distributed manner and ensures the security of privacy. However, its performance inevitably degrades with non-Independent and Identically Distributed (non-IID) data. In this paper, we focus on the feature distribution skewed FL scenario, a common non-IID situation in real-world applications where data from different clients exhibit varying underlying distributions. This variation leads to feature shift, which is a key issue of this scenario. While previous works have made notable progress, few pay attention to the data itself, i.e., the root of this issue. The primary goal of this paper is to mitigate feature shift from the perspective of data. To this end, we propose a simple yet remarkably effective input-level data augmentation method, namely FedRDN, which randomly injects the statistical information of the local distribution from the entire federation into the client's data. This is beneficial to improve the generalization of local feature representations, thereby mitigating feature shift. Moreover, our FedRDN is a plug-and-play component, which can be seamlessly integrated into the data augmentation flow with only a few lines of code. Extensive experiments on several datasets show that the performance of various representative FL methods can be further improved by integrating our FedRDN, demonstrating its effectiveness, strong compatibility and generalizability. Code will be released.
