Towards Understanding and Mitigating Dimensional Collapse in Heterogeneous Federated Learning
Yujun Shi, Jian Liang, Wenqing Zhang, Vincent Y. F. Tan, Song Bai
TL;DR
Dimensional collapse of representations under data heterogeneity is identified as a core challenge in federated learning, affecting both local and global models. The authors provide a gradient-flow-based theoretical explanation showing how heterogeneous label distributions bias weight matrices toward low rank, leading to collapsed representations, and demonstrate that the global collapse inherits from local models. They propose FedDecorr, a plug-and-play regularizer that decorrelates representations by penalizing the correlation matrix via a Frobenius-norm term, implemented efficiently with z-score normalization. Empirical results on CIFAR-10/100 and TinyImageNet show consistent, notable gains over strong baselines, especially under strong heterogeneity, highlighting FedDecorr’s practical value for scalable, robust FL deployments.
Abstract
Federated learning aims to train models collaboratively across different clients without the sharing of data for privacy considerations. However, one major challenge for this learning paradigm is the {\em data heterogeneity} problem, which refers to the discrepancies between the local data distributions among various clients. To tackle this problem, we first study how data heterogeneity affects the representations of the globally aggregated models. Interestingly, we find that heterogeneous data results in the global model suffering from severe {\em dimensional collapse}, in which representations tend to reside in a lower-dimensional space instead of the ambient space. Moreover, we observe a similar phenomenon on models locally trained on each client and deduce that the dimensional collapse on the global model is inherited from local models. In addition, we theoretically analyze the gradient flow dynamics to shed light on how data heterogeneity result in dimensional collapse for local models. To remedy this problem caused by the data heterogeneity, we propose {\sc FedDecorr}, a novel method that can effectively mitigate dimensional collapse in federated learning. Specifically, {\sc FedDecorr} applies a regularization term during local training that encourages different dimensions of representations to be uncorrelated. {\sc FedDecorr}, which is implementation-friendly and computationally-efficient, yields consistent improvements over baselines on standard benchmark datasets. Code: https://github.com/bytedance/FedDecorr.
