Federated Learning under Partially Class-Disjoint Data via Manifold Reshaping
Ziqing Fan, Jiangchao Yao, Ruipeng Zhang, Lingjuan Lyu, Ya Zhang, Yanfeng Wang
TL;DR
This work tackles federated learning under partially class-disjoint data (PCDD), where each client holds only a subset of classes, a practical but understudied setting. It introduces FedMR, a manifold-reshaping framework that combines an intra-class loss to decorrelate feature dimensions and an inter-class loss leveraging global class prototypes to enforce margins, thereby preventing dimensional collapse and space invasion during local training. Theoretical and empirical analyses show that the joint losses align local representations with a globally consistent feature space, delivering significant accuracy gains and improved communication efficiency across multiple benchmarks and a real-world medical dataset, with tractable privacy and local-burden considerations. FedMR’s modular design and its light variants offer practical applicability for robust, scalable FL in heterogeneous, partially observed data environments.
Abstract
Statistical heterogeneity severely limits the performance of federated learning (FL), motivating several explorations e.g., FedProx, MOON and FedDyn, to alleviate this problem. Despite effectiveness, their considered scenario generally requires samples from almost all classes during the local training of each client, although some covariate shifts may exist among clients. In fact, the natural case of partially class-disjoint data (PCDD), where each client contributes a few classes (instead of all classes) of samples, is practical yet underexplored. Specifically, the unique collapse and invasion characteristics of PCDD can induce the biased optimization direction in local training, which prevents the efficiency of federated learning. To address this dilemma, we propose a manifold reshaping approach called FedMR to calibrate the feature space of local training. Our FedMR adds two interplaying losses to the vanilla federated learning: one is intra-class loss to decorrelate feature dimensions for anti-collapse; and the other one is inter-class loss to guarantee the proper margin among categories in the feature expansion. We conduct extensive experiments on a range of datasets to demonstrate that our FedMR achieves much higher accuracy and better communication efficiency. Source code is available at: https://github.com/MediaBrain-SJTU/FedMR.git.
