Mitigating Domain Shift in Federated Learning via Intra- and Inter-Domain Prototypes
Huy Q. Le, Ye Lin Tun, Yu Qiao, Minh N. H. Nguyen, Keon Oh Kim, Eui-Nam Huh, Choong Seon Hong
TL;DR
This work tackles domain shift in federated learning by introducing I$^2$PFL, which jointly leverages intra-domain prototypes gathered locally via MixUp-based augmentation and inter-domain generalized prototypes computed with a prototype reweighting scheme. The method employs Generalized Prototypes Contrastive Learning (GPCL) and Augmented Prototype Alignment (APA) losses to transfer cross-domain knowledge while enriching local feature diversity, with EMA updates ensuring stable prototype evolution. Empirical results on Digits, Office-10, and PACS show improved generalization across domains and strong performance in unseen-domain settings, with ablations confirming the contributions of intra-domain augmentation, inter-domain reweighting, and prototype-based contrastive learning. The approach offers a practical pathway to robust FL under real-world multi-domain heterogeneity, while preserving privacy through prototype averaging operations.
Abstract
Federated Learning (FL) has emerged as a decentralized machine learning technique, allowing clients to train a global model collaboratively without sharing private data. However, most FL studies ignore the crucial challenge of heterogeneous domains where each client has a distinct feature distribution, which is popular in real-world scenarios. Prototype learning, which leverages the mean feature vectors within the same classes, has become a prominent solution for federated learning under domain shift. However, existing federated prototype learning methods focus soley on inter-domain prototypes and neglect intra-domain perspectives. In this work, we introduce a novel federated prototype learning method, namely I$^2$PFL, which incorporates $\textbf{I}$ntra-domain and $\textbf{I}$nter-domain $\textbf{P}$rototypes, to mitigate domain shift from both perspectives and learn a generalized global model across multiple domains in federated learning. To construct intra-domain prototypes, we propose feature alignment with MixUp-based augmented prototypes to capture the diversity within local domains and enhance the generalization of local features. Additionally, we introduce a reweighting mechanism for inter-domain prototypes to generate generalized prototypes that reduce domain shift while providing inter-domain knowledge across multiple clients. Extensive experiments on the Digits, Office-10, and PACS datasets illustrate the superior performance of our method compared to other baselines.
