Global Intervention and Distillation for Federated Out-of-Distribution Generalization
Zhuang Qi, Runhui Zhang, Lei Meng, Wei Wu, Yachong Zhang, Xiangxu Meng
TL;DR
The paper tackles federated out-of-distribution generalization under attribute skew, where local models latch onto non-causal background features. It introduces FedGID, a plug-and-play framework with two modules: Global Intervention (GI) to perform backdoor-adjustment by decoupling objects from backgrounds and injecting diverse background information, and Global Distillation (GD) to align local representations with a unified global knowledge base via KL-based regularization. The method optimizes a total loss $\mathcal{L}_{total} = \mathcal{L}_{EM} + \mathcal{L}_{GI} + \lambda \mathcal{L}_{GD}$, encouraging robust, invariant features across clients. Empirical results on three datasets show FedGID improves attention to main subjects in unseen data and outperforms state-of-the-art baselines, with ablations confirming the complementary benefits of GI and GD. Overall, FedGID provides a model-agnostic, privacy-conscious approach to federated OOD generalization by combining backdoor adjustment with cross-client knowledge distillation, enabling more reliable collaborative learning in heterogeneous environments.
Abstract
Attribute skew in federated learning leads local models to focus on learning non-causal associations, guiding them towards inconsistent optimization directions, which inevitably results in performance degradation and unstable convergence. Existing methods typically leverage data augmentation to enhance sample diversity or employ knowledge distillation to learn invariant representations. However, the instability in the quality of generated data and the lack of domain information limit their performance on unseen samples. To address these issues, this paper presents a global intervention and distillation method, termed FedGID, which utilizes diverse attribute features for backdoor adjustment to break the spurious association between background and label. It includes two main modules, where the global intervention module adaptively decouples objects and backgrounds in images, injects background information into random samples to intervene in the sample distribution, which links backgrounds to all categories to prevent the model from treating background-label associations as causal. The global distillation module leverages a unified knowledge base to guide the representation learning of client models, preventing local models from overfitting to client-specific attributes. Experimental results on three datasets demonstrate that FedGID enhances the model's ability to focus on the main subjects in unseen data and outperforms existing methods in collaborative modeling.
