Federated Learning with Domain Shift Eraser

Zheng Wang; Zihui Wang; Zheng Wang; Xiaoliang Fan; Cheng Wang

Federated Learning with Domain Shift Eraser

Zheng Wang, Zihui Wang, Zheng Wang, Xiaoliang Fan, Cheng Wang

TL;DR

This paper tackles domain shift in federated learning by introducing Federated Domain Shift Eraser (FDSE), a decoupled framework that separates domain-agnostic feature extraction (DFE) from domain-specific skew erasing (DSE) via layer decomposition. FDSE enforces consistency across domains with a BN-based regularization term and aggregates shared components for consensus while personalizing DSE modules through similarity-aware aggregation. Empirical results on Office-Caltech10, PACS, and DomainNet show that FDSE achieves superior accuracy, robust convergence, and better generalization to unseen domains, while reducing communication and computation costs. The work demonstrates that a hybrid approach combining consensus enhancement with targeted personalization yields strong performance gains in cross-domain FL and offers a scalable path to broader applications and architectures.

Abstract

Federated learning (FL) is emerging as a promising technique for collaborative learning without local data leaving their devices. However, clients' data originating from diverse domains may degrade model performance due to domain shifts, preventing the model from learning consistent representation space. In this paper, we propose a novel FL framework, Federated Domain Shift Eraser (FDSE), to improve model performance by differently erasing each client's domain skew and enhancing their consensus. First, we formulate the model forward passing as an iterative deskewing process that extracts and then deskews features alternatively. This is efficiently achieved by decomposing each original layer in the neural network into a Domain-agnostic Feature Extractor (DFE) and a Domain-specific Skew Eraser (DSE). Then, a regularization term is applied to promise the effectiveness of feature deskewing by pulling local statistics of DSE's outputs close to the globally consistent ones. Finally, DFE modules are fairly aggregated and broadcast to all the clients to maximize their consensus, and DSE modules are personalized for each client via similarity-aware aggregation to erase their domain skew differently. Comprehensive experiments were conducted on three datasets to confirm the advantages of our method in terms of accuracy, efficiency, and generalizability.

Federated Learning with Domain Shift Eraser

TL;DR

Abstract

Federated Learning with Domain Shift Eraser

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)