Table of Contents
Fetching ...

FedCCRL: Federated Domain Generalization with Cross-Client Representation Learning

Xinpeng Wang, Yongxin Guo, Xiaoying Tang

TL;DR

FedCCRL is proposed, a lightweight federated domain generalization method that significantly improves the model's generalization ability while preserving privacy and ensuring computational and communication efficiency.

Abstract

Domain Generalization (DG) aims to train models that can effectively generalize to unseen domains. However, in the context of Federated Learning (FL), where clients collaboratively train a model without directly sharing their data, most existing DG algorithms are not directly applicable to the FL setting due to privacy constraints, as well as the limited data quantity and domain diversity at each client. To tackle these challenges, we propose FedCCRL, a lightweight federated domain generalization method that significantly improves the model's generalization ability while preserving privacy and ensuring computational and communication efficiency. Specifically, FedCCRL comprises two principal modules: the first is a cross-client feature extension module, which increases local domain diversity via cross-client domain transfer and domain-invariant feature perturbation; the second is a representation and prediction dual-stage alignment module, which enables the model to effectively capture domain-invariant features. Extensive experimental results demonstrate that FedCCRL achieves the state-of-the-art performance on the PACS, OfficeHome and miniDomainNet datasets across FL settings of varying numbers of clients. Code is available at https://github.com/sanphouwang/fedccrl

FedCCRL: Federated Domain Generalization with Cross-Client Representation Learning

TL;DR

FedCCRL is proposed, a lightweight federated domain generalization method that significantly improves the model's generalization ability while preserving privacy and ensuring computational and communication efficiency.

Abstract

Domain Generalization (DG) aims to train models that can effectively generalize to unseen domains. However, in the context of Federated Learning (FL), where clients collaboratively train a model without directly sharing their data, most existing DG algorithms are not directly applicable to the FL setting due to privacy constraints, as well as the limited data quantity and domain diversity at each client. To tackle these challenges, we propose FedCCRL, a lightweight federated domain generalization method that significantly improves the model's generalization ability while preserving privacy and ensuring computational and communication efficiency. Specifically, FedCCRL comprises two principal modules: the first is a cross-client feature extension module, which increases local domain diversity via cross-client domain transfer and domain-invariant feature perturbation; the second is a representation and prediction dual-stage alignment module, which enables the model to effectively capture domain-invariant features. Extensive experimental results demonstrate that FedCCRL achieves the state-of-the-art performance on the PACS, OfficeHome and miniDomainNet datasets across FL settings of varying numbers of clients. Code is available at https://github.com/sanphouwang/fedccrl

Paper Structure

This paper contains 23 sections, 16 equations, 7 figures, 2 tables, 3 algorithms.

Figures (7)

  • Figure 1: Illustration of the typical scenario in FL. Each client contains data from a unique domain, and the test domain (Photo) differs from all domains present on the clients.
  • Figure 2: Overview of FedCCRL. The client transmits local model parameters and sample statistics to the server, where these elements are aggregated and redistributed to each client. During the local training phase, feature augmentation, representation alignment and prediction alignment are applied to ensure the model learns to focus on domain-invariant features.
  • Figure 3: Effect of feature augmentation. (b) is generated from (a) by CCDT, and (c) is generated from (b) by DIFP.
  • Figure 4: Average test accuracy (%) versus the number of participating clients.
  • Figure 5: Effect of $r$ and $\lambda_1$, $\lambda_2$ on accuracy across datasets
  • ...and 2 more figures