Hypernetwork-Driven Model Fusion for Federated Domain Generalization
Marc Bartholet, Taehyeon Kim, Ami Beuret, Se-Young Yun, Joachim M. Buhmann
TL;DR
The paper tackles Federated Domain Generalization (FDG) by addressing domain shifts in decentralized data via non-linear model aggregation. It introduces hFedF, a hypernetwork-based Federated Fusion framework that uses client-specific embeddings to generate personalized client parameters g_i = h(θ, ν[i]), enabling a quasi-global fusion without sharing raw data. Stability and convergence are achieved through GradAlign-based gradient alignment and Exponential Moving Average (EMA) regularization, with empirical validation on PACS, Office-Home, and VLCS showing superior in-domain and out-of-domain performance in zero-shot and few-shot settings. The results indicate strong robustness to domain heterogeneity, reduced weight divergence across clients, and improved prediction reliability, highlighting the practical potential of hypernetwork-based FDG for privacy-preserving, scalable domain generalization in FL.
Abstract
Federated Learning (FL) faces significant challenges with domain shifts in heterogeneous data, degrading performance. Traditional domain generalization aims to learn domain-invariant features, but the federated nature of model averaging often limits this due to its linear aggregation of local learning. To address this, we propose a robust framework, coined as hypernetwork-based Federated Fusion (hFedF), using hypernetworks for non-linear aggregation, facilitating generalization to unseen domains. Our method employs client-specific embeddings and gradient alignment techniques to manage domain generalization effectively. Evaluated in both zero-shot and few-shot settings, hFedF demonstrates superior performance in handling domain shifts. Comprehensive comparisons on PACS, Office-Home, and VLCS datasets show that hFedF consistently achieves the highest in-domain and out-of-domain accuracy with reliable predictions. Our study contributes significantly to the under-explored field of Federated Domain Generalization (FDG), setting a new benchmark for performance in this area.
