Principled Federated Domain Adaptation: Gradient Projection and Auto-Weighting
Enyi Jiang, Yibo Jacky Zhang, Sanmi Koyejo
TL;DR
This work tackles Federated Domain Adaptation under two core challenges: domain shift between source and target domains and limited target data. It develops a principled framework based on a Delta error decomposition using $d_π(\mathcal{D}_S,\mathcal{D}_T)$ and $\sigma_π^2(\widehat{\mathcal{D}}_T)$, guiding the design of aggregation rules. The authors introduce FedGP, a gradient projection-based FDA method, and FedDA, a convex-gradient-merge approach, along with an auto-weighting scheme that computes optimal per-source weights $β_i$ from target updates. Empirical results on semi-synthetic and real-world domain shifts show that FedGP and the auto-weighted variants consistently outperform personalized FL, UFDA, and domain-generalization baselines, demonstrating the practical value of principled gradient-based FDA aggregation.
Abstract
Federated Domain Adaptation (FDA) describes the federated learning (FL) setting where source clients and a server work collaboratively to improve the performance of a target client where limited data is available. The domain shift between the source and target domains, coupled with limited data of the target client, makes FDA a challenging problem, e.g., common techniques such as federated averaging and fine-tuning fail due to domain shift and data scarcity. To theoretically understand the problem, we introduce new metrics that characterize the FDA setting and a theoretical framework with novel theorems for analyzing the performance of server aggregation rules. Further, we propose a novel lightweight aggregation rule, Federated Gradient Projection ($\texttt{FedGP}$), which significantly improves the target performance with domain shift and data scarcity. Moreover, our theory suggests an $\textit{auto-weighting scheme}$ that finds the optimal combinations of the source and target gradients. This scheme improves both $\texttt{FedGP}$ and a simpler heuristic aggregation rule. Extensive experiments verify the theoretical insights and illustrate the effectiveness of the proposed methods in practice.
