Decentralized Domain Generalization with Style Sharing: Formal Model and Convergence Analysis
Shahryar Zehtabi, Dong-Jun Han, Seyyedali Hosseinalipour, Christopher G. Brinton
TL;DR
This work addresses domain generalization in decentralized federated settings by introducing StyleDDG, a fully peer-to-peer DG algorithm that shares and exploits style statistics across one-hop neighbors. The authors provide a formal modeling framework for style-based DG, extending centralized methods like MixStyle and DSU to decentralized networks and deriving convergence guarantees under standard non-convex optimization assumptions. StyleDDG integrates consensus-based gradient updates with a three-stage style exploration pipeline (StyleShift, StyleExplore, MixStyle) to augment styles while keeping communication overhead minimal. Empirical results on PACS and VLCS show that StyleDDG achieves superior generalization to unseen target domains across varying graph connectivities and model sizes, validating both its theoretical and practical merits. The work advances the state of DG in distributed settings by delivering the first convergence analysis for style-based DG and a scalable, communication-efficient algorithm for fully decentralized networks.
Abstract
Much of federated learning (FL) focuses on settings where local dataset statistics remain the same between training and testing. However, this assumption often does not hold in practice due to distribution shifts, motivating the development of domain generalization (DG) approaches that leverage source domain data to train models capable of generalizing to unseen target domains. In this paper, we are motivated by two major gaps in existing work on FL and DG: (1) the lack of formal mathematical analysis of DG objectives; and (2) DG research in FL being limited to the star-topology architecture. We develop Decentralized Federated Domain Generalization with Style Sharing ($\textit{StyleDDG}$), a decentralized DG algorithm which allows devices in a peer-to-peer network to achieve DG based on sharing style information inferred from their datasets. Additionally, we provide the first systematic approach to analyzing style-based DG training in decentralized networks. We cast existing centralized DG algorithms within our framework, and employ their formalisms to model $\textit{StyleDDG}$. We then obtain analytical conditions under which convergence of $\textit{StyleDDG}$ can be guaranteed. Through experiments on popular DG datasets, we demonstrate that $\textit{StyleDDG}$ can obtain significant improvements in accuracy across target domains with minimal communication overhead compared to baseline decentralized gradient methods.
