DSDRNet: Disentangling Representation and Reconstruct Network for Domain Generalization
Juncheng Yang, Zuchao Li, Shuai Xie, Wei Yu, Shijun Li
TL;DR
DSDRNet tackles domain generalization by learning disentangled representations and robust reconstructions across unseen domains. The method introduces a dual-stream framework with AdaIN-based disentanglement and a two-stage cyclic reconstruction, enforcing both intra-instance and inter-instance semantic constraints to preserve semantic structure while varying style attributes. The approach combines reconstruction, cross-cycle consistency, semantics adversarial supervision, and KL/divergence-based classification signals, achieving state-of-the-art or competitive results on Digits-DG, PACS, OfficeHome, and DomainNet. This yields practical improvements in generalization under domain shifts without relying on domain labels, though it requires careful loss-balancing and presents opportunities for further interpretability via causal analysis.
Abstract
Domain generalization faces challenges due to the distribution shift between training and testing sets, and the presence of unseen target domains. Common solutions include domain alignment, meta-learning, data augmentation, or ensemble learning, all of which rely on domain labels or domain adversarial techniques. In this paper, we propose a Dual-Stream Separation and Reconstruction Network, dubbed DSDRNet. It is a disentanglement-reconstruction approach that integrates features of both inter-instance and intra-instance through dual-stream fusion. The method introduces novel supervised signals by combining inter-instance semantic distance and intra-instance similarity. Incorporating Adaptive Instance Normalization (AdaIN) into a two-stage cyclic reconstruction process enhances self-disentangled reconstruction signals to facilitate model convergence. Extensive experiments on four benchmark datasets demonstrate that DSDRNet outperforms other popular methods in terms of domain generalization capabilities.
