Gradually Vanishing Gap in Prototypical Network for Unsupervised Domain Adaptation
Shanshan Wang, Hao Zhou, Xun Yang, Zhenwei He, Mengzhu Wang, Xingyi Zhang, Meng Wang
TL;DR
This work tackles unsupervised domain adaptation under large domain gaps that can cause distribution collapse during alignment. It introduces Gradually Vanishing Gap in Prototypical Network (GVG-PN), which constructs two intermediate, domain-biased domains via a GCN to preserve global distribution structure while maintaining local manifold relations; prototypes are formed by aggregating cross-domain features. A novel ProNCE loss focuses on hard negative prototype pairs to enhance discriminability, complemented by InfoNCE preliminaries and a mutual information term. Theoretical analysis connects target risk to source risk via α-divergence, arguing that progressively aligning intermediate domains tightens bounds on target loss. Empirically, GVG-PN establishes new state-of-the-art results across five benchmarks (Office-31, ImageCLEF-DA, Office-Home, VisDA-2017, DomainNet), validating the effectiveness of domain-biased prototypes and prototype-level contrastive learning for robust, scalable DA in vision tasks.
Abstract
Unsupervised domain adaptation (UDA) is a critical problem for transfer learning, which aims to transfer the semantic information from labeled source domain to unlabeled target domain. Recent advancements in UDA models have demonstrated significant generalization capabilities on the target domain. However, the generalization boundary of UDA models remains unclear. When the domain discrepancy is too large, the model can not preserve the distribution structure, leading to distribution collapse during the alignment. To address this challenge, we propose an efficient UDA framework named Gradually Vanishing Gap in Prototypical Network (GVG-PN), which achieves transfer learning from both global and local perspectives. From the global alignment standpoint, our model generates a domain-biased intermediate domain that helps preserve the distribution structures. By entangling cross-domain features, our model progressively reduces the risk of distribution collapse. However, only relying on global alignment is insufficient to preserve the distribution structure. To further enhance the inner relationships of features, we introduce the local perspective. We utilize the graph convolutional network (GCN) as an intuitive method to explore the internal relationships between features, ensuring the preservation of manifold structures and generating domain-biased prototypes. Additionally, we consider the discriminability of the inner relationships between features. We propose a pro-contrastive loss to enhance the discriminability at the prototype level by separating hard negative pairs. By incorporating both GCN and the pro-contrastive loss, our model fully explores fine-grained semantic relationships. Experiments on several UDA benchmarks validated that the proposed GVG-PN can clearly outperform the SOTA models.
