Stein Discrepancy for Unsupervised Domain Adaptation
Anneke von Seeger, Dongmian Zou, Gilad Lerman
TL;DR
This work introduces Stein discrepancy-based unsupervised domain adaptation (UDA) to address scenarios with scarce unlabeled target data. It develops both kernelized and adversarial forms of the transfer loss and supports target-density modeling via Gaussian, Gaussian mixtures, or VAEs, enabling score-function-based alignment without relying on abundant target labels. Theoretical contributions include a generalization bound on the target error and a convergence rate for the empirical Stein discrepancy in two-sample settings. Empirically, the method yields robust improvements over baselines in scarce-target benchmarks (Office31, Office-Home, VisDA-2017), with notable gains when combined with FixMatch or SPA, especially under very limited target data.
Abstract
Unsupervised domain adaptation (UDA) aims to improve model performance on an unlabeled target domain using a related, labeled source domain. A common approach aligns source and target feature distributions by minimizing a distance between them, often using symmetric measures such as maximum mean discrepancy (MMD). However, these methods struggle when target data is scarce. We propose a novel UDA framework that leverages Stein discrepancy, an asymmetric measure that depends on the target distribution only through its score function, making it particularly suitable for low-data target regimes. Our proposed method has kernelized and adversarial forms and supports flexible modeling of the target distribution via Gaussian, GMM, or VAE models. We derive a generalization bound on the target error and a convergence rate for the empirical Stein discrepancy in the two-sample setting. Empirically, our method consistently outperforms prior UDA approaches under limited target data across multiple benchmarks.
