Continual Domain Adversarial Adaptation via Double-Head Discriminators
Yan Shen, Zhanghexuan Ji, Chunwei Ma, Mingchen Gao
TL;DR
The paper tackles continual unsupervised domain adaptation with limited access to past source data, a setting in which the $\mathcal{H}$-divergence cannot be estimated reliably from a small memory buffer. It introduces a double-head discriminator that pairs a pre-trained, frozen source-only discriminator with a discriminator trained during the target phase; their ensemble provides a more accurate domain-discrepancy signal for learning domain-invariant features. Theoretical analysis links the ensemble to a population-level $\mathcal{H}\Delta\mathcal{H}$ bound and gives finite-sample generalization guarantees. Experiments on benchmarks such as the MNIST-family and Office datasets show consistent improvements in target adaptation and reduced forgetting, with further gains from combining the method with SSL or KD. Overall, the method advances continual UDA by reducing empirical estimation error and improving robustness to domain shift under memory constraints, and it may extend to source-free settings.
Abstract
Domain adversarial adaptation in a continual setting is challenging because access to data from previous source domains is limited. Despite extensive research in continual learning, adversarial adaptation cannot be accomplished effectively with only a small number of stored source-domain samples, which is the standard setting in memory replay approaches. This limitation arises because the empirical estimate of the $\mathcal{H}$-divergence is inaccurate when few source-domain samples are available. To tackle this problem, we propose a double-head discriminator algorithm that introduces an additional source-only domain discriminator, trained solely during the source learning phase. We prove that introducing this pre-trained source-only domain discriminator reduces the source-side empirical estimation error of the $\mathcal{H}$-divergence-related adversarial loss. Experiments on existing domain adaptation benchmarks show that our proposed algorithm achieves more than 2$\%$ improvement on all categories of target domain adaptation tasks while significantly mitigating forgetting on the source domain.
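The estimation problem described above can be illustrated with a small simulation. This is only a sketch under assumptions that do not come from the paper: one-dimensional Gaussian "features", a single hypothetical threshold discriminator, and invented function names. It measures the per-hypothesis discrepancy $|\Pr_T[h=1] - \Pr_S[h=1]|$ for one fixed $h$, not the supremum over a hypothesis class that defines the $\mathcal{H}$-divergence, and it does not implement the double-head algorithm. It shows only that the estimate fluctuates far more when the source side is a small replay buffer.

```python
import random
import statistics


def discriminator(x, threshold=0.5):
    """Hypothetical fixed domain discriminator: 1.0 means 'looks like target'."""
    return 1.0 if x > threshold else 0.0


def empirical_discrepancy(source, target):
    """|mean h(target) - mean h(source)| for the single fixed h above."""
    h_source = sum(discriminator(x) for x in source) / len(source)
    h_target = sum(discriminator(x) for x in target) / len(target)
    return abs(h_target - h_source)


def estimate_spread(n_source, n_target=1000, trials=100, seed=0):
    """Std. dev. of the empirical discrepancy across repeated random draws."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(trials):
        # Assumed toy domains: source ~ N(0, 1), target ~ N(1, 1).
        source = [rng.gauss(0.0, 1.0) for _ in range(n_source)]
        target = [rng.gauss(1.0, 1.0) for _ in range(n_target)]
        estimates.append(empirical_discrepancy(source, target))
    return statistics.pstdev(estimates)


# A 10-sample replay buffer versus full access to 1000 source samples.
small_buffer_spread = estimate_spread(n_source=10)
large_buffer_spread = estimate_spread(n_source=1000)
print("spread with 10 source samples:  ", small_buffer_spread)
print("spread with 1000 source samples:", large_buffer_spread)
```

In this toy setting the noise in the estimate is dominated by the source-side sample mean, consistent with the abstract's claim that the error originates on the source-domain side when the buffer is small.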
