Unsupervised Robust Domain Adaptation: Paradigm, Theory and Algorithm

Fuxiang Huang; Xiaowei Fu; Shiyu Ye; Lina Ma; Wen Li; Xinbo Gao; David Zhang; Lei Zhang

Unsupervised Robust Domain Adaptation: Paradigm, Theory and Algorithm

Fuxiang Huang, Xiaowei Fu, Shiyu Ye, Lina Ma, Wen Li, Xinbo Gao, David Zhang, Lei Zhang

TL;DR

The paper addresses robustness of unsupervised domain adaptation (UDA) under adversarial perturbations by identifying entanglement between transfer learning and adversarial training in the UDA+VAT setup. It introduces unsupervised robust domain adaptation (URDA) with a formal generalization bound based on an ideal target classifier $h_t^*$ and proposes a practical two-step algorithm, Disentangled Adversarial Robustness Training (DART), to disentangle transfer and robustness training. Empirical results across Office-31, Office-Home, VisDA-2017, DomainNet, and Amazon Reviews show that DART substantially improves adversarial robustness while preserving clean transfer performance, outperforming standard UDA, AT-based defenses, and other robust UDA methods. The work provides a theoretically grounded, easy-to-implement firewall for UDA against attacks, with implications for deploying robust cross-domain models in real-world settings. However, it assumes knowledge of the attack family and leaves open questions on universal robustness to agnostic attacks.

Abstract

Unsupervised domain adaptation (UDA) aims to transfer knowledge from a label-rich source domain to an unlabeled target domain by addressing domain shifts. Most UDA approaches emphasize transfer ability, but often overlook robustness against adversarial attacks. Although vanilla adversarial training (VAT) improves the robustness of deep neural networks, it has little effect on UDA. This paper focuses on answering three key questions: 1) Why does VAT, known for its defensive effectiveness, fail in the UDA paradigm? 2) What is the generalization bound theory under attacks and how does it evolve from classical UDA theory? 3) How can we implement a robustification training procedure without complex modifications? Specifically, we explore and reveal the inherent entanglement challenge in general UDA+VAT paradigm, and propose an unsupervised robust domain adaptation (URDA) paradigm. We further derive the generalization bound theory of the URDA paradigm so that it can resist adversarial noise and domain shift. To the best of our knowledge, this is the first time to establish the URDA paradigm and theory. We further introduce a simple, novel yet effective URDA algorithm called Disentangled Adversarial Robustness Training (DART), a two-step training procedure that ensures both transferability and robustness. DART first pre-trains an arbitrary UDA model, and then applies an instantaneous robustification post-training step via disentangled distillation.Experiments on four benchmark datasets with/without attacks show that DART effectively enhances robustness while maintaining domain adaptability, and validate the URDA paradigm and theory.

Unsupervised Robust Domain Adaptation: Paradigm, Theory and Algorithm

TL;DR

Abstract

Unsupervised Robust Domain Adaptation: Paradigm, Theory and Algorithm

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (9)

Theorems & Definitions (6)