Adapting to Distribution Shift by Visual Domain Prompt Generation
Zhixiang Chi, Li Gu, Tao Zhong, Huan Liu, Yuanhao Yu, Konstantinos N Plataniotis, Yang Wang
TL;DR
This work addresses the challenge of distribution shift under Few-Shot Test-Time Domain Adaptation by building adaptation on top of frozen foundation-model features. It introduces Visual Domain Prompt Generator (VDPG), which learns a shared knowledge bank across source domains and generates a domain-specific prompt conditioned on a small unlabeled target sample batch; a domain guidance module then fuses this prompt with the foundation model features to direct domain-aware predictions. The approach uses episodic meta-learning and a domain-aware contrastive loss to extract transferable domain knowledge while keeping the backbone fixed and enabling on-device, gradient-free adaptation. Empirical results on 5 large-scale benchmarks, including WILDS and DomainNet, show state-of-the-art performance and strong robustness to distribution shifts, with improved efficiency over finetuning-based methods. The work demonstrates practical, scalable domain specialization for foundation models in real-world deployment scenarios with limited target data.
Abstract
In this paper, we aim to adapt a model at test-time using a few unlabeled data to address distribution shifts. To tackle the challenges of extracting domain knowledge from a limited amount of data, it is crucial to utilize correlated information from pre-trained backbones and source domains. Previous studies fail to utilize recent foundation models with strong out-of-distribution generalization. Additionally, domain-centric designs are not flavored in their works. Furthermore, they employ the process of modelling source domains and the process of learning to adapt independently into disjoint training stages. In this work, we propose an approach on top of the pre-computed features of the foundation model. Specifically, we build a knowledge bank to learn the transferable knowledge from source domains. Conditioned on few-shot target data, we introduce a domain prompt generator to condense the knowledge bank into a domain-specific prompt. The domain prompt then directs the visual features towards a particular domain via a guidance module. Moreover, we propose a domain-aware contrastive loss and employ meta-learning to facilitate domain knowledge extraction. Extensive experiments are conducted to validate the domain knowledge extraction. The proposed method outperforms previous work on 5 large-scale benchmarks including WILDS and DomainNet.
