COCA: Classifier-Oriented Calibration via Textual Prototype for Source-Free Universal Domain Adaptation
Xinghong Liu, Yi Zhou, Tao Zhou, Chun-Mei Feng, Ling Shao
TL;DR
The paper tackles source-free universal domain adaptation by shifting focus from adapting image encoders to calibrating the classifier of a vision-language model–powered few-shot learner using textual prototypes. It introduces ACTP to generate positive textual prototypes and negative image prototypes for self-training, and MIECI to enhance contextual mutual information via a masked-image pathway and an EMA teacher. The resulting COCA framework demonstrates state-of-the-art performance across OPDA, OSDA, and PDA on OfficeHome, VisDA-2017, and DomainNet while requiring only few-shot source data. This approach highlights that VLMs encode cross-domain knowledge, enabling robust adaptation with reduced labeling costs and broad applicability to zero-shot and few-shot settings.
Abstract
Universal domain adaptation (UniDA) aims to address domain and category shifts across data sources. Recently, due to more stringent data restrictions, researchers have introduced source-free UniDA (SF-UniDA). SF-UniDA methods eliminate the need for direct access to source samples when performing adaptation to the target domain. However, existing SF-UniDA methods still require an extensive quantity of labeled source samples to train a source model, resulting in significant labeling costs. To tackle this issue, we present a novel plug-and-play classifier-oriented calibration (COCA) method. COCA, which exploits textual prototypes, is designed for the source models based on few-shot learning with vision-language models (VLMs). It endows the VLM-powered few-shot learners, which are built for closed-set classification, with the unknown-aware ability to distinguish common and unknown classes in the SF-UniDA scenario. Crucially, COCA is a new paradigm to tackle SF-UniDA challenges based on VLMs, which focuses on classifier instead of image encoder optimization. Experiments show that COCA outperforms state-of-the-art UniDA and SF-UniDA models.
