Controllable Continual Test-Time Adaptation
Ziqi Shi, Fan Lyu, Ye Liu, Fanhua Shang, Fuyuan Hu, Wei Feng, Zhang Zhang, Liang Wang
TL;DR
The paper addresses continual test-time adaptation (CTTA), in which a model faces a sequence of unlabeled domain shifts, by guiding those shifts rather than suppressing them. It introduces C-CoTTA, which represents shift directions with prototype-based Concept Activation Vectors (CAVs) and applies domain-level (CDS) and class-level (CCS) shift controls on top of a mean-teacher framework trained with a symmetric cross-entropy loss. Key contributions include a prototype-based shift representation, two explicit controllability losses, and extensive experiments on CIFAR10-C, CIFAR100-C, and ImageNet-C that show improved robustness, less encroachment between categories, and favorable long-term behavior. By preserving class boundaries and reducing sensitivity to domain transformations, the approach is relevant to robust long-term deployment in dynamic environments.
Abstract
Continual Test-Time Adaptation (CTTA) is an emerging and challenging task where a model trained in a source domain must adapt to continuously changing conditions during testing, without access to the original source data. CTTA is prone to error accumulation due to uncontrollable domain shifts, leading to blurred decision boundaries between categories. Existing CTTA methods primarily focus on suppressing domain shifts, which proves inadequate during the unsupervised test phase. In contrast, we introduce a novel approach that guides rather than suppresses these shifts. Specifically, we propose $\textbf{C}$ontrollable $\textbf{Co}$ntinual $\textbf{T}$est-$\textbf{T}$ime $\textbf{A}$daptation (C-CoTTA), which explicitly prevents any single category from encroaching on others, thereby mitigating the mutual influence between categories caused by uncontrollable shifts. Moreover, our method reduces the sensitivity of the model to domain transformations, thereby minimizing the magnitude of category shifts. Extensive quantitative experiments demonstrate the effectiveness of our method, while qualitative analyses, such as t-SNE plots, confirm the theoretical validity of our approach.
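The building blocks named in the TL;DR (a mean-teacher framework, a symmetric cross-entropy loss, and prototypes used to describe shift directions) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The function names `symmetric_cross_entropy`, `ema_update`, `class_prototypes`, and `shift_direction` are hypothetical, the momentum value is an assumed default, and treating a shift direction as the normalized difference between two prototypes is only one plausible reading of the prototype-based representation. The CDS and CCS losses are not reproduced because their form is not given in this text.

```python
import numpy as np

def softmax(logits):
    # Numerically stable row-wise softmax.
    z = logits - logits.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def symmetric_cross_entropy(student_logits, teacher_logits, eps=1e-8):
    # Sum of the cross-entropy in both directions between student and
    # teacher predictions, averaged over the batch.
    p = softmax(student_logits)
    q = softmax(teacher_logits)
    ce_teacher_to_student = -(q * np.log(p + eps)).sum(axis=1)
    ce_student_to_teacher = -(p * np.log(q + eps)).sum(axis=1)
    return float((ce_teacher_to_student + ce_student_to_teacher).mean())

def ema_update(teacher, student, momentum=0.999):
    # Mean-teacher step: the teacher is an exponential moving average
    # of the student. Parameters are dicts of name -> array (assumed layout).
    return {k: momentum * teacher[k] + (1.0 - momentum) * student[k]
            for k in teacher}

def class_prototypes(features, pseudo_labels, num_classes):
    # Mean feature vector per pseudo-labelled class; classes with no
    # samples in the batch keep a zero prototype.
    protos = np.zeros((num_classes, features.shape[1]))
    for c in range(num_classes):
        mask = pseudo_labels == c
        if mask.any():
            protos[c] = features[mask].mean(axis=0)
    return protos

def shift_direction(proto_from, proto_to, eps=1e-8):
    # Assumption: a shift direction is the unit vector pointing from one
    # prototype to another.
    d = proto_to - proto_from
    return d / (np.linalg.norm(d) + eps)
```

In a test-time loop under these assumptions, the student would be updated with the symmetric loss against the teacher's predictions, the teacher refreshed with `ema_update`, and prototypes recomputed per batch from pseudo-labels to track how each class moves across domains.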
