Effortless Active Labeling for Long-Term Test-Time Adaptation
Guowei Wang, Changxing Ding
TL;DR
This work addresses the growing labeling burden in long-term test-time adaptation by introducing EATTA, which restricts annotation to at most one sample per batch. It identifies border samples between source and target domains via feature perturbations and uses their pseudo-labels to guide single-step learning, while a gradient-norm based debiasing mechanism balances supervised and unsupervised objectives with EMA refinement. The approach yields state-of-the-art results on ImageNet-C/R/K/A and PACS under both CTTA and FTTA settings, with significantly lower annotation costs than prior ATTA methods. The findings suggest that careful sample selection focused on learnability, combined with dynamic loss balancing, enables robust, efficient long-term adaptation in dynamic deployment scenarios.
Abstract
Long-term test-time adaptation (TTA) is a challenging task due to error accumulation. Recent approaches tackle this issue by actively labeling a small proportion of samples in each batch, yet the annotation burden quickly grows as the batch number increases. In this paper, we investigate how to achieve effortless active labeling so that a maximum of one sample is selected for annotation in each batch. First, we annotate the most valuable sample in each batch based on the single-step optimization perspective in the TTA context. In this scenario, the samples that border between the source- and target-domain data distributions are considered the most feasible for the model to learn in one iteration. Then, we introduce an efficient strategy to identify these samples using feature perturbation. Second, we discover that the gradient magnitudes produced by the annotated and unannotated samples have significant variations. Therefore, we propose balancing their impact on model optimization using two dynamic weights. Extensive experiments on the popular ImageNet-C, -R, -K, -A and PACS databases demonstrate that our approach consistently outperforms state-of-the-art methods with significantly lower annotation costs.
