Test-Time Adaptation with Binary Feedback
Taeckyung Lee, Sorn Chottananurak, Junsu Kim, Jinwoo Shin, Taesik Gong, Sung-Ju Lee
TL;DR
This work tackles the problem of deep models degrading under domain shifts by introducing Test-Time Adaptation with Binary Feedback (TTA-BF). It proposes BiTTA, a dual-path reinforcement-learning framework that combines Binary Feedback-guided Adaptation (BFA) on uncertain samples with Agreement-Based self-Adaptation (ABA) on confident ones, using MC-dropout to estimate uncertainty and guide sample selection. The method optimizes a joint objective via policy gradients and memory-based updates, achieving substantial gains (up to 13.3 percentage points) over state-of-the-art TTA baselines while requiring only a small amount of binary feedback. BiTTA demonstrates robust performance under severe distribution shifts with minimal labeling effort, highlighting the practical value of sparse human feedback for real-time adaptation in dynamic environments.
Abstract
Deep learning models perform poorly when domain shifts exist between training and test data. Test-time adaptation (TTA) is a paradigm to mitigate this issue by adapting pre-trained models using only unlabeled test samples. However, existing TTA methods can fail under severe domain shifts, while recent active TTA approaches requiring full-class labels are impractical due to high labeling costs. To address this issue, we introduce a new setting of TTA with binary feedback. This setting uses a few binary feedback inputs from annotators to indicate whether model predictions are correct, thereby significantly reducing the labeling burden of annotators. Under the setting, we propose BiTTA, a novel dual-path optimization framework that leverages reinforcement learning to balance binary feedback-guided adaptation on uncertain samples with agreement-based self-adaptation on confident predictions. Experiments show BiTTA achieves 13.3%p accuracy improvements over state-of-the-art baselines, demonstrating its effectiveness in handling severe distribution shifts with minimal labeling effort. The source code is available at https://github.com/taeckyung/BiTTA.
