Maintain Plasticity in Long-timescale Continual Test-time Adaptation
Yanshuo Wang, Xuesong Li, Jinguang Tong, Jie Hong, Jun Lan, Weiqiang Wang, Huijia Zhu, Haoxing Chen
TL;DR
This work addresses the problem of sustaining long-term adaptability in continual test-time adaptation (CTTA) by focusing on plasticity, the model's ability to learn from new distributions over time. It introduces Adaptive Shrink-Restore (ASR), a policy that adaptively reinitializes model weights based on the trajectory of label flips, using a shrink-and-restore scheme to preserve prior knowledge while injecting necessary plasticity. The authors demonstrate that plasticity loss correlates with label-flip fluctuations and show that ASR improves mean accuracy and robustness on CCC, CIN-C, and CIN-3DCC benchmarks, while being compatible with various CTTA methods. The proposed approach offers a practical, generalizable solution for sustaining continual adaptation in non-stationary test environments, with potential impact on deployments requiring long-running robustness.
Abstract
Continual test-time domain adaptation (CTTA) aims to adjust pre-trained source models to perform well over time across non-stationary target environments. While previous methods have made considerable efforts to optimize the adaptation process, a crucial question remains: can the model adapt to continually-changing environments with preserved plasticity over a long time? The plasticity refers to the model's capability to adjust predictions in response to non-stationary environments continually. In this work, we explore plasticity, this essential but often overlooked aspect of continual adaptation to facilitate more sustained adaptation in the long run. First, we observe that most CTTA methods experience a steady and consistent decline in plasticity during the long-timescale continual adaptation phase. Moreover, we find that the loss of plasticity is strongly associated with the change in label flip. Based on this correlation, we propose a simple yet effective policy, Adaptive Shrink-Restore (ASR), towards preserving the model's plasticity. In particular, ASR does the weight re-initialization by the adaptive intervals. The adaptive interval is determined based on the change in label flipping. Our method is validated on extensive CTTA benchmarks, achieving excellent performance.
