Online Distillation with Continual Learning for Cyclic Domain Shifts
Joachim Houyon, Anthony Cioppa, Yasir Ghunaim, Motasem Alfarra, Anaïs Halin, Maxim Henry, Bernard Ghanem, Marc Van Droogenbroeck
TL;DR
The paper tackles catastrophic forgetting in online distillation under cyclic domain shifts by integrating continual learning (both replay-based and regularization-based) into the online distillation pipeline. It defines a cyclic online continual learning setting and evaluates it on long, untrimmed video streams where a fast student learns from a slow, accurate teacher via pseudo-labels. The authors show that replay-based methods, particularly MIR and MIR+RWalk, substantially mitigate forgetting and improve both backward and forward transfer, while some regularizers can hinder online adaptation. Overall, the approach enhances real-time perception robustness for applications like autonomous driving and video surveillance, representing a significant step toward practical online continual learning for cyclical domain changes.
Abstract
In recent years, online distillation has emerged as a powerful technique for adapting real-time deep neural networks on the fly using a slow, but accurate teacher model. However, a major challenge in online distillation is catastrophic forgetting when the domain shifts, which occurs when the student model is updated with data from the new domain and forgets previously learned knowledge. In this paper, we propose a solution to this issue by leveraging the power of continual learning methods to reduce the impact of domain shifts. Specifically, we integrate several state-of-the-art continual learning methods in the context of online distillation and demonstrate their effectiveness in reducing catastrophic forgetting. Furthermore, we provide a detailed analysis of our proposed solution in the case of cyclic domain shifts. Our experimental results demonstrate the efficacy of our approach in improving the robustness and accuracy of online distillation, with potential applications in domains such as video surveillance or autonomous driving. Overall, our work represents an important step forward in the field of online distillation and continual learning, with the potential to significantly impact real-world applications.
