No Forgetting Learning: Memory-free Continual Learning
Mohammad Ali Vahedifar, Qi Zhang
TL;DR
The paper tackles catastrophic forgetting in continual learning when past data are not stored. It introduces No Forgetting Learning (NFL), a memory-free distillation-based framework, and NFL+ with an auto-encoder to preserve features and address bias. NFL achieves competitive performance on Task-IL and Class-IL benchmarks while using roughly 14.75× less memory than memory-based methods, and it introduces the Plasticity-Stability Ratio to better quantify trade-offs. These results advocate memory-free CL as a scalable, privacy-friendly alternative and highlight the need for fair benchmarking that accounts for memory and computation.
Abstract
Continual Learning (CL) remains a central challenge in deep learning, where models must sequentially acquire new knowledge while mitigating Catastrophic Forgetting (CF) of prior tasks. Existing approaches often struggle with efficiency and scalability, requiring extensive memory or model buffers. This work introduces ``No Forgetting Learning" (NFL), a memory-free CL framework that leverages knowledge distillation to maintain stability while preserving plasticity. Memory-free means the NFL does not rely on any memory buffer. Through extensive evaluations of three benchmark datasets, we demonstrate that NFL achieves competitive performance while utilizing approximately 14.75 times less memory than state-of-the-art methods. Furthermore, we introduce a new metric to better assess CL's plasticity-stability trade-off.
