Retrospective Feature Estimation for Continual Learning
Nghia D. Nguyen, Hieu Trung Nguyen, Ang Li, Hoang Pham, Viet Anh Nguyen, Khoa D. Doan
TL;DR
This paper tackles catastrophic forgetting in continual learning by introducing Retrospective Feature Estimation (RFE), a mechanism that uses a chain of lightweight retrospector modules to map current features $f_t(\boldsymbol{x})$ back toward past representations $f_{t-1}(\boldsymbol{x})$, enabling backward rectification of learned knowledge. The retrospector training relies on a latent-estimation loss $\mathcal{L}_{FE}$ and supports three data strategies (RFE, RFE-P, RFE-B) to balance privacy and performance, while keeping changes to the main task learning minimal. Empirically, RFE and its variants achieve competitive or superior performance compared to strong rehearsal-based baselines on standard CL benchmarks (S-CIFAR10, S-CIFAR100, S-TinyImg), with particular gains on long task sequences and in TIL/CIL scenarios. The approach is data-efficient, potentially data-free, and integrates into existing CL pipelines by adding lightweight retrospector modules that operate post-training, offering a principled alternative to traditional replay or architectural expansion methods.
Abstract
The intrinsic capability to continuously learn a changing data stream is a desideratum of deep neural networks (DNNs). However, current DNNs suffer from catastrophic forgetting, which interferes with remembering past knowledge. To mitigate this issue, existing Continual Learning (CL) approaches often retain exemplars for replay, regularize learning, or allocate dedicated capacity for new tasks. This paper investigates an unexplored direction for CL called Retrospective Feature Estimation (RFE). RFE learns to reverse feature changes by aligning the features from the current trained DNN backward to the feature space of the old task, where performing predictions is easier. This retrospective process utilizes a chain of small feature mapping networks called retrospector modules. Empirical experiments on several CL benchmarks, including CIFAR10, CIFAR100, and Tiny ImageNet, demonstrate the effectiveness and potential of this novel CL direction compared to existing representative CL methods, motivating further research into retrospective mechanisms as a principled alternative for mitigating catastrophic forgetting in CL. Code is available at: https://github.com/mail-research/retrospective-feature-estimation.
