IDER: IDempotent Experience Replay for Reliable Continual Learning

Zhanwang Liu; Yuting Li; Haoyuan Gao; Yexin Li; Linghe Kong; Lichao Sun; Weiran Huang

IDER: IDempotent Experience Replay for Reliable Continual Learning

Zhanwang Liu, Yuting Li, Haoyuan Gao, Yexin Li, Linghe Kong, Lichao Sun, Weiran Huang

TL;DR

The proposed idempotent experience replay (IDER), a novel approach based on the idempotent property where repeated function applications yield the same output, consistently improves prediction reliability while simultaneously boosting accuracy and reducing forgetting.

Abstract

Catastrophic forgetting, the tendency of neural networks to forget previously learned knowledge when learning new tasks, has been a major challenge in continual learning (CL). To tackle this challenge, CL methods have been proposed and shown to reduce forgetting. Furthermore, CL models deployed in mission-critical settings can benefit from uncertainty awareness by calibrating their predictions to reliably assess their confidences. However, existing uncertainty-aware continual learning methods suffer from high computational overhead and incompatibility with mainstream replay methods. To address this, we propose idempotent experience replay (IDER), a novel approach based on the idempotent property where repeated function applications yield the same output. Specifically, we first adapt the training loss to make model idempotent on current data streams. In addition, we introduce an idempotence distillation loss. We feed the output of the current model back into the old checkpoint and then minimize the distance between this reprocessed output and the original output of the current model. This yields a simple and effective new baseline for building reliable continual learners, which can be seamlessly integrated with other CL approaches. Extensive experiments on different CL benchmarks demonstrate that IDER consistently improves prediction reliability while simultaneously boosting accuracy and reducing forgetting. Our results suggest the potential of idempotence as a promising principle for deploying efficient and trustworthy continual learning systems in real-world applications.Our code is available at https://github.com/YutingLi0606/Idempotent-Continual-Learning.

IDER: IDempotent Experience Replay for Reliable Continual Learning

TL;DR

Abstract

Paper Structure (28 sections, 14 equations, 9 figures, 10 tables)

This paper contains 28 sections, 14 equations, 9 figures, 10 tables.

Introduction
Related Work
Continual Learning.
Idempotence in Deep Learning.
Method
Problem Definition
Modified Architecture
Standard Idempotent Module:Training the Network Idempotent
Idempotent Distillation Module: Distilling The Network For Continual Learning
Overall Objective
Experiments
Results
Additional Analysis
Conclusion
Extended Discussion: Related Idempotence to Continual learning
...and 13 more sections

Figures (9)

Figure 1: We propose the IDER method, which can be directly applied to many recent rehearsal-based continual learning methods, resulting in less calibration error and significant improvements in FAA with less parameter growth compared with NPCL.
Figure 2: Modified Architecture. We modify the architecture of backbone(ResNet) and enable the model to accept two inputs.
Figure 3: We plot the distribution of idempotence errors, measured by the distance $|f(x, f(x,z)) - f(x,z)|$. Inputs $x$ with second incorrect prediction input $z$ exhibit significantly larger idempotence errors. Thus, this distance can be used as a idempotent distillation loss.
Figure 4: Probability of predicting each task at the end of training for models trained on CIFAR-10 with 500 buffer size. Idempotent distillation loss effectively mitigates the bias to the recent tasks and provides a more uniform probability size across different tasks.
Figure 5: Overall framework of Idempotent Experience Replay (IDER). Our method consists of two modules for continual learning: (1) Standard Idempotent Module that trains current model idempotent with data from the current task. (2) Idempotent Distillation Module that enforce the current model to become idempotent with respect to the last task model checkpoint, utilizing data from both the current task and buffer memory. IDER can be integrated into existing CL approaches to make reliable predictions while mitigate catastrophic forgetting.
...and 4 more figures

IDER: IDempotent Experience Replay for Reliable Continual Learning

TL;DR

Abstract

IDER: IDempotent Experience Replay for Reliable Continual Learning

Authors

TL;DR

Abstract

Table of Contents

Figures (9)