Table of Contents
Fetching ...

Towards Plastic and Stable Exemplar-Free Incremental Learning: A Dual-Learner Framework with Cumulative Parameter Averaging

Wenju Sun, Qingyong Li, Wen Wang, Yangli-ao Geng

TL;DR

A Dual-Learner framework with Cumulative Parameter Averaging (DLCPA), which employs a dual-learner design: a plastic learner focused on acquiring new-task knowledge and a stable learner responsible for accumulating all learned knowledge.

Abstract

The dilemma between plasticity and stability presents a significant challenge in Incremental Learning (IL), especially in the exemplar-free scenario where accessing old-task samples is strictly prohibited during the learning of a new task. A straightforward solution to this issue is learning and storing an independent model for each task, known as Single Task Learning (STL). Despite the linear growth in model storage with the number of tasks in STL, we empirically discover that averaging these model parameters can potentially preserve knowledge across all tasks. Inspired by this observation, we propose a Dual-Learner framework with Cumulative Parameter Averaging (DLCPA). DLCPA employs a dual-learner design: a plastic learner focused on acquiring new-task knowledge and a stable learner responsible for accumulating all learned knowledge. The knowledge from the plastic learner is transferred to the stable learner via cumulative parameter averaging. Additionally, several task-specific classifiers work in cooperation with the stable learner to yield the final prediction. Specifically, when learning a new task, these modules are updated in a cyclic manner: i) the plastic learner is initially optimized using a self-supervised loss besides the supervised loss to enhance the feature extraction robustness; ii) the stable learner is then updated with respect to the plastic learner in a cumulative parameter averaging manner to maintain its task-wise generalization; iii) the task-specific classifier is accordingly optimized to align with the stable learner. Experimental results on CIFAR-100 and Tiny-ImageNet show that DLCPA outperforms several state-of-the-art exemplar-free baselines in both Task-IL and Class-IL settings.

Towards Plastic and Stable Exemplar-Free Incremental Learning: A Dual-Learner Framework with Cumulative Parameter Averaging

TL;DR

A Dual-Learner framework with Cumulative Parameter Averaging (DLCPA), which employs a dual-learner design: a plastic learner focused on acquiring new-task knowledge and a stable learner responsible for accumulating all learned knowledge.

Abstract

The dilemma between plasticity and stability presents a significant challenge in Incremental Learning (IL), especially in the exemplar-free scenario where accessing old-task samples is strictly prohibited during the learning of a new task. A straightforward solution to this issue is learning and storing an independent model for each task, known as Single Task Learning (STL). Despite the linear growth in model storage with the number of tasks in STL, we empirically discover that averaging these model parameters can potentially preserve knowledge across all tasks. Inspired by this observation, we propose a Dual-Learner framework with Cumulative Parameter Averaging (DLCPA). DLCPA employs a dual-learner design: a plastic learner focused on acquiring new-task knowledge and a stable learner responsible for accumulating all learned knowledge. The knowledge from the plastic learner is transferred to the stable learner via cumulative parameter averaging. Additionally, several task-specific classifiers work in cooperation with the stable learner to yield the final prediction. Specifically, when learning a new task, these modules are updated in a cyclic manner: i) the plastic learner is initially optimized using a self-supervised loss besides the supervised loss to enhance the feature extraction robustness; ii) the stable learner is then updated with respect to the plastic learner in a cumulative parameter averaging manner to maintain its task-wise generalization; iii) the task-specific classifier is accordingly optimized to align with the stable learner. Experimental results on CIFAR-100 and Tiny-ImageNet show that DLCPA outperforms several state-of-the-art exemplar-free baselines in both Task-IL and Class-IL settings.
Paper Structure (34 sections, 14 equations, 7 figures, 10 tables, 2 algorithms)

This paper contains 34 sections, 14 equations, 7 figures, 10 tables, 2 algorithms.

Figures (7)

  • Figure 1: Diagram of Single Task Learning (STL) and its variation. (a) STL represents an upper-bound IL model that trains a specific network for each task. (b) STL-me is an STL variation that averages all STL feature extractors in the parameter space.
  • Figure 2: Diagram of task $t$ training processes of DLCPA.
  • Figure 3: Incremental learning results (ACC %) of DLCPA with MoCoV2 under various $\lambda$ (weighting the supervised loss, see Eq. \ref{['equation:online_loss']}) on CIFAR-100 (a) and Tiny-ImageNet (b).
  • Figure 4: Effectiveness evaluation of various feature extractors on 10-split CIFAR100. The heatmap presents the linear probe accuracy (%) on the dataset of each task by using various feature extractors. These include individual task feature extractors, STL (i.e., choosing the best extractor for every single task), and STL-me (using the mean extractor).
  • Figure 5: Cross-entropy validation loss surface for task 1 and task 2 on CIFAR-100. STL is updated with supervised loss (left) in comparison to with both supervised and self-supervised loss (right).
  • ...and 2 more figures

Theorems & Definitions (1)

  • proof