DUKAE: DUal-level Knowledge Accumulation and Ensemble for Pre-Trained Model-Based Continual Learning

Songze Li; Tonghua Su; Xu-Yao Zhang; Qixing Xu; Zhongjie Wang

DUKAE: DUal-level Knowledge Accumulation and Ensemble for Pre-Trained Model-Based Continual Learning

Songze Li, Tonghua Su, Xu-Yao Zhang, Qixing Xu, Zhongjie Wang

TL;DR

This paper addresses catastrophic forgetting in pre-trained model-based continual learning by introducing DUKAE, which combines dual-level knowledge accumulation and an adaptive expertise ensemble. Feature-level accumulation builds task-specific PEFT modules with SSL to enrich representations, while decision-level accumulation aligns subspace classifiers via Gaussian distributions and updates them across tasks. An adaptive ensemble then integrates outputs from multiple subspaces to exploit domain-specific expertise and mitigate inter-subspace interference, achieving state-of-the-art results on CIFAR-100, ImageNet-R, CUB-200, and Cars-196. The approach offers practical benefits for rapid knowledge integration in PTMs, with potential extensions to federated continual learning, albeit with storage considerations for distribution parameters across many subspaces.

Abstract

Pre-trained model-based continual learning (PTMCL) has garnered growing attention, as it enables more rapid acquisition of new knowledge by leveraging the extensive foundational understanding inherent in pre-trained model (PTM). Most existing PTMCL methods use Parameter-Efficient Fine-Tuning (PEFT) to learn new knowledge while consolidating existing memory. However, they often face some challenges. A major challenge lies in the misalignment of classification heads, as the classification head of each task is trained within a distinct feature space, leading to inconsistent decision boundaries across tasks and, consequently, increased forgetting. Another critical limitation stems from the restricted feature-level knowledge accumulation, with feature learning typically restricted to the initial task only, which constrains the model's representation capabilities. To address these issues, we propose a method named DUal-level Knowledge Accumulation and Ensemble (DUKAE) that leverages both feature-level and decision-level knowledge accumulation by aligning classification heads into a unified feature space through Gaussian distribution sampling and introducing an adaptive expertise ensemble to fuse knowledge across feature subspaces. Extensive experiments on CIFAR-100, ImageNet-R, CUB-200, and Cars-196 datasets demonstrate the superior performance of our approach.

DUKAE: DUal-level Knowledge Accumulation and Ensemble for Pre-Trained Model-Based Continual Learning

TL;DR

Abstract

DUKAE: DUal-level Knowledge Accumulation and Ensemble for Pre-Trained Model-Based Continual Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)