Enhancing Consistency and Mitigating Bias: A Data Replay Approach for Incremental Learning
Chenyang Wang, Junjun Jiang, Xingyu Hu, Xianming Liu, Xiangyang Ji
TL;DR
This paper tackles catastrophic forgetting in class-incremental learning under data privacy constraints, focusing on data-free replay through model inversion. It introduces CwD, a Consistency-enhanced data replay framework with a Debiased classifier. CwD adds a data consistency enhancement (DCE) loss that aligns the inverted and real data distributions under a tied multivariate Gaussian assumption, and a weight alignment regularization (WAR) that balances class weights during training. An extra estimation stage collects old-task statistics to improve future inversions, forming a three-stage pipeline (Inversion, Training, Estimation). Across CIFAR-100, Tiny-ImageNet, and ImageNet-100, CwD consistently improves last-task accuracy and average performance over prior data-free baselines, and it can also boost non-data-free baselines when combined with them. The work also provides ablations, debiasing comparisons, and overhead analyses.
Abstract
Deep learning systems are prone to catastrophic forgetting when learning from a sequence of tasks, as old data from previous tasks is unavailable when learning a new task. To address this, some methods replay data from previous tasks during new-task learning, typically using extra memory to store the replay data. However, storing old data is often impractical due to memory constraints and data privacy issues. Data-free replay methods instead invert samples from the classification model. While effective, these methods suffer from inconsistencies between the inverted and real training data, an issue overlooked in recent work. To address it, we propose to measure data consistency quantitatively under some simplifying assumptions. This measurement gives us the insight to develop a novel loss function that reduces the inconsistency. Specifically, the loss minimizes the KL divergence between the distributions of inverted and real data under a tied multivariate Gaussian assumption, which is simple to implement in continual learning. Additionally, we observe that the weight norms of old classes decrease continually as learning progresses. We analyze the reasons and propose a regularization term that balances class weights, making old-class samples more distinguishable. Together, these components form Consistency-enhanced data replay with a Debiased classifier for class incremental learning (CwD). Extensive experiments on CIFAR-100, Tiny-ImageNet, and ImageNet-100 show that CwD consistently improves performance over previous approaches.
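To make the tied-Gaussian idea concrete: when two multivariate Gaussians share one covariance matrix, the trace and log-determinant terms of the KL divergence cancel, leaving half the squared Mahalanobis distance between the two means. The sketch below computes that quantity with NumPy. It is an illustration under stated assumptions, not the paper's DCE loss: the function name `tied_gaussian_kl`, the pooled covariance estimate, and the `eps` ridge term are all hypothetical choices, and both feature sets are passed in directly, whereas a data-free method would rely on stored statistics for the real data.

```python
import numpy as np


def tied_gaussian_kl(feats_inverted, feats_real, eps=1e-6):
    """KL(N(mu_inv, S) || N(mu_real, S)) for one shared (tied) covariance S.

    Hypothetical helper for illustration only; not the paper's implementation.
    Inputs are arrays of shape (num_samples, feature_dim).
    """
    x_inv = np.asarray(feats_inverted, dtype=np.float64)
    x_real = np.asarray(feats_real, dtype=np.float64)

    mu_inv = x_inv.mean(axis=0)
    mu_real = x_real.mean(axis=0)

    # Assumption: estimate the tied covariance by pooling both centered sets.
    centered = np.concatenate([x_inv - mu_inv, x_real - mu_real], axis=0)
    dof = max(centered.shape[0] - 2, 1)
    sigma = centered.T @ centered / dof

    # Assumption: a small ridge term keeps the linear solve well conditioned.
    sigma = sigma + eps * np.eye(sigma.shape[0])

    # With equal covariances, KL reduces to 0.5 * diff^T Sigma^{-1} diff.
    diff = mu_inv - mu_real
    return 0.5 * float(diff @ np.linalg.solve(sigma, diff))
```

The value is zero when the two sets have the same mean and grows as the means drift apart relative to the shared covariance. Because the covariance is tied, the quantity is also symmetric in its two arguments, which is not true of KL divergence in general.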
