Enhancing Generative Class Incremental Learning Performance with Model Forgetting Approach
Taro Togo, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama
TL;DR
This work tackles Generative Class Incremental Learning (GCIL) under streaming data with limited capacity by introducing a forgetting mechanism inspired by human memory. The method first applies Selective Amnesia (SA) to forget a designated class $c_f$, producing a forgetting model $M_f$ via Generative Replay, and then trains a final model $M_{final}$ to learn a new class $c_{new}$ from $D_{new}$ while leveraging retained information from $D_r$. It combines SA-based forgetting with Elastic Weight Consolidation (EWC) to balance the retention of old knowledge and acquisition of new information, leveraging a diagonal Fisher Information Matrix for efficiency. Across MNIST and Fashion-MNIST with a one-hot VAE base, the approach yields improved $c_{new}$ generation accuracy under forgetting in both EWC and fine-tuning regimes, and it reveals the role of class characteristics in the forgetting process, underscoring the practical impact for memory-constrained continual generative learning.
Abstract
This study presents a novel approach to Generative Class Incremental Learning (GCIL) by introducing the forgetting mechanism, aimed at dynamically managing class information for better adaptation to streaming data. GCIL is one of the hot topics in the field of computer vision, and this is considered one of the crucial tasks in society, specifically the continual learning of generative models. The ability to forget is a crucial brain function that facilitates continual learning by selectively discarding less relevant information for humans. However, in the field of machine learning models, the concept of intentionally forgetting has not been extensively investigated. In this study we aim to bridge this gap by incorporating the forgetting mechanisms into GCIL, thereby examining their impact on the models' ability to learn in continual learning. Through our experiments, we have found that integrating the forgetting mechanisms significantly enhances the models' performance in acquiring new knowledge, underscoring the positive role that strategic forgetting plays in the process of continual learning.
