CCSI: Continual Class-Specific Impression for Data-free Class Incremental Learning
Sana Ayromlou, Teresa Tsang, Purang Abolmaesumi, Xiaoxiao Li
TL;DR
This paper tackles data-free continual class incremental learning in medical imaging by introducing Continual Class-Specific Impression (CCSI), which synthesizes prior-class data via model inversion of a frozen classifier and mean-image initialization, guided by Continual Normalization (CN) statistics. A two-step pipeline first generates synthetic class impressions and then updates the model with new class data using three novel losses—intra-domain contrastive loss, margin loss, and cosine-normalized cross-entropy—along with a distillation term to mitigate forgetting. The approach yields substantial improvements over data-free baselines on MedMNIST benchmarks and a real Heart Echo dataset, with gains up to 51% in final-task accuracy and competitive performance relative to memory-based methods. The work highlights CN’s role in stabilizing data synthesis and continual training, offering a privacy-friendly avenue for deploying lifelong learning in clinical settings where storing prior patient data is restricted.
Abstract
In real-world clinical settings, traditional deep learning-based classification methods struggle with diagnosing newly introduced disease types because they require samples from all disease classes for offline training. Class incremental learning offers a promising solution by adapting a deep network trained on specific disease classes to handle new diseases. However, catastrophic forgetting occurs, decreasing the performance of earlier classes when adapting the model to new data. Prior proposed methodologies to overcome this require perpetual storage of previous samples, posing potential practical concerns regarding privacy and storage regulations in healthcare. To this end, we propose a novel data-free class incremental learning framework that utilizes data synthesis on learned classes instead of data storage from previous classes. Our key contributions include acquiring synthetic data known as Continual Class-Specific Impression (CCSI) for previously inaccessible trained classes and presenting a methodology to effectively utilize this data for updating networks when introducing new classes. We obtain CCSI by employing data inversion over gradients of the trained classification model on previous classes starting from the mean image of each class inspired by common landmarks shared among medical images and utilizing continual normalization layers statistics as a regularizer in this pixel-wise optimization process. Subsequently, we update the network by combining the synthesized data with new class data and incorporate several losses, including an intra-domain contrastive loss to generalize the deep network trained on the synthesized data to real data, a margin loss to increase separation among previous classes and new ones, and a cosine-normalized cross-entropy loss to alleviate the adverse effects of imbalanced distributions in training data.
