Addressing Imbalance for Class Incremental Learning in Medical Image Classification
Xuze Hao, Wenqian Ni, Xuhao Jiang, Weimin Tan, Bo Yan
TL;DR
The paper tackles catastrophic forgetting in class incremental learning for medical image classification caused by data imbalance. It introduces two plug-in losses: the CIL-balanced classification loss $L_{cbc}$, which uses logit adjustment to mitigate bias toward old and minority classes, and the distribution margin loss $L_{dm}$, which pushes distributions of old and new classes apart while promoting intra-class compactness; both are combined with knowledge distillation in the overall objective $L_{all}$. The method defines category frequencies $r_c$, logit adjustments with $v_{y_i,j}$ and a scale factor $\\gamma_c$, and models class distributions with $\\hat{w}_c = w_c + \eta \\hat{r}_c$, where $\\hat{r}_c = q_c / \\sum q_i$, to formulate $L_{dm}$. Extensive experiments on CCH5000, HAM10000, and EyePACS show state-of-the-art performance in both average accuracy and forgetting across multiple incremental settings, highlighting improved resilience to imbalance in medical CIL with practical applicability.
Abstract
Deep convolutional neural networks have made significant breakthroughs in medical image classification, under the assumption that training samples from all classes are simultaneously available. However, in real-world medical scenarios, there's a common need to continuously learn about new diseases, leading to the emerging field of class incremental learning (CIL) in the medical domain. Typically, CIL suffers from catastrophic forgetting when trained on new classes. This phenomenon is mainly caused by the imbalance between old and new classes, and it becomes even more challenging with imbalanced medical datasets. In this work, we introduce two simple yet effective plug-in methods to mitigate the adverse effects of the imbalance. First, we propose a CIL-balanced classification loss to mitigate the classifier bias toward majority classes via logit adjustment. Second, we propose a distribution margin loss that not only alleviates the inter-class overlap in embedding space but also enforces the intra-class compactness. We evaluate the effectiveness of our method with extensive experiments on three benchmark datasets (CCH5000, HAM10000, and EyePACS). The results demonstrate that our approach outperforms state-of-the-art methods.
