BiasPruner: Debiased Continual Learning for Medical Image Classification
Nourhan Bayasi, Jamil Fayyad, Alceu Bissoto, Ghassan Hamarneh, Rafeef Garbi
TL;DR
BiasPruner addresses bias in medical image continual learning by intentionally forgetting spurious correlations to improve generalization and fairness. It introduces a bias score per network unit, $\mathcal{S}^t_{c,n}$, computed from easy (biased) and hard (unbiased) samples under $\mathcal{L}_{\mathrm{GCE}}$, and constructs per-task debiased subnetworks by pruning high-bias units with pruning ratio $\gamma$. Knowledge transfer is performed by pruning the full network to create a new task subnetwork that includes prior debiased subnetworks; previous subnetworks are frozen while new units learn. In experiments on three medical datasets (FITZ, HAM, NIH), BiasPruner consistently outperforms state-of-the-art continual learning methods in both accuracy and fairness without requiring dataset bias annotations, demonstrating practical impact for debiased continual learning in medicine.
Abstract
Continual Learning (CL) is crucial for enabling networks to dynamically adapt as they learn new tasks sequentially, accommodating new data and classes without catastrophic forgetting. Diverging from conventional perspectives on CL, our paper introduces a new perspective wherein forgetting could actually benefit the sequential learning paradigm. Specifically, we present BiasPruner, a CL framework that intentionally forgets spurious correlations in the training data that could lead to shortcut learning. Utilizing a new bias score that measures the contribution of each unit in the network to learning spurious features, BiasPruner prunes those units with the highest bias scores to form a debiased subnetwork preserved for a given task. As BiasPruner learns a new task, it constructs a new debiased subnetwork, potentially incorporating units from previous subnetworks, which improves adaptation and performance on the new task. During inference, BiasPruner employs a simple task-agnostic approach to select the best debiased subnetwork for predictions. We conduct experiments on three medical datasets for skin lesion classification and chest X-Ray classification and demonstrate that BiasPruner consistently outperforms SOTA CL methods in terms of classification performance and fairness. Our code is available here.
