PRECISe : Prototype-Reservation for Explainable Classification under Imbalanced and Scarce-Data Settings
Vaibhav Ganatra, Drishti Goel
TL;DR
The paper addresses medical image classification under severe data scarcity and class imbalance while demanding faithful explanations for clinical trust. It introduces PRECISe, an explainable-by-design architecture that combines an autoencoder, a prototype-metric layer, and class-reserved prototypes to generate faithful, human-interpretable explanations via distances to prototypes in latent space. Across two imbalanced datasets (Pneumonia chest X-ray and BUSI ultrasound), PRECISe achieves state-of-the-art data-efficient performance (e.g., 92.04% and 88.75% accuracy) and maintains high minority-class accuracy with very small training sets, while providing tangible prototype-based explanations. The work demonstrates that prototype-based explanations can be produced without post-hoc methods and highlights potential clinical impact in settings with limited labeled data.
Abstract
Deep learning models used for medical image classification tasks are often constrained by the limited amount of training data along with severe class imbalance. Despite these problems, models should be explainable to enable human trust in the models' decisions to ensure wider adoption in high-risk situations. In this paper, we propose PRECISe, an explainable-by-design model meticulously constructed to concurrently address all three challenges. Evaluation on 2 imbalanced medical image datasets reveals that PRECISe outperforms the current state-of-the-art methods on data efficient generalization to minority classes, achieving an accuracy of ~87% in detecting pneumonia in chest x-rays upon training on <60 images only. Additionally, a case study is presented to highlight the model's ability to produce easily interpretable predictions, reinforcing its practical utility and reliability for medical imaging tasks.
