Weakly-supervised segmentation using inherently-explainable classification models and their application to brain tumour classification
Soumick Chatterjee, Hadya Yassin, Florian Dubost, Andreas Nürnberger, Oliver Speck
TL;DR
This work tackles the challenge of trustworthy, weakly-supervised brain tumour segmentation by introducing inherently explainable Gaussian Process-based CNN backbones (GP-UNet, GP-ShuffleUNet, GP-ReconResNet) that use a Global Pooling mechanism to generate localisation heatmaps driving classification. These heatmaps enable direct, interpretable segmentation from image-level labels, reducing annotation burden while maintaining competitive $F1$-score and near-equivalent diagnostic separability on two brain-tumour datasets, including BraTS 2020. The approach outperforms a strong interpretable baseline (MProtoNet) in tumour-specific segmentation and demonstrates robust handling of class imbalance via comprehensive ROC and PR analyses, with heatmaps aligning with occlusion and guided backpropagation interpretations. Limitations include 2D slice processing and longer training times, but the study highlights a promising path toward trustworthy, efficient clinical decision support, with future work spanning 3D extensions, federated learning, and integration with Vision-Language Models for radiological reporting.
Abstract
Deep learning has demonstrated significant potential in medical imaging; however, the opacity of "black-box" models hinders clinical trust, while segmentation tasks typically necessitate labourious, hard-to-obtain pixel-wise annotations. To address these challenges simultaneously, this paper introduces a framework for three inherently explainable classifiers (GP-UNet, GP-ShuffleUNet, and GP-ReconResNet). By integrating a global pooling mechanism, these networks generate localisation heatmaps that directly influence classification decisions, offering inherent interpretability without relying on potentially unreliable post-hoc methods. These heatmaps are subsequently thresholded to achieve weakly-supervised segmentation, requiring only image-level classification labels for training. Validated on two datasets for multi-class brain tumour classification, the proposed models achieved a peak F1-score of 0.93. For the weakly-supervised segmentation task, a median Dice score of 0.728 (95% CI 0.715-0.739) was recorded. Notably, on a subset of tumour-only images, the best model achieved an accuracy of 98.7%, outperforming state-of-the-art glioma grading binary classifiers. Furthermore, comparative Precision-Recall analysis validated the framework's robustness against severe class imbalance, establishing a direct correlation between diagnostic confidence and segmentation fidelity. These results demonstrate that the proposed framework successfully combines high diagnostic accuracy with essential transparency, offering a promising direction for trustworthy clinical decision support. Code is available on GitHub: https://github.com/soumickmj/GPModels
