Iterative Explainability for Weakly Supervised Segmentation in Medical PE Detection
Florin Condrea, Saikiran Rapaka, Marius Leordeanu
TL;DR
This work tackles the scarcity of fine-grained PE annotations by introducing iExplain, an iterative, weakly supervised framework that converts image-level PE labels into pixel-precise segmentation via explainability-driven pseudo-labels. The pipeline combines a slice-level classifier, iterative Integrated Gradients-based pseudo-label generation with hysteresis thresholding, and a DynUNET-based segmentation model, achieving strong localization while avoiding auxiliary models. On the RSPECT-augmented dataset, the method yields competitive results with strongly supervised approaches (e.g., F1 ≈ 71.6% in baseline, rising to 75.5% with limited human annotation for finetuning), demonstrating data-efficient learning for PE detection and localization. The approach offers a practical path to high-performance weakly supervised segmentation with broad potential for application in other medical imaging tasks and beyond.
Abstract
Pulmonary Embolism (PE) are a leading cause of cardiovascular death. Computed tomographic pulmonary angiography (CTPA) is the gold standard for PE diagnosis, with growing interest in AI-based diagnostic assistance. However, these algorithms are limited by scarce fine-grained annotations of thromboembolic burden. We address this challenge with iExplain, a weakly supervised learning algorithm that transforms coarse image-level annotations into detailed pixel-level PE masks through iterative model explainability. Our approach generates soft segmentation maps used to mask detected regions, enabling the process to repeat and discover additional embolisms that would be missed in a single pass. This iterative refinement effectively captures complete PE regions and detects multiple distinct embolisms. Models trained on these automatically generated annotations achieve excellent PE detection performance, with significant improvements at each iteration. We demonstrate iExplain's effectiveness on the RSPECT augmented dataset, achieving results comparable to strongly supervised methods while outperforming existing weakly supervised methods.
