Deep Learning with Uncertainty Quantification for Predicting the Segmentation Dice Coefficient of Prostate Cancer Biopsy Images
Audrey Xie, Elhoucine Elfatimi, Sambuddha Ghosal, Pratik Shah
TL;DR
This work addresses the need for uncertainty-aware evaluation of deep learning-based histopathology segmentation in clinical settings by linking model uncertainty to segmentation performance. It uses Monte Carlo dropout and backpropagation to generate pixel-level uncertainties for prostate H&E biopsy images and introduces a region-specific uncertainty algorithm to compute ROI-based uncertainties. The study then trains linear models to predict Dice scores from ROI uncertainties (tumor, non-tumor, and non-tissue regions), demonstrating that ROI-based uncertainty provides accurate Dice predictions with low RMSE and stronger correlations than image-wide uncertainty. Results on the Gleason MICCAI 2019 dataset show Dice around $0.899$ and AUROC around $0.894$, with uncertainty maps highlighting boundaries where predictions are less reliable, thereby enabling more informed clinical decision-making. Overall, the approach advances reliable deployment of DLMs in pathology by offering region-aware performance estimation and practical uncertainty visualization.
Abstract
Deep learning models (DLMs) can achieve state-of-the-art performance in histopathology image segmentation and classification, but have limited deployment potential in real-world clinical settings. Uncertainty estimates of DLMs can increase trust by identifying predictions and images that need further review. Dice scores and coefficients (Dice) are benchmarks for evaluation of image segmentation performance, but are usually not evaluated with DLM uncertainty quantification. This study reports DLMs trained with uncertainty estimations, using randomly initialized weights and Monte Carlo dropout, to segment tumors from microscopic Hematoxylin and Eosin dye stained prostate core biopsy histology RGB images. Image-level maps showed significant correlation (Spearman's rank, p < 0.05) between overall and specific prostate tissue image sub-region uncertainties with model performance estimations by Dice. This study reports that linear models, which can predict Dice segmentation scores from multiple clinical sub-region-based uncertainties of prostate cancer, can serve as a more comprehensive performance evaluation metric without loss in predictive capability of DLMs, with a low root mean square error.
