Conformal Semantic Image Segmentation: Post-hoc Quantification of Predictive Uncertainty
Luca Mossina, Joseba Dalmau, Léo andéol
TL;DR
This paper tackles uncertainty quantification for semantic image segmentation by introducing a post-hoc, model-agnostic conformal prediction framework. It constructs multi-labeled pixel-wise prediction sets parameterized by $\lambda$ and selects an optimal $\hat{\lambda}$ using Conformal Risk Control to bound an expected loss $\mathbb{E}[\ell(\mathcal{C}_{\hat{\lambda}}(X),Y)] \le \alpha$, ensuring ground-truth coverage with a finite-sample guarantee. Uncertainty visualization is provided via varisco heatmaps, which depict per-pixel label inclusion and are validated on Cityscapes, ADE20K, and LoveDA with a lightweight, scalable approach. The work yields practical, interpretable uncertainty diagnostics that are compatible with any segmentation predictor that outputs per-pixel softmax scores, enabling safer deployment and potential extensions to panoptic segmentation and real-time data streams.
Abstract
We propose a post-hoc, computationally lightweight method to quantify predictive uncertainty in semantic image segmentation. Our approach uses conformal prediction to generate statistically valid prediction sets that are guaranteed to include the ground-truth segmentation mask at a predefined confidence level. We introduce a novel visualization technique of conformalized predictions based on heatmaps, and provide metrics to assess their empirical validity. We demonstrate the effectiveness of our approach on well-known benchmark datasets and image segmentation prediction models, and conclude with practical insights.
