Relation U-Net
Sheng He, Rina Bao, P. Ellen Grant, Yangming Ou
TL;DR
Relation U-Net addresses the need for per-image confidence in medical image segmentation without ground-truth by introducing a two-input, four-output network that learns both per-image segmentations and their pairwise relations. It outputs two segmentation maps ($\hat{s}_1, \hat{s}_2$) and two relation maps ($\hat{r}^p, \hat{r}^c$), with a confidence score $\mathcal{C}$ defined as the Dice-based discrepancy between $\hat{r}^p$ and $\hat{r}^c$. Across LiTS, Hippocampus, BraTS, and ISIC, Relation U-Net improves Dice accuracy over vanilla U-Net and MC-Dropout baselines, and $\mathcal{C}$ correlates with segmentation performance, enabling ranking of test samples by difficulty without ground-truth. The approach supports robust, interpretable segmentation in clinical workflows by identifying and prioritizing challenging cases for expert review, while providing improved per-image predictions. Overall, the method demonstrates how pairwise relations and an explicit confidence signal can enhance segmentation accuracy and practical trustworthiness in medical imaging.
Abstract
Towards clinical interpretations, this paper presents a new ''output-with-confidence'' segmentation neural network with multiple input images and multiple output segmentation maps and their pairwise relations. A confidence score of the test image without ground-truth can be estimated from the difference among the estimated relation maps. We evaluate the method based on the widely used vanilla U-Net for segmentation and our new model is named Relation U-Net which can output segmentation maps of the input images as well as an estimated confidence score of the test image without ground-truth. Experimental results on four public datasets show that Relation U-Net can not only provide better accuracy than vanilla U-Net but also estimate a confidence score which is linearly correlated to the segmentation accuracy on test images.
