Probabilistic Modeling of Multi-rater Medical Image Segmentation for Diversity and Personalization

Ke Liu; Shangde Gao; Yichao Fu; Shangqi Gao; Chunhua Shen

TL;DR

ProSeg introduces a probabilistic model with two latent variables for multi-rater medical image segmentation, designed to achieve diversity and personalization simultaneously. It uses tau to capture annotator preferences and Z to model boundary ambiguity; both are learned via variational inference, so that sampling from them yields segmentations that are diverse and aligned with individual experts. On the NPC and LIDC-IDRI datasets, ProSeg achieves state-of-the-art performance on both diversity and personalization metrics, outperforming generation-oriented and personalization-oriented baselines. An ablation study confirms that both latent spaces are necessary, supporting the practicality of a unified probabilistic framework for broader medical image segmentation tasks.

Abstract

Medical image segmentation is inherently influenced by data uncertainty, which arises from ambiguous boundaries in medical scans and from inter-observer variability in diagnosis. To address this challenge, previous works formulated the multi-rater medical image segmentation task, in which multiple experts provide separate annotations for each image. However, existing models are typically constrained either to generate diverse segmentations that lack expert specificity or to produce personalized outputs that merely replicate individual annotators. We propose Probabilistic modeling of multi-rater medical image Segmentation (ProSeg), which enables diversification and personalization simultaneously. Specifically, we introduce two latent variables to model expert annotation preferences and image boundary ambiguity. Their conditional probability distributions are then obtained through variational inference, allowing segmentation outputs to be generated by sampling from these distributions. Extensive experiments on both the nasopharyngeal carcinoma dataset (NPC) and the lung nodule dataset (LIDC-IDRI) demonstrate that ProSeg achieves new state-of-the-art performance, providing segmentation results that are both diverse and expert-personalized. Code is available at https://github.com/AI4MOL/ProSeg.
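
The two-latent-variable formulation described above can be illustrated with a generic variational lower bound. This is only a minimal sketch under assumed notation, not the paper's actual objective: X denotes the image and Y an expert annotation, tau is the annotator-preference latent and Z the boundary-ambiguity latent named in the TL;DR, while the prior p, the approximate posterior q, and their conditioning sets are assumptions introduced here for illustration.

```latex
% Generic evidence lower bound (ELBO) with two latent variables.
% Assumed for illustration: q(\tau, Z | X, Y) is the approximate posterior,
% p(\tau, Z | X) the prior, and p(Y | X, \tau, Z) the segmentation likelihood.
\log p(Y \mid X)
  \;\ge\;
  \mathbb{E}_{q(\tau, Z \mid X, Y)}\bigl[\log p(Y \mid X, \tau, Z)\bigr]
  \;-\;
  \mathrm{KL}\bigl(q(\tau, Z \mid X, Y)\,\big\|\,p(\tau, Z \mid X)\bigr)
```

Under this sketch, a segmentation would be generated by drawing tau and Z and then decoding p(Y | X, tau, Z). How ProSeg actually factorizes and conditions these distributions is specified in the paper itself.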
