Ambiguity-aware Truncated Flow Matching for Ambiguous Medical Image Segmentation
Fanding Li, Xiangyu Li, Xianghe Su, Xingyu Qiu, Suyu Dong, Wei Wang, Kuanquan Wang, Gongning Luo, Shuo Li
TL;DR
This work tackles ambiguous medical image segmentation by addressing the persistent trade-off between accuracy and diversity in predictions. It introduces Ambiguity-aware Truncated Flow Matching (ATFM), a TDPM-inspired framework with three innovations: Data-Hierarchical Inference to separate distribution-level accuracy from sample-level diversity, Gaussian Truncation Representation for faithful truncation modeling, and Segmentation Flow Matching to enforce semantic plausibility during flow-based prediction. Across LIDC-IDRI and ISIC3, ATFM yields superior GED, HM-IoU, and MDM scores while maintaining efficient inference. The results establish ATFM as a practical, high-performance AMIS solution with strong theoretical and architectural grounding.
Abstract
A simultaneous enhancement of accuracy and diversity of predictions remains a challenge in ambiguous medical image segmentation (AMIS) due to the inherent trade-offs. While truncated diffusion probabilistic models (TDPMs) hold strong potential with a paradigm optimization, existing TDPMs suffer from entangled accuracy and diversity of predictions with insufficient fidelity and plausibility. To address the aforementioned challenges, we propose Ambiguity-aware Truncated Flow Matching (ATFM), which introduces a novel inference paradigm and dedicated model components. Firstly, we propose Data-Hierarchical Inference, a redefinition of AMIS-specific inference paradigm, which enhances accuracy and diversity at data-distribution and data-sample level, respectively, for an effective disentanglement. Secondly, Gaussian Truncation Representation (GTR) is introduced to enhance both fidelity of predictions and reliability of truncation distribution, by explicitly modeling it as a Gaussian distribution at $T_{\text{trunc}}$ instead of using sampling-based approximations. Thirdly, Segmentation Flow Matching (SFM) is proposed to enhance the plausibility of diverse predictions by extending semantic-aware flow transformation in Flow Matching (FM). Comprehensive evaluations on LIDC and ISIC3 datasets demonstrate that ATFM outperforms SOTA methods and simultaneously achieves a more efficient inference. ATFM improves GED and HM-IoU by up to $12\%$ and $7.3\%$ compared to advanced methods.
