Re-DiffiNet: Modeling discrepancies in tumor segmentation using diffusion models
Tianyi Ren, Abhishek Sharma, Juampablo Heras Rivera, Harshitha Rebala, Ethan Honey, Agamdeep Chopra, Jacob Ruzevick, Mehmet Kurt
TL;DR
Glioblastoma tumor margin segmentation requires precise boundary delineation, yet contemporary models struggle with boundary fidelity. Re-DiffiNet introduces a discrepancy-focused diffusion approach: a baseline U-Net predicts segmentation masks, while a conditional diffusion model estimates the discrepancy between those predictions and the ground truth, with Δx_0 = |U(I) − x_0| and Δ̂x_0 guiding refinement to ̂x_0 = |U(I) − Δ̂x_0|. On the BraTS 2023 dataset with 5-fold cross-validation, this discrepancy modeling achieves a 16.28% reduction in HD95 and a 0.55% improvement in Dice compared to the baseline, outperforming direct diffusion-based segmentation. The results highlight that diffusion models excel at refining boundary details when used to model and correct prediction discrepancies, offering a practical boost for boundary-sensitive clinical decision-making and potential generalization to other tumor types.
Abstract
Identification of tumor margins is essential for surgical decision-making for glioblastoma patients and provides reliable assistance for neurosurgeons. Despite improvements in deep learning architectures for tumor segmentation over the years, creating a fully autonomous system suitable for clinical floors remains a formidable challenge because the model predictions have not yet reached the desired level of accuracy and generalizability for clinical applications. Generative modeling techniques have seen significant improvements in recent times. Specifically, Generative Adversarial Networks (GANs) and Denoising-diffusion-based models (DDPMs) have been used to generate higher-quality images with fewer artifacts and finer attributes. In this work, we introduce a framework called Re-Diffinet for modeling the discrepancy between the outputs of a segmentation model like U-Net and the ground truth, using DDPMs. By explicitly modeling the discrepancy, the results show an average improvement of 0.55\% in the Dice score and 16.28\% in HD95 from cross-validation over 5-folds, compared to the state-of-the-art U-Net segmentation model.
