Table of Contents
Fetching ...

DoseDiff: Distance-aware Diffusion Model for Dose Prediction in Radiotherapy

Yiwen Zhang, Chuanpu Li, Liming Zhong, Zeli Chen, Wei Yang, Xuetao Wang

TL;DR

DoseDiff addresses the challenge of accurate patient-specific dose prediction in radiotherapy by introducing a distance-aware conditional diffusion model that uses CT images and signed distance maps (SDMs) as conditions. It formulates dose generation as a reverse diffusion process and employs DDIM for accelerated inference, while MMFNet enables effective high- and low-level fusion of multimodal features with a transformer-based fusionFormer. The method demonstrates state-of-the-art performance on two in-house datasets and the OpenKBP public dataset, delivering improvements in dosimetric accuracy (e.g., MAE, SSIM, PSNR) and preserving ray-path realism in predicted dose maps. This approach has practical potential to streamline radiotherapy planning by providing accurate, distribution-consistent dose maps with realistic beam-path characteristics prior to planning hyper-parameter adjustments.

Abstract

Treatment planning, which is a critical component of the radiotherapy workflow, is typically carried out by a medical physicist in a time-consuming trial-and-error manner. Previous studies have proposed knowledge-based or deep-learning-based methods for predicting dose distribution maps to assist medical physicists in improving the efficiency of treatment planning. However, these dose prediction methods usually fail to effectively utilize distance information between surrounding tissues and targets or organs-at-risk (OARs). Moreover, they are poor at maintaining the distribution characteristics of ray paths in the predicted dose distribution maps, resulting in a loss of valuable information. In this paper, we propose a distance-aware diffusion model (DoseDiff) for precise prediction of dose distribution. We define dose prediction as a sequence of denoising steps, wherein the predicted dose distribution map is generated with the conditions of the computed tomography (CT) image and signed distance maps (SDMs). The SDMs are obtained by distance transformation from the masks of targets or OARs, which provide the distance from each pixel in the image to the outline of the targets or OARs. We further propose a multi-encoder and multi-scale fusion network (MMFNet) that incorporates multi-scale and transformer-based fusion modules to enhance information fusion between the CT image and SDMs at the feature level. We evaluate our model on two in-house datasets and a public dataset, respectively. The results demonstrate that our DoseDiff method outperforms state-of-the-art dose prediction methods in terms of both quantitative performance and visual quality.

DoseDiff: Distance-aware Diffusion Model for Dose Prediction in Radiotherapy

TL;DR

DoseDiff addresses the challenge of accurate patient-specific dose prediction in radiotherapy by introducing a distance-aware conditional diffusion model that uses CT images and signed distance maps (SDMs) as conditions. It formulates dose generation as a reverse diffusion process and employs DDIM for accelerated inference, while MMFNet enables effective high- and low-level fusion of multimodal features with a transformer-based fusionFormer. The method demonstrates state-of-the-art performance on two in-house datasets and the OpenKBP public dataset, delivering improvements in dosimetric accuracy (e.g., MAE, SSIM, PSNR) and preserving ray-path realism in predicted dose maps. This approach has practical potential to streamline radiotherapy planning by providing accurate, distribution-consistent dose maps with realistic beam-path characteristics prior to planning hyper-parameter adjustments.

Abstract

Treatment planning, which is a critical component of the radiotherapy workflow, is typically carried out by a medical physicist in a time-consuming trial-and-error manner. Previous studies have proposed knowledge-based or deep-learning-based methods for predicting dose distribution maps to assist medical physicists in improving the efficiency of treatment planning. However, these dose prediction methods usually fail to effectively utilize distance information between surrounding tissues and targets or organs-at-risk (OARs). Moreover, they are poor at maintaining the distribution characteristics of ray paths in the predicted dose distribution maps, resulting in a loss of valuable information. In this paper, we propose a distance-aware diffusion model (DoseDiff) for precise prediction of dose distribution. We define dose prediction as a sequence of denoising steps, wherein the predicted dose distribution map is generated with the conditions of the computed tomography (CT) image and signed distance maps (SDMs). The SDMs are obtained by distance transformation from the masks of targets or OARs, which provide the distance from each pixel in the image to the outline of the targets or OARs. We further propose a multi-encoder and multi-scale fusion network (MMFNet) that incorporates multi-scale and transformer-based fusion modules to enhance information fusion between the CT image and SDMs at the feature level. We evaluate our model on two in-house datasets and a public dataset, respectively. The results demonstrate that our DoseDiff method outperforms state-of-the-art dose prediction methods in terms of both quantitative performance and visual quality.
Paper Structure (19 sections, 16 equations, 11 figures, 6 tables)

This paper contains 19 sections, 16 equations, 11 figures, 6 tables.

Figures (11)

  • Figure 1: The overall workflow for DoseDiff.
  • Figure 2: Architecture of MMFNet for dose prediction.
  • Figure 3: Visual comparison of the mask, TSBDM, ISDM, and PSDM with respect to the ROIs of breast cancer.
  • Figure 4: Performance and inference time of different generation steps based on DDIM reverse process. Gray dotted lines indicate the metrics of DoseDiff without DDIM ($T=1000$).
  • Figure 5: Boxplots of dose prediction results using compared methods on the breast cancer dataset.
  • ...and 6 more figures