Enhancing Cross-Modality Synthesis: Subvolume Merging for MRI-to-CT Conversion
Fuxin Fan, Jingna Qiu, Yixing Huang, Andreas Maier
TL;DR
This work tackles MRI-to-CT synthesis for radiotherapy planning by combining a SwinUNETR-based network with a novel 3D subvolume merging strategy during inference to reduce stitching artifacts and improve attenuation accuracy. The method uses $32\times96\times96$ subvolumes and a patch size of $2\times2\times2$ within a four-stage Swin Transformer–enabled encoder–decoder, with L1 loss and Adam optimization. Empirical results show the subvolume merging approach lowers MAE from $52.65\ \mathrm{HU}$ to $47.75\ \mathrm{HU}$ and that a gamma-weighted overlap with $\gamma=0.9$ yields optimal performance; an overlap of $50\%$–$70\%$ balances image quality and computational efficiency. The findings support broader applicability to other regression tasks requiring subvolume processing in medical imaging.
Abstract
Providing more precise tissue attenuation information, synthetic computed tomography (sCT) generated from magnetic resonance imaging (MRI) contributes to improved radiation therapy treatment planning. In our study, we employ the advanced SwinUNETR framework for synthesizing CT from MRI images. Additionally, we introduce a three-dimensional subvolume merging technique in the prediction process. By selecting an optimal overlap percentage for adjacent subvolumes, stitching artifacts are effectively mitigated, leading to a decrease in the mean absolute error (MAE) between sCT and the labels from 52.65 HU to 47.75 HU. Furthermore, implementing a weight function with a gamma value of 0.9 results in the lowest MAE within the same overlap area. By setting the overlap percentage between 50% and 70%, we achieve a balance between image quality and computational efficiency.
