XctDiff: Reconstruction of CT Images with Consistent Anatomical Structures from a Single Radiographic Projection Image
Qingze Bai, Tiange Liu, Zhi Liu, Yubing Tong, Drew Torigian, Jayaram Udupa
TL;DR
XctDiff tackles single-view CT reconstruction by learning robust 3D priors from 2D radiographs and guiding diffusion-based CT generation in a latent space. It introduces a 3D perceptual compression model, a progressive semantic encoder for 3D priors, and a prior-guided diffusion model with cross-attention, aided by a homogeneous spatial codebook. Trained on DRR-generated radiographs and adapted to real radiographs through style transfer, it achieves state-of-the-art PSNR and SSIM on the LIDC-IDRI dataset while preserving anatomical consistency and reducing blur. The approach also shows promise for self-supervised pretraining in medical image analysis and downstream tasks, highlighting practical impact for data-limited clinical contexts.
Abstract
In this paper, we present XctDiff, an algorithm framework for reconstructing CT from a single radiograph, which decomposes the reconstruction process into two easily controllable tasks: feature extraction and CT reconstruction. Specifically, we first design a progressive feature extraction strategy that is able to extract robust 3D priors from radiographs. Then, we use the extracted prior information to guide the CT reconstruction in the latent space. Moreover, we design a homogeneous spatial codebook to improve the reconstruction quality further. The experimental results show that our proposed method achieves state-of-the-art reconstruction performance and overcomes the blurring issue. We also apply XctDiff on self-supervised pre-training task. The effectiveness indicates that it has promising additional applications in medical image analysis. The code is available at:https://github.com/qingze-bai/XctDiff
