DVG-Diffusion: Dual-View Guided Diffusion Model for CT Reconstruction from X-Rays
Xing Xie, Jiawei Liu, Huijie Fan, Zhi Han, Yandong Tang, Liangqiong Qu
TL;DR
DVG-Diffusion tackles few-view CT reconstruction by introducing a dual-view diffusion framework that jointly leverages a real input X-ray and a synthesized additional view. A view-parameter guided encoder (VPGE) back-projects X-rays into a CT-aligned latent space, enabling 3D-3D latent-domain learning for CT reconstruction. The method synthesizes a new X-ray view and uses dual-view latent features as diffusion conditions to refine the CT latent representation, which is decoded to a CT volume; results show state-of-the-art performance with a favorable fidelity-perceptual trade-off. Extensive ablations and analyses reveal the roles of VPGE and new-view guidance, and the approach demonstrates robustness across view counts and shapes of angular distributions, with clinical data showing improved boundary and pathology delineation.
Abstract
Directly reconstructing 3D CT volume from few-view 2D X-rays using an end-to-end deep learning network is a challenging task, as X-ray images are merely projection views of the 3D CT volume. In this work, we facilitate complex 2D X-ray image to 3D CT mapping by incorporating new view synthesis, and reduce the learning difficulty through view-guided feature alignment. Specifically, we propose a dual-view guided diffusion model (DVG-Diffusion), which couples a real input X-ray view and a synthesized new X-ray view to jointly guide CT reconstruction. First, a novel view parameter-guided encoder captures features from X-rays that are spatially aligned with CT. Next, we concatenate the extracted dual-view features as conditions for the latent diffusion model to learn and refine the CT latent representation. Finally, the CT latent representation is decoded into a CT volume in pixel space. By incorporating view parameter guided encoding and dual-view guided CT reconstruction, our DVG-Diffusion can achieve an effective balance between high fidelity and perceptual quality for CT reconstruction. Experimental results demonstrate our method outperforms state-of-the-art methods. Based on experiments, the comprehensive analysis and discussions for views and reconstruction are also presented.
