SS-CTML: Self-Supervised Cross-Task Mutual Learning for CT Image Reconstruction
Gaofeng Chen, Yaoduo Zhang, Li Huang, Pengfei Wang, Wenyu Zhang, Dong Zeng, Jianhua Ma, Ji He
TL;DR
SS-CTML introduces a self-supervised cross-task mutual learning framework for CT image reconstruction that simultaneously improves three related tasks—FVCT, SVCT, and LVCT—without requiring paired normal-dose data. The architecture combines prior neural modules, compensation mechanisms, and dual-domain networks to enforce consistency across sinogram and image domains, guided by mutual learning losses among priors and final reconstructions. Experiments on simulated Mayo data and real LDCT demonstrate promising improvements in artifact suppression and structural preservation, with ablations validating the contribution of each component. The work suggests a viable path toward clinical deployment of LDCT reconstruction methods and lays groundwork for a broader CT reconstruction foundation model.
Abstract
Supervised deep-learning (SDL) techniques with paired training datasets have been widely studied for X-ray computed tomography (CT) image reconstruction. However, due to the difficulties of obtaining paired training datasets in clinical routine, the SDL methods are still away from common uses in clinical practices. In recent years, self-supervised deep-learning (SSDL) techniques have shown great potential for the studies of CT image reconstruction. In this work, we propose a self-supervised cross-task mutual learning (SS-CTML) framework for CT image reconstruction. Specifically, a sparse-view scanned and a limited-view scanned sinogram data are first extracted from a full-view scanned sinogram data, which results in three individual reconstruction tasks, i.e., the full-view CT (FVCT) reconstruction, the sparse-view CT (SVCT) reconstruction, and limited-view CT (LVCT) reconstruction. Then, three neural networks are constructed for the three reconstruction tasks. Considering that the ultimate goals of the three tasks are all to reconstruct high-quality CT images, we therefore construct a set of cross-task mutual learning objectives for the three tasks, in which way, the three neural networks can be self-supervised optimized by learning from each other. Clinical datasets are adopted to evaluate the effectiveness of the proposed framework. Experimental results demonstrate that the SS-CTML framework can obtain promising CT image reconstruction performance in terms of both quantitative and qualitative measurements.
