Patch Triplet Similarity Purification for Guided Real-World Low-Dose CT Image Denoising
Junhao Long, Fengwei Yang, Juncheng Yan, Baoping Zhang, Chao Jin, Jian Yang, Changliang Zou, Jun Xu
TL;DR
This work tackles real-world low-dose CT denoising by leveraging clean non-contrast CT (NCCT) images as guidance and addressing spatial misalignment through a patch-level Patch Triplet Similarity Purification (PTSP) strategy. By replacing vanilla self-attention with cross-attention in SwinIR and HAT and training on highly similar LDCT/NDCT/NCCT patch triplets, the approach achieves superior denoising performance on both synthetic and real-world LDCT datasets, outperforming 15 baselines. Ablation studies confirm the value of NCCT guidance and the PTSP data selection, with best results at a patch size of 64 and PTSP threshold $s=0.85$, using $n=3$ segmentation points. The method demonstrates robust preservation of anatomical structures while reducing noise, suggesting meaningful clinical impact for safer LDCT imaging and potential data/code public release for reproducibility and broader adoption.
Abstract
Image denoising of low-dose computed tomography (LDCT) is an important problem for clinical diagnosis with reduced radiation exposure. Previous methods are mostly trained with pairs of synthetic or misaligned LDCT and normal-dose CT (NDCT) images. However, trained with synthetic noise or misaligned LDCT/NDCT image pairs, the denoising networks would suffer from blurry structure or motion artifacts. Since non-contrast CT (NCCT) images share the content characteristics to the corresponding NDCT images in a three-phase scan, they can potentially provide useful information for real-world LDCT image denoising. To exploit this aspect, in this paper, we propose to incorporate clean NCCT images as useful guidance for the learning of real-world LDCT image denoising networks. To alleviate the issue of spatial misalignment in training data, we design a new Patch Triplet Similarity Purification (PTSP) strategy to select highly similar patch (instead of image) triplets of LDCT, NDCT, and NCCT images for network training. Furthermore, we modify two image denoising transformers of SwinIR and HAT to accommodate the NCCT image guidance, by replacing vanilla self-attention with cross-attention. On our collected clinical dataset, the modified transformers trained with the data selected by our PTSP strategy show better performance than 15 comparison methods on real-world LDCT image denoising. Ablation studies validate the effectiveness of our NCCT image guidance and PTSP strategy. We will publicly release our data and code.
