D-PerceptCT: Deep Perceptual Enhancement for Low-Dose CT Images
Taifour Yousra Nabila, Azeddine Beghdadi, Marie Luong, Zuheng Ming, Habib Zaidi, Faouzi Alaya Cheikh
TL;DR
D-PerceptCT addresses the challenge of preserving perceptually salient information in LDCT images while reducing noise. It integrates a Visual Dual-path Extractor (ViDex) that fuses semantic priors from DINOv2 with local detail processing, and a Deep Visual State Space (DV2SM) backbone with Global-Local State Space Blocks to capture multiscale, long-range features. A novel Deep Perceptual Relevancy Loss Function (DPRLF), informed by human contrast sensitivity, guides training toward perceptually important features. Evaluations on Mayo2016 show competitive traditional metrics and superior perceptual quality against state-of-the-art methods, highlighting the approach's potential for radiologist-facing LDCT enhancements and clinical impact.
Abstract
Low Dose Computed Tomography (LDCT) is widely used as an imaging solution to aid diagnosis and other clinical tasks. However, this comes at the price of a deterioration in image quality due to the low dose of radiation used to reduce the risk of secondary cancer development. While some efficient methods have been proposed to enhance LDCT quality, many overestimate noise and perform excessive smoothing, leading to a loss of critical details. In this paper, we introduce D-PerceptCT, a novel architecture inspired by key principles of the Human Visual System (HVS) to enhance LDCT images. The objective is to guide the model to enhance or preserve perceptually relevant features, thereby providing radiologists with CT images where critical anatomical structures and fine pathological details are perceptu- ally visible. D-PerceptCT consists of two main blocks: 1) a Visual Dual-path Extractor (ViDex), which integrates semantic priors from a pretrained DINOv2 model with local spatial features, allowing the network to incorporate semantic-awareness during enhancement; (2) a Global-Local State-Space block that captures long-range information and multiscale features to preserve the important structures and fine details for diagnosis. In addition, we propose a novel deep perceptual loss, designated as the Deep Perceptual Relevancy Loss Function (DPRLF), which is inspired by human contrast sensitivity, to further emphasize perceptually important features. Extensive experiments on the Mayo2016 dataset demonstrate the effectiveness of D-PerceptCT method for LDCT enhancement, showing better preservation of structural and textural information within LDCT images compared to SOTA methods.
