Learnable Total Variation with Lambda Mapping for Low-Dose CT Denoising
Yusuf Talha Basak, Mehmet Ozan Unal, Metin Ertas, Isa Yildirim
TL;DR
The paper tackles LDCT denoising by learning when and where to apply TV smoothing. It introduces Learnable Total Variation (LTV), which couples an unrolled primal--dual TV solver with a LambdaNet that predicts a per-pixel regularization map $\lambda$, enabling spatially adaptive denoising guided by both the physics-informed solver and learned priors. The end-to-end training uses a composite loss that combines image fidelity terms with priors on the $\lambda$-map to ensure spatial coherence, structure alignment, and distributional diversity of regularization. Empirically, LTV achieves substantial gains over classical TV and FBP+U-Net on LDCT data (e.g., up to $+2.9$ dB PSNR and $+6 ext{–}7 ext%$ SSIM improvement), while retaining interpretability and offering a principled path toward 3D and data-consistency–driven reconstruction.
Abstract
Although Total Variation (TV) performs well in noise reduction and edge preservation on images, its dependence on the lambda parameter limits its efficiency and makes it difficult to use effectively. In this study, we present a Learnable Total Variation (LTV) framework that couples an unrolled TV solver with a data-driven Lambda Mapping Network (LambdaNet) predicting a per-pixel regularization map. The pipeline is trained end-to-end so that reconstruction and regularization are optimized jointly, yielding spatially adaptive smoothing: strong in homogeneous regions, relaxed near anatomical boundaries. Experiments on the DeepLesion dataset, using a realistic noise model adapted from the LoDoPaB-CT methodology, show consistent gains over classical TV and FBP+U-Net: +2.9 dB PSNR and +6% SSIM on average. LTV provides an interpretable alternative to black-box CNNs and a basis for 3D and data-consistency-driven reconstruction.
