BioAtt: Anatomical Prior Driven Low-Dose CT Denoising
Namhun Kim, UiHyun Cho
TL;DR
Low-Dose CT denoising often sacrifices anatomical detail when using purely data-driven methods. BioAtt addresses this by injecting organ-aware priors derived from BiomedCLIP into the spatial attention mechanism to preserve clinically relevant structures while suppressing noise. It demonstrates superior SSIM and competitive RMSE/PSNR against baselines and attention variants, supported by ablations and attention-map visualizations that confirm anatomy-guided improvements rather than increased model complexity. The approach establishes a new architectural paradigm for integrating semantic anatomical priors into LDCT denoising, with potential for further gains by combining segmentation priors and more diverse textual descriptors.
Abstract
Deep-learning-based denoising methods have significantly improved Low-Dose CT (LDCT) image quality. However, existing models often over-smooth important anatomical details due to their purely data-driven attention mechanisms. To address this challenge, we propose a novel LDCT denoising framework, BioAtt. The key innovation lies in attending anatomical prior distributions extracted from the pretrained vision-language model BiomedCLIP. These priors guide the denoising model to focus on anatomically relevant regions to suppress noise while preserving clinically relevant structures. We highlight three main contributions: BioAtt outperforms baseline and attention-based models in SSIM, PSNR, and RMSE across multiple anatomical regions. The framework introduces a new architectural paradigm by embedding anatomic priors directly into spatial attention. Finally, BioAtt attention maps provide visual confirmation that the improvements stem from anatomical guidance rather than increased model complexity.
