A-IDE : Agent-Integrated Denoising Experts
Uihyun Cho, Namhun Kim
TL;DR
Low-Dose CT denoising is hampered by anatomy-specific HU distributions and limited paired data. The authors propose A-IDE, an agentic framework that uses BiomedCLIP to semantically characterize scans and a GPT-4o-based agent to route each image to one of three region-specialized RED-CNN denoisers. The approach employs clustering to create anatomy-aware experts and an automated, graph-based decision pipeline to select the appropriate denoiser. On Mayo-2016, A-IDE achieves superior PSNR and SSIM while maintaining robustness across anatomies, illustrating the practical value of intelligent orchestration for LDCT denoising.
Abstract
Recent advances in deep-learning based denoising methods have improved Low-Dose CT image quality. However, due to distinct HU distributions and diverse anatomical characteristics, a single model often struggles to generalize across multiple anatomies. To address this limitation, we introduce \textbf{Agent-Integrated Denoising Experts (A-IDE)} framework, which integrates three anatomical region-specialized RED-CNN models under the management of decision-making LLM agent. The agent analyzes semantic cues from BiomedCLIP to dynamically route incoming LDCT scans to the most appropriate expert model. We highlight three major advantages of our approach. A-IDE excels in heterogeneous, data-scarce environments. The framework automatically prevents overfitting by distributing tasks among multiple experts. Finally, our LLM-driven agentic pipeline eliminates the need for manual interventions. Experimental evaluations on the Mayo-2016 dataset confirm that A-IDE achieves superior performance in RMSE, PSNR, and SSIM compared to a single unified denoiser.
