EndoIR: Degradation-Agnostic All-in-One Endoscopic Image Restoration via Noise-Aware Routing Diffusion
Tong Chen, Xinyu Ma, Long Bai, Wenyang Wang, Yue Sun, Luping Zhou
TL;DR
EndoIR addresses the restoration of endoscopic images under multiple, unknown degradations by learning degradation-aware prompts from both spatial and frequency domains and by disentangling content from corruption cues through a dual-stream diffusion framework. Its key innovations—Dual-Domain Prompter, Task Adaptive Embedding, Dual-Stream Encoder, Rectified Fusion Block, and Noise-Aware Routing Block—enable efficient, degradation-agnostic denoising with improved preservation of anatomical details. The approach achieves state-of-the-art results on SegSTRONG-C and CEC, with fewer parameters and favorable downstream segmentation performance, highlighting potential clinical utility. This work demonstrates a practical path toward robust, real-time endoscopic image restoration that supports accurate diagnosis and surgical guidance in challenging visual conditions.
Abstract
Endoscopic images often suffer from diverse and co-occurring degradations such as low lighting, smoke, and bleeding, which obscure critical clinical details. Existing restoration methods are typically task-specific and often require prior knowledge of the degradation type, limiting their robustness in real-world clinical use. We propose EndoIR, an all-in-one, degradation-agnostic diffusion-based framework that restores multiple degradation types using a single model. EndoIR introduces a Dual-Domain Prompter that extracts joint spatial-frequency features, coupled with an adaptive embedding that encodes both shared and task-specific cues as conditioning for denoising. To mitigate feature confusion in conventional concatenation-based conditioning, we design a Dual-Stream Diffusion architecture that processes clean and degraded inputs separately, with a Rectified Fusion Block integrating them in a structured, degradation-aware manner. Furthermore, Noise-Aware Routing Block improves efficiency by dynamically selecting only noise-relevant features during denoising. Experiments on SegSTRONG-C and CEC datasets demonstrate that EndoIR achieves state-of-the-art performance across multiple degradation scenarios while using fewer parameters than strong baselines, and downstream segmentation experiments confirm its clinical utility.
