Diffusion Models for Low-Light Image Enhancement: A Multi-Perspective Taxonomy and Performance Analysis
Eashan Adhikarla, Yixin Liu, Brian D. Davison
TL;DR
This survey addresses low-light image enhancement through diffusion models, proposing a six-perspective taxonomy (Intrinsic Decomposition, Spectral & Latent, Accelerated, Guided, Multimodal, Autonomous) that maps methods to physical priors, conditioning, and efficiency. It provides a comprehensive performance comparison against GAN and Transformer baselines, analyzes qualitative and quantitative failure modes, and discusses deployment constraints and ethical considerations. Key insights include an expanding efficiency-fidelity frontier driven by latent and spectral diffusion, a shift toward controllable and task-aware enhancement, and the rising relevance of foundation-model guidance for LLIE. The practical impact lies in guiding the next generation of diffusion-based LLIE toward real-time, on-device, and robust cross-domain applications while acknowledging data scarcity, interpretability, and responsible AI concerns.
Abstract
Low-light image enhancement (LLIE) is vital for safety-critical applications such as surveillance, autonomous navigation, and medical imaging, where visibility degradation can impair downstream task performance. Recently, diffusion models have emerged as a promising generative paradigm for LLIE due to their capacity to model complex image distributions via iterative denoising. This survey provides an up-to-date critical analysis of diffusion models for LLIE, distinctively featuring an in-depth comparative performance evaluation against Generative Adversarial Network and Transformer-based state-of-the-art methods, a thorough examination of practical deployment challenges, and a forward-looking perspective on the role of emerging paradigms like foundation models. We propose a multi-perspective taxonomy encompassing six categories: Intrinsic Decomposition, Spectral & Latent, Accelerated, Guided, Multimodal, and Autonomous; that map enhancement methods across physical priors, conditioning schemes, and computational efficiency. Our taxonomy is grounded in a hybrid view of both the model mechanism and the conditioning signals. We evaluate qualitative failure modes, benchmark inconsistencies, and trade-offs between interpretability, generalization, and inference efficiency. We also discuss real-world deployment constraints (e.g., memory, energy use) and ethical considerations. This survey aims to guide the next generation of diffusion-based LLIE research by highlighting trends and surfacing open research questions, including novel conditioning, real-time adaptation, and the potential of foundation models.
