Pyramid Diffusion Models For Low-light Image Enhancement
Dewei Zhou, Zongxin Yang, Yi Yang
TL;DR
This work introduces PyDiff, a diffusion-model-based approach for low-light image enhancement that overcomes two core limitations of standard diffusion: slow, fixed-resolution sampling and global degradation like RGB shifts. It achieves this through pyramid diffusion, which performs sampling at progressively higher resolutions with a carefully designed downsampling schedule, and a lightweight global corrector that mitigates global color distortions without heavy cost. Empirical results on LOL and LOLV2 benchmarks demonstrate superior quality and significant speedups over prior state-of-the-art methods, with strong generalization to unseen noise and illumination distributions. PyDiff thus establishes diffusion models as a competitive, practical baseline for low-light enhancement and potentially other low-level vision tasks.
Abstract
Recovering noise-covered details from low-light images is challenging, and the results given by previous methods leave room for improvement. Recent diffusion models show realistic and detailed image generation through a sequence of denoising refinements and motivate us to introduce them to low-light image enhancement for recovering realistic details. However, we found two problems when doing this, i.e., 1) diffusion models keep constant resolution in one reverse process, which limits the speed; 2) diffusion models sometimes result in global degradation (e.g., RGB shift). To address the above problems, this paper proposes a Pyramid Diffusion model (PyDiff) for low-light image enhancement. PyDiff uses a novel pyramid diffusion method to perform sampling in a pyramid resolution style (i.e., progressively increasing resolution in one reverse process). Pyramid diffusion makes PyDiff much faster than vanilla diffusion models and introduces no performance degradation. Furthermore, PyDiff uses a global corrector to alleviate the global degradation that may occur in the reverse process, significantly improving the performance and making the training of diffusion models easier with little additional computational consumption. Extensive experiments on popular benchmarks show that PyDiff achieves superior performance and efficiency. Moreover, PyDiff can generalize well to unseen noise and illumination distributions.
