Zero-Shot Low Light Image Enhancement with Diffusion Prior
Joshua Cho, Sara Aghajanzadeh, Zhen Zhu, D. A. Forsyth
TL;DR
This work addresses low-light image enhancement and auto white balance using a zero-shot approach that hinges on a pre-trained diffusion prior and internal self-attention signals, avoiding any training or test-time optimization. It introduces a four-step inference pipeline—preprocessing, DDIM inversion with self-attention feature extraction, AdaIN-based latent normalization to a standard Gaussian, and SA-guided denoising with a QuadPrior decoder—to achieve faithful recovery with color constancy. Across paired (LOL, LSRW) and unpaired datasets, the method delivers state-of-the-art performance among zero-shot and unsupervised methods and remains competitive with supervised baselines, while AWB evaluations show zero-shot color correction without task-specific training. The approach demonstrates that diffusion priors combined with self-attention guidance can robustly correct illumination and color distortions, offering a practical, training-free solution for LLIE and AWB with broad applicability.
Abstract
In this paper, we present a simple yet highly effective "free lunch" solution for low-light image enhancement (LLIE), which aims to restore low-light images as if acquired in well-illuminated environments. Our method necessitates no optimization, training, fine-tuning, text conditioning, or hyperparameter adjustments, yet it consistently reconstructs low-light images with superior fidelity. Specifically, we leverage a pre-trained text-to-image diffusion prior, learned from training on a large collection of natural images, and the features present in the model itself to guide the inference, in contrast to existing methods that depend on customized constraints. Comprehensive quantitative evaluations demonstrate that our approach outperforms SOTA methods on established datasets, while qualitative analyses indicate enhanced color accuracy and the rectification of subtle chromatic deviations. Furthermore, additional experiments reveal that our method, without any modifications, achieves SOTA-comparable performance in the auto white balance (AWB) task.
