FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process
Yang Luo, Yiheng Zhang, Zhaofan Qiu, Ting Yao, Zhineng Chen, Yu-Gang Jiang, Tao Mei
TL;DR
FreeEnhance tackles image enhancement with diffusion models by reframing it as a tuning-free two-stage process in the latent space. It introduces a frequency-aware noising stage (a two-stream scheme with gradient-guided sampling) and a denoising stage regulated by three gradient-based terms—acutance, distribution, and adversarial degradation—along with distribution calibration to preserve diffusion priors. The method leverages a pre-trained latent diffusion model (SDXL) and DDIM inversion to achieve content-consistent detail enrichment, validated on HPDv2 where it outperforms baselines and Magnific AI in both NR-IQA metrics and human preference. The approach demonstrates strong generalization to different diffusion models and extends to text-to-image generation and natural image enhancement, offering practical post-processing benefits for real-world images. Overall, FreeEnhance delivers a tunable, high-quality enhancement pipeline that preserves content while enriching details, making it valuable for commercial and consumer applications.
Abstract
The emergence of text-to-image generation models has led to the recognition that image enhancement, performed as post-processing, would significantly improve the visual quality of the generated images. Exploring diffusion models to enhance the generated images nevertheless is not trivial and necessitates to delicately enrich plentiful details while preserving the visual appearance of key content in the original image. In this paper, we propose a novel framework, namely FreeEnhance, for content-consistent image enhancement using the off-the-shelf image diffusion models. Technically, FreeEnhance is a two-stage process that firstly adds random noise to the input image and then capitalizes on a pre-trained image diffusion model (i.e., Latent Diffusion Models) to denoise and enhance the image details. In the noising stage, FreeEnhance is devised to add lighter noise to the region with higher frequency to preserve the high-frequent patterns (e.g., edge, corner) in the original image. In the denoising stage, we present three target properties as constraints to regularize the predicted noise, enhancing images with high acutance and high visual quality. Extensive experiments conducted on the HPDv2 dataset demonstrate that our FreeEnhance outperforms the state-of-the-art image enhancement models in terms of quantitative metrics and human preference. More remarkably, FreeEnhance also shows higher human preference compared to the commercial image enhancement solution of Magnific AI.
