ProDehaze: Prompting Diffusion Models Toward Faithful Image Dehazing
Tianwen Zhou, Jing Wang, Songtao Wu, Kuanhong Xu
TL;DR
ProDehaze tackles hallucination in diffusion-model-based image dehazing by injecting selective internal priors to guide external pretrained priors. It introduces two components: Structure-Prompted Restorer (SPR) in the latent space, which uses high-frequency structure cues via Haar DWT to emphasize structure-rich regions, and Haze-Aware Self-Correcting Refiner (HCR) in decoding, which employs a Dark Channel Prior-derived haze mask to modulate attention and promote distribution alignment between clearer input regions and the output. The approach demonstrates superior fidelity and reduced color shifts on synthetic and real-world hazy datasets, outperforming several diffusion-based and prompting methods without extensive real-world fine-tuning. Overall, ProDehaze validates internal priors as a powerful mechanism to guide pretrained diffusion models toward faithful restoration, with potential applicability to a broader class of inverse problems in vision.
Abstract
Recent approaches using large-scale pretrained diffusion models for image dehazing improve perceptual quality but often suffer from hallucination issues, producing unfaithful dehazed image to the original one. To mitigate this, we propose ProDehaze, a framework that employs internal image priors to direct external priors encoded in pretrained models. We introduce two types of \textit{selective} internal priors that prompt the model to concentrate on critical image areas: a Structure-Prompted Restorer in the latent space that emphasizes structure-rich regions, and a Haze-Aware Self-Correcting Refiner in the decoding process to align distributions between clearer input regions and the output. Extensive experiments on real-world datasets demonstrate that ProDehaze achieves high-fidelity results in image dehazing, particularly in reducing color shifts. Our code is at https://github.com/TianwenZhou/ProDehaze.
