Retinex-Diffusion: On Controlling Illumination Conditions in Diffusion Models via Retinex Theory
Xiaoyan Xing, Vincent Tao Hu, Jan Hendrik Metzen, Konrad Groh, Sezer Karaoglu, Theo Gevers
TL;DR
The paper presents Retinex-Diffusion, a training-free framework that controls illumination in diffusion models by guiding the reverse process with an illumination-energy term decomposed into illumination-variant and -invariant parts. It leverages a Retinex-inspired illumination extraction and a Gaussian illumination prompt to steer lighting, including shadows and inter-reflections, while preserving geometry in real-image relighting via CCR-based guidance. The method demonstrates realistic illumination generation and editing on generated and real images using pre-trained diffusion models, with ablations confirming the importance of illumination-property extraction and geometry preservation. This approach enables practical, plug-and-play illumination control without additional training or data labels, expanding diffusion models' applicability in photorealistic rendering and relighting tasks.
Abstract
This paper introduces a novel approach to illumination manipulation in diffusion models, addressing the gap in conditional image generation with a focus on lighting conditions. We conceptualize the diffusion model as a black-box image render and strategically decompose its energy function in alignment with the image formation model. Our method effectively separates and controls illumination-related properties during the generative process. It generates images with realistic illumination effects, including cast shadow, soft shadow, and inter-reflections. Remarkably, it achieves this without the necessity for learning intrinsic decomposition, finding directions in latent space, or undergoing additional training with new datasets.
