WeatherDiffusion: Controllable Weather Editing in Intrinsic Space
Yixin Zhu, Zuoliang Zhu, Jian Yang, Miloš Hašan, Jin Xie, Beibei Wang
TL;DR
WeatherDiffusion tackles robust, controllable weather editing in outdoor driving scenes by operating in intrinsic space. It jointly learns an inverse renderer to infer weather-invariant maps (albedo, normals, etc.) and a forward renderer that re-synthesizes weather-conditioned images guided by text prompts, enhanced by intrinsic-map-aware attention and CLIP-space interpolation. The authors introduce WeatherSynthetic and WeatherReal datasets with intrinsic maps to support learning and evaluation, and demonstrate advantages over pixel-space editing, weather restoration, and rendering-based methods, including measurable gains in downstream perception tasks. This intrinsic-space approach enables fine-grained, physically grounded control of weather effects while preserving scene geometry and material integrity, with potential impact on autonomous driving robustness under adverse conditions.
Abstract
We present WeatherDiffusion, a diffusion-based framework for controllable weather editing in intrinsic space. Our framework includes two components based on diffusion priors: an inverse renderer that estimates material properties, scene geometry, and lighting as intrinsic maps from an input image, and a forward renderer that utilizes these geometry and material maps along with a text prompt that describes specific weather conditions to generate a final image. The intrinsic maps enhance controllability compared to traditional pixel-space editing approaches. We propose an intrinsic map-aware attention mechanism that improves spatial correspondence and decomposition quality in large outdoor scenes. For forward rendering, we leverage CLIP-space interpolation of weather prompts to achieve fine-grained weather control. We also introduce a synthetic and a real-world dataset, containing 38k and 18k images under various weather conditions, each with intrinsic map annotations. WeatherDiffusion outperforms state-of-the-art pixel-space editing approaches, weather restoration methods, and rendering-based methods, showing promise for downstream tasks such as autonomous driving, enhancing the robustness of detection and segmentation in challenging weather scenarios.
