The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing
Shen Nie, Hanzhong Allan Guo, Cheng Lu, Yuhao Zhou, Chenyu Zheng, Chongxuan Li
TL;DR
The paper reframes diffusion-based image editing as a probabilistic process where editing defines a task-specific SDE/ODE. It proves that KL divergence between edited and data distributions contracts under SDE editing as time approaches zero, unlike ODE editing, and introduces SDE-Drag along with the DragBench benchmark. Through extensive inpainting, image-to-image translation, and dragging experiments, the authors demonstrate that SDE-based editing consistently outperforms ODE baselines and state-of-the-art dragging methods, with comparable time efficiency. The work provides a principled theoretical foundation for the superiority of diffusion randomness in editing and delivers practical, open-set capable editing tools. Overall, it broadens the scope and effectiveness of diffusion-based image editing across diverse tasks.
Abstract
We present a unified probabilistic formulation for diffusion-based image editing, where a latent variable is edited in a task-specific manner and generally deviates from the corresponding marginal distribution induced by the original stochastic or ordinary differential equation (SDE or ODE). Instead, it defines a corresponding SDE or ODE for editing. In the formulation, we prove that the Kullback-Leibler divergence between the marginal distributions of the two SDEs gradually decreases while that for the ODEs remains as the time approaches zero, which shows the promise of SDE in image editing. Inspired by it, we provide the SDE counterparts for widely used ODE baselines in various tasks including inpainting and image-to-image translation, where SDE shows a consistent and substantial improvement. Moreover, we propose SDE-Drag -- a simple yet effective method built upon the SDE formulation for point-based content dragging. We build a challenging benchmark (termed DragBench) with open-set natural, art, and AI-generated images for evaluation. A user study on DragBench indicates that SDE-Drag significantly outperforms our ODE baseline, existing diffusion-based methods, and the renowned DragGAN. Our results demonstrate the superiority and versatility of SDE in image editing and push the boundary of diffusion-based editing methods.
