A Theoretical Justification for Image Inpainting using Denoising Diffusion Probabilistic Models
Litu Rout, Advait Parulekar, Constantine Caramanis, Sanjay Shakkottai
TL;DR
The work addresses image inpainting with diffusion models by assuming data lie on a low-dimensional subspace and proving that diffusion priors can recover missing regions without retraining. It reveals a misalignment bias in RePaint and proposes RePaint$^+$, which realigns drift and dispersion to achieve linear convergence toward the true sample, with provable recovery in the two-state setting and extensions to noisy generators. The analysis provides a universal-mask principle showing inpainting generalizes across unseen masks, derives a closed-form generative solution for two-state diffusion, and demonstrates that resampling intermediate states yields stronger, faster convergence than merely slowing diffusion. Collectively, these results offer theoretical guarantees and practical guidance for diffusion-based inpainting, including mask handling and resampling strategies, with potential extensions to nonlinear manifolds.
Abstract
We provide a theoretical justification for sample recovery using diffusion-based image inpainting in a linear model setting. While most inpainting algorithms require retraining with each new mask, we prove that diffusion-based inpainting generalizes well to unseen masks without retraining. We analyze RePaint (Lugmayr et al., 2022), a popular, recently proposed diffusion-based inpainting algorithm, and show that it has a bias due to misalignment that hampers sample recovery even in a two-state diffusion process. Motivated by our analysis, we propose a modified algorithm, RePaint$^+$, that provably recovers the underlying true sample and enjoys a linear rate of convergence. It achieves this by rectifying the misalignment error present in the drift and dispersion of the reverse process. To the best of our knowledge, this is the first linear convergence result for a diffusion-based image inpainting algorithm.
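To make the linear model setting concrete, the following is a minimal sketch and not RePaint or RePaint$^+$: it assumes a sample has the form x = A z for a known basis A (a hypothetical stand-in for a learned generative prior) and recovers the hidden coordinates by ordinary least squares on the observed ones. The names `inpaint_linear_subspace`, `A`, `mask_a`, and `mask_b` are illustrative assumptions, not taken from the paper. The point is only that, when data lie on a low-dimensional subspace, the same fixed prior can fill in different masks without being refit.

```python
import numpy as np


def inpaint_linear_subspace(A, x_observed, mask):
    """Fill in hidden coordinates of a sample assumed to satisfy x = A @ z.

    A          : (d, k) basis of the assumed low-dimensional subspace
    x_observed : (d,) sample; entries where mask is False are ignored
    mask       : (d,) boolean array, True where the coordinate is observed
    """
    A_obs = A[mask]  # rows of A at the observed coordinates
    # Least-squares estimate of the latent code from observed entries only.
    z_hat, *_ = np.linalg.lstsq(A_obs, x_observed[mask], rcond=None)
    return A @ z_hat  # reconstruct every coordinate, including hidden ones


rng = np.random.default_rng(0)
d, k = 64, 4                      # ambient and subspace dimensions (arbitrary)
A = rng.standard_normal((d, k))   # fixed basis, shared across all masks
x_true = A @ rng.standard_normal(k)

# Two different masks; A is not changed or refit between them.
mask_a = np.ones(d, dtype=bool)
mask_a[:32] = False               # hide a contiguous block
mask_b = rng.random(d) > 0.5      # hide a random subset

# Zero out hidden entries to show they play no role in the estimate.
x_hat_a = inpaint_linear_subspace(A, np.where(mask_a, x_true, 0.0), mask_a)
x_hat_b = inpaint_linear_subspace(A, np.where(mask_b, x_true, 0.0), mask_b)

err_a = np.max(np.abs(x_hat_a - x_true))
err_b = np.max(np.abs(x_hat_b - x_true))
print(err_a, err_b)
```

Recovery in this toy setting relies on the observed rows of `A` having full column rank, which requires at least k observed coordinates. The diffusion-based algorithms analyzed in the paper reach the true sample through an iterative reverse process rather than a direct solve.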
