Minutes to Seconds: Speeded-up DDPM-based Image Inpainting with Coarse-to-Fine Sampling
Lintao Zhang, Xiangcheng Du, LeoWu TomyEnrique, Yiqun Wang, Yingbin Zheng, Cheng Jin
TL;DR
The paper tackles slow DDPM-based image inpainting by extending RePaint with three speed-up strategies: using a Light-Weight Diffusion Model (LWDM) with a perception-driven loss, skip-step DDIM sampling, and Coarse-to-Fine Sampling (CFS). These components form a two-stage, conditioned diffusion framework with dedicated modules for denoising (CDM) and resampling (CRM). Experimental results on CelebA-HQ and ImageNet across six mask types demonstrate substantial speedups (about $60\times$) with competitive inpainting quality. The approach enables faster, flexible diffusion-based inpainting suitable for interactive editing and broad mask distributions.
Abstract
For image inpainting, the existing Denoising Diffusion Probabilistic Model (DDPM) based method i.e. RePaint can produce high-quality images for any inpainting form. It utilizes a pre-trained DDPM as a prior and generates inpainting results by conditioning on the reverse diffusion process, namely denoising process. However, this process is significantly time-consuming. In this paper, we propose an efficient DDPM-based image inpainting method which includes three speed-up strategies. First, we utilize a pre-trained Light-Weight Diffusion Model (LWDM) to reduce the number of parameters. Second, we introduce a skip-step sampling scheme of Denoising Diffusion Implicit Models (DDIM) for the denoising process. Finally, we propose Coarse-to-Fine Sampling (CFS), which speeds up inference by reducing image resolution in the coarse stage and decreasing denoising timesteps in the refinement stage. We conduct extensive experiments on both faces and general-purpose image inpainting tasks, and our method achieves competitive performance with approximately 60 times speedup.
