TurboFill: Adapting Few-step Text-to-image Model for Fast Image Inpainting
Liangbin Xie, Daniil Pakhomov, Zhonghao Wang, Zongze Wu, Ziyan Chen, Yuqian Zhou, Haitian Zheng, Zhifei Zhang, Zhe Lin, Jiantao Zhou, Chao Dong
TL;DR
TurboFill addresses the high computational cost of diffusion-based image inpainting by training an inpainting adapter directly on a few-step diffusion model using a 3-step adversarial scheme that jointly uses a diffusion discriminator and GAN objectives. The approach enables high-quality inpainting with only four diffusion steps and introduces LocalCaptionData for targeted region prompts, along with DilationBench and HumanBench to assess performance under varied mask complexities and user-centric prompts. Empirical results show TurboFill outperforms both multi-step BrushNet and LoRA-accelerated few-step baselines in objective quality metrics and human preferences, while significantly reducing training and inference costs. The work offers a practical, scalable solution for fast, realistic inpainting in real-world workflows, with dedicated benchmarks to guide future improvements.
Abstract
This paper introduces TurboFill, a fast image inpainting model that enhances a few-step text-to-image diffusion model with an inpainting adapter for high-quality and efficient inpainting. While standard diffusion models generate high-quality results, they incur high computational costs. We overcome this by training an inpainting adapter on a few-step distilled text-to-image model, DMD2, using a novel 3-step adversarial training scheme to ensure realistic, structurally consistent, and visually harmonious inpainted regions. To evaluate TurboFill, we propose two benchmarks: DilationBench, which tests performance across mask sizes, and HumanBench, based on human feedback for complex prompts. Experiments show that TurboFill outperforms both multi-step BrushNet and few-step inpainting methods, setting a new benchmark for high-performance inpainting tasks. Our project page: https://liangbinxie.github.io/projects/TurboFill/
