Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks
Yuhao Liu, Zhanghan Ke, Fang Liu, Nanxuan Zhao, Rynson W. H. Lau
TL;DR
Diff-Plugin addresses the challenge of preserving fine input details in diffusion-based low-level vision tasks by introducing a modular, plug-and-play framework. It adds Task-Plugins with dual branches (Task-Prompt Branch and Spatial Complement Branch) and a Plugin-Selector that routes natural-language prompts to the appropriate plugin, enabling text-driven, multi-task editing without retraining the base model. The approach demonstrates strong fidelity across eight tasks, showing state-of-the-art performance among diffusion-based methods and competitive results versus regression-based models, with robust training and schedulability across dataset sizes. Practically, this framework provides a flexible, scalable tool for reliable, detail-preserving edits in real-world scenarios, while also highlighting potential for region-specific guidance in future work.
Abstract
Diffusion models trained on large-scale datasets have achieved remarkable progress in image synthesis. However, due to the randomness in the diffusion process, they often struggle with handling diverse low-level tasks that require details preservation. To overcome this limitation, we present a new Diff-Plugin framework to enable a single pre-trained diffusion model to generate high-fidelity results across a variety of low-level tasks. Specifically, we first propose a lightweight Task-Plugin module with a dual branch design to provide task-specific priors, guiding the diffusion process in preserving image content. We then propose a Plugin-Selector that can automatically select different Task-Plugins based on the text instruction, allowing users to edit images by indicating multiple low-level tasks with natural language. We conduct extensive experiments on 8 low-level vision tasks. The results demonstrate the superiority of Diff-Plugin over existing methods, particularly in real-world scenarios. Our ablations further validate that Diff-Plugin is stable, schedulable, and supports robust training across different dataset sizes.
