LayerDiffusion: Layered Controlled Image Editing with Diffusion Models
Pengzhi Li, QInxuan Huang, Yikang Ding, Zhiheng Li
TL;DR
LayerDiffusion tackles the challenge of text-guided image editing that requires simultaneous background replacement and subject attribute changes while preserving the subject's identity. It introduces a layered framework that decouples foreground and background editing via layered controlled optimization of text embeddings and a layered diffusion training regime, complemented by an iterative guidance strategy to tightly enforce textual constraints. The method achieves high fidelity to input subject features and coherent integration into new backgrounds, outperforming existing editing approaches on multitask scenarios. User studies corroborate the quantitative gains, highlighting LayerDiffusion's potential to enable versatile, controllable image edits with single-image inputs.
Abstract
Text-guided image editing has recently experienced rapid development. However, simultaneously performing multiple editing actions on a single image, such as background replacement and specific subject attribute changes, while maintaining consistency between the subject and the background remains challenging. In this paper, we propose LayerDiffusion, a semantic-based layered controlled image editing method. Our method enables non-rigid editing and attribute modification of specific subjects while preserving their unique characteristics and seamlessly integrating them into new backgrounds. We leverage a large-scale text-to-image model and employ a layered controlled optimization strategy combined with layered diffusion training. During the diffusion process, an iterative guidance strategy is used to generate a final image that aligns with the textual description. Experimental results demonstrate the effectiveness of our method in generating highly coherent images that closely align with the given textual description. The edited images maintain a high similarity to the features of the input image and surpass the performance of current leading image editing methods. LayerDiffusion opens up new possibilities for controllable image editing.
