KV-Edit: Training-Free Image Editing for Precise Background Preservation

Tianrui Zhu; Shiyi Zhang; Jiawei Shao; Yansong Tang

KV-Edit: Training-Free Image Editing for Precise Background Preservation

Tianrui Zhu, Shiyi Zhang, Jiawei Shao, Yansong Tang

TL;DR

KV-Edit presents a training-free image editing framework that strictly preserves background content by caching background key-value tokens during inversion and reusing them during denoising. By decoupling foreground editing from the background through an attention scheme and leveraging a KV cache, it achieves perfect background preservation while enabling flexible edits guided by user prompts. The approach is complemented by mask-guided inversion and reinitialization options, plus an inversion-free variant that reduces memory to O(1), increasing practicality. Extensive PIE-Bench evaluations and user studies demonstrate superior background preservation and competitive image quality relative to training-free and training-based methods, with strong potential for broader applications such as video editing.

Abstract

Background consistency remains a significant challenge in image editing tasks. Despite extensive developments, existing works still face a trade-off between maintaining similarity to the original image and generating content that aligns with the target. Here, we propose KV-Edit, a training-free approach that uses KV cache in DiTs to maintain background consistency, where background tokens are preserved rather than regenerated, eliminating the need for complex mechanisms or expensive training, ultimately generating new content that seamlessly integrates with the background within user-provided regions. We further explore the memory consumption of the KV cache during editing and optimize the space complexity to $O(1)$ using an inversion-free method. Our approach is compatible with any DiT-based generative model without additional training. Experiments demonstrate that KV-Edit significantly outperforms existing approaches in terms of both background and image quality, even surpassing training-based methods. Project webpage is available at https://xilluill.github.io/projectpages/KV-Edit

KV-Edit: Training-Free Image Editing for Precise Background Preservation

TL;DR

Abstract

KV-Edit: Training-Free Image Editing for Precise Background Preservation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)