Precise, Fast, and Low-cost Concept Erasure in Value Space: Orthogonal Complement Matters

Yuan Wang; Ouxiang Li; Tingting Mu; Yanbin Hao; Kuien Liu; Xiang Wang; Xiangnan He

Precise, Fast, and Low-cost Concept Erasure in Value Space: Orthogonal Complement Matters

Yuan Wang, Ouxiang Li, Tingting Mu, Yanbin Hao, Kuien Liu, Xiang Wang, Xiangnan He

TL;DR

This work tackles the problem of precise, fast, and low-cost concept erasure in text-to-image diffusion models. It introduces AdaVD, a training-free method that performs projection onto the orthogonal complement of target concept value vectors in cross-attention layers, enhanced by a token-wise adaptive shift to preserve non-target priors. The approach demonstrates superior erasure efficacy and prior preservation across single- and multi-concept erasure tasks, transferring effectively across SD v1.4, SDXL v1.0, and SDv3, with notable runtime efficiency. Practically, AdaVD enables real-time, scalable concept erasure with broad applicability in image generation platforms and editing workflows.

Abstract

Recent success of text-to-image (T2I) generation and its increasing practical applications, enabled by diffusion models, require urgent consideration of erasing unwanted concepts, e.g., copyrighted, offensive, and unsafe ones, from the pre-trained models in a precise, timely, and low-cost manner. The twofold demand of concept erasure includes not only a precise removal of the target concept (i.e., erasure efficacy) but also a minimal change on non-target content (i.e., prior preservation), during generation. Existing methods face challenges in maintaining an effective balance between erasure efficacy and prior preservation, and they can be computationally costly. To improve, we propose a precise, fast, and low-cost concept erasure method, called Adaptive Value Decomposer (AdaVD), which is training-free. Our method is grounded in a classical linear algebraic operation of computing the orthogonal complement, implemented in the value space of each cross-attention layer within the UNet of diffusion models. We design a shift factor to adaptively navigate the erasure strength, enhancing effective prior preservation without sacrificing erasure efficacy. Extensive comparative experiments with both training-based and training-free state-of-the-art methods demonstrate that the proposed AdaVD excels in both single and multiple concept erasure, showing 2 to 10 times improvement in prior preservation than the second best, meanwhile achieving the best or near best erasure efficacy. AdaVD supports a series of diffusion models and downstream image generation tasks, with code available on: https://github.com/WYuan1001/AdaVD.

Precise, Fast, and Low-cost Concept Erasure in Value Space: Orthogonal Complement Matters

TL;DR

Abstract

Precise, Fast, and Low-cost Concept Erasure in Value Space: Orthogonal Complement Matters

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (18)