Classifier-Free Guidance inside the Attraction Basin May Cause Memorization
Anubhav Jain, Yuya Kobayashi, Takashi Shibuya, Yuhta Takida, Nasir Memon, Julian Togelius, Yuki Mitsufuji
TL;DR
This work shows that memorization in diffusion-based image generation can be understood through an attraction-basin dynamic in the denoising trajectory. The authors propose an inference-time mitigation that delays classifier-free guidance (CFG) until a transition point is reached, plus Opposite Guidance to hasten exit from the attraction basin, all without retraining. They formalize static and dynamic transition-point strategies and demonstrate their effectiveness across multiple memorization scenarios, including LAION-100k finetuning, data duplication, and trigger-token prompts, while maintaining image quality and textual alignment. The proposed method is fast, requires no prompt or weight changes, and generalizes beyond individual scenarios, offering a practical, broadly applicable solution to memorization in diffusion models. Overall, it contributes a dynamical-systems perspective and a lightweight, robust mitigation that can be readily integrated into existing diffusion pipelines.
Abstract
Diffusion models are prone to exactly reproduce images from the training data. This exact reproduction of the training data is concerning as it can lead to copyright infringement and/or leakage of privacy-sensitive information. In this paper, we present a novel perspective on the memorization phenomenon and propose a simple yet effective approach to mitigate it. We argue that memorization occurs because of an attraction basin in the denoising process which steers the diffusion trajectory towards a memorized image. However, this can be mitigated by guiding the diffusion trajectory away from the attraction basin by not applying classifier-free guidance until an ideal transition point occurs from which classifier-free guidance is applied. This leads to the generation of non-memorized images that are high in image quality and well-aligned with the conditioning mechanism. To further improve on this, we present a new guidance technique, opposite guidance, that escapes the attraction basin sooner in the denoising process. We demonstrate the existence of attraction basins in various scenarios in which memorization occurs, and we show that our proposed approach successfully mitigates memorization.
