LoyalDiffusion: A Diffusion Model Guarding Against Data Replication
Chenghao Li, Yuke Zhang, Dake Chen, Jingqi Xu, Peter A. Beerel
TL;DR
This work addresses privacy risks in diffusion models arising from memorization of training data. It introduces LoyalDiffusion, which combines a replication-aware U-Net (RAU-Net) that replaces direct skip-connection transfers with information transfer blocks and a two-stage training strategy that applies RAU-Net only at timesteps where image fidelity is less sensitive, thereby reducing replication without sacrificing quality. The framework also integrates with data-centric strategies (GC&DF) to further mitigate memorization, and its effectiveness is demonstrated on SD v2.1 fine-tuned on LAION-2B, achieving substantial replication reductions while maintaining competitive FID and CLIP scores. A Bing-based RepliBing evaluation suggests replication scores approach a real-world lower bound, underscoring the practical significance of the approach for privacy-preserving diffusion models. Overall, LoyalDiffusion provides a novel model-centric avenue for replication mitigation that complements existing data-centric methods and highlights the role of timesteps in memorization dynamics.
Abstract
Diffusion models have demonstrated significant potential in image generation. However, their ability to replicate training data presents a privacy risk, particularly when the training data includes confidential information. Existing mitigation strategies primarily focus on augmenting the training dataset, leaving the impact of diffusion model architecture under explored. In this paper, we address this gap by examining and mitigating the impact of the model structure, specifically the skip connections in the diffusion model's U-Net model. We first present our observation on a trade-off in the skip connections. While they enhance image generation quality, they also reinforce the memorization of training data, increasing the risk of replication. To address this, we propose a replication-aware U-Net (RAU-Net) architecture that incorporates information transfer blocks into skip connections that are less essential for image quality. Recognizing the potential impact of RAU-Net on generation quality, we further investigate and identify specific timesteps during which the impact on memorization is most pronounced. By applying RAU-Net selectively at these critical timesteps, we couple our novel diffusion model with a targeted training and inference strategy, forming a framework we refer to as LoyalDiffusion. Extensive experiments demonstrate that LoyalDiffusion outperforms the state-of-the-art replication mitigation method achieving a 48.63% reduction in replication while maintaining comparable image quality.
