Differentially Private Adaptation of Diffusion Models via Noisy Aggregated Embeddings
Pura Peetathawatchai, Wei-Ning Chen, Berivan Isik, Sanmi Koyejo, Albert No
TL;DR
The paper tackles privacy risks in personalizing diffusion models on small, sensitive datasets by proposing DPAgg-TI, which privately aggregates per-image TI embeddings to adapt generation without full model fine-tuning. By learning separate embeddings for each image and aggregating them into a noisy centroid, DPAgg-TI achieves formal $(\varepsilon,\delta)$-DP guarantees and preserves stylistic fidelity much better than DP-SGD under the same privacy budget, as demonstrated on private artwork and Paris 2024 pictograms. The approach leverages subsampling to amplify privacy and uses normalization to bound sensitivity, enabling efficient, modular adaptation with competitive outputs close to non-private baselines. Experimental results include perceptual user studies, KID analyses, and an ablation comparing to DP-SGD, highlighting the method’s robustness in low-data regimes and its practical implications for privacy-preserving style transfer in diffusion models.
Abstract
Personalizing large-scale diffusion models poses serious privacy risks, especially when adapting to small, sensitive datasets. A common approach is to fine-tune the model using differentially private stochastic gradient descent (DP-SGD), but this suffers from severe utility degradation due to the high noise needed for privacy, particularly in the small data regime. We propose an alternative that leverages Textual Inversion (TI), which learns an embedding vector for an image or set of images, to enable adaptation under differential privacy (DP) constraints. Our approach, Differentially Private Aggregation via Textual Inversion (DPAgg-TI), adds calibrated noise to the aggregation of per-image embeddings to ensure formal DP guarantees while preserving high output fidelity. We show that DPAgg-TI outperforms DP-SGD finetuning in both utility and robustness under the same privacy budget, achieving results closely matching the non-private baseline on style adaptation tasks using private artwork from a single artist and Paris 2024 Olympic pictograms. In contrast, DP-SGD fails to generate meaningful outputs in this setting.
