On the Importance of Conditioning for Privacy-Preserving Data Augmentation
Julian Lorenz, Katja Ludwig, Valentin Haug, Rainer Lienhart
TL;DR
This paper investigates privacy risks in privacy-preserving data augmentation that uses conditioned latent diffusion models. It shows that conditioning on edges and depth preserves identity-related cues, enabling a simple contrastive-learning attacker to re-identify individuals with substantial accuracy, and even enables black-box attacks relying solely on edge representations. Through extensive ablations on conditioning, backbones, and temperatures, the authors demonstrate that conditioned augmentations inherently leak information, with higher re-identification rates when more reference images are available. The findings challenge the viability of edge/depth-preserving diffusion-based anonymization and call for alternative privacy-preserving strategies in data augmentation with real-world implications for sensitive datasets.
Abstract
Latent diffusion models can be used as a powerful augmentation method to artificially extend datasets for enhanced training. To the human eye, these augmented images look very different to the originals. Previous work has suggested to use this data augmentation technique for data anonymization. However, we show that latent diffusion models that are conditioned on features like depth maps or edges to guide the diffusion process are not suitable as a privacy preserving method. We use a contrastive learning approach to train a model that can correctly identify people out of a pool of candidates. Moreover, we demonstrate that anonymization using conditioned diffusion models is susceptible to black box attacks. We attribute the success of the described methods to the conditioning of the latent diffusion model in the anonymization process. The diffusion model is instructed to produce similar edges for the anonymized images. Hence, a model can learn to recognize these patterns for identification.
