Enhancing Facial Privacy Protection via Weakening Diffusion Purification
Ali Salar, Qing Liu, Yingli Tian, Guoying Zhao
TL;DR
This work tackles facial privacy protection against unauthorized AFR by exploiting latent diffusion models to edit latent codes rather than pixel space. It introduces learnable unconditional embeddings (null-text guidance) and self-attention-based structure preservation to weaken diffusion purification and preserve image quality while maintaining impersonation strength. The two-stage approach—unconditional-embedding learning followed by adversarial latent-code optimization—yields improved protection transferability across multiple FR models and datasets (CelebA-HQ, LADN) with competitive visual fidelity. The method demonstrates robustness to common countermeasures and outperforms state-of-the-art baselines on PSR and related image-quality metrics, offering a practical privacy-protection tool while highlighting ethical considerations around impersonation versus obfuscation. Overall, the paper provides a scalable, transfer-based black-box strategy for targeted de-identification via latent-diffusion editing that balances privacy efficacy and perceptual realism.
Abstract
The rapid growth of social media has led to the widespread sharing of individual portrait images, which pose serious privacy risks due to the capabilities of automatic face recognition (AFR) systems for mass surveillance. Hence, protecting facial privacy against unauthorized AFR systems is essential. Inspired by the generation capability of the emerging diffusion models, recent methods employ diffusion models to generate adversarial face images for privacy protection. However, they suffer from the diffusion purification effect, leading to a low protection success rate (PSR). In this paper, we first propose learning unconditional embeddings to increase the learning capacity for adversarial modifications and then use them to guide the modification of the adversarial latent code to weaken the diffusion purification effect. Moreover, we integrate an identity-preserving structure to maintain structural consistency between the original and generated images, allowing human observers to recognize the generated image as having the same identity as the original. Extensive experiments conducted on two public datasets, i.e., CelebA-HQ and LADN, demonstrate the superiority of our approach. The protected faces generated by our method outperform those produced by existing facial privacy protection approaches in terms of transferability and natural appearance.
