FaceShield: Defending Facial Image against Deepfake Threats

Jaehwan Jeong; Sumin In; Sieun Kim; Hannie Shin; Jongheon Jeong; Sang Ho Yoon; Jaewook Chung; Sangpil Kim

FaceShield: Defending Facial Image against Deepfake Threats

Jaehwan Jeong, Sumin In, Sieun Kim, Hannie Shin, Jongheon Jeong, Sang Ho Yoon, Jaewook Chung, Sangpil Kim

TL;DR

FaceShield introduces a proactive, transferable defense against deepfakes by perturbing conditioning flows in diffusion models and disrupting facial feature extractors. It combines conditioned-face attacks, multi-backbone feature extractor perturbations, and an enhanced noise update mechanism with Gaussian blur and low-pass filtering to achieve imperceptible yet robust protection. The method demonstrates state-of-the-art protection against diffusion-model deepfakes, with transferability to GAN-based attacks and robustness to JPEG compression and purification techniques. Its extensibility to various deepfake pipelines and reduced computational cost make it a practical defense for real-world deployments.

Abstract

The rising use of deepfakes in criminal activities presents a significant issue, inciting widespread controversy. While numerous studies have tackled this problem, most primarily focus on deepfake detection. These reactive solutions are insufficient as a fundamental approach for crimes where authenticity is disregarded. Existing proactive defenses also have limitations, as they are effective only for deepfake models based on specific Generative Adversarial Networks (GANs), making them less applicable in light of recent advancements in diffusion-based models. In this paper, we propose a proactive defense method named FaceShield, which introduces novel defense strategies targeting deepfakes generated by Diffusion Models (DMs) and facilitates defenses on various existing GAN-based deepfake models through facial feature extractor manipulations. Our approach consists of three main components: (i) manipulating the attention mechanism of DMs to exclude protected facial features during the denoising process, (ii) targeting prominent facial feature extraction models to enhance the robustness of our adversarial perturbation, and (iii) employing Gaussian blur and low-pass filtering techniques to improve imperceptibility while enhancing robustness against JPEG compression. Experimental results on the CelebA-HQ and VGGFace2-HQ datasets demonstrate that our method achieves state-of-the-art performance against the latest deepfake models based on DMs, while also exhibiting transferability to GANs and showcasing greater imperceptibility of noise along with enhanced robustness. Code is available here: https://github.com/kuai-lab/iccv25_faceshield

FaceShield: Defending Facial Image against Deepfake Threats

TL;DR

Abstract

FaceShield: Defending Facial Image against Deepfake Threats

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (30)