Table of Contents
Fetching ...

ID-Guard: A Universal Framework for Combating Facial Manipulation via Breaking Identification

Zuomin Qu, Wei Lu, Xiangyang Luo, Qian Wang, Xiaochun Cao

TL;DR

ID-Guard tackles the problem of facial manipulation by introducing a universal, proactive defense that generates cross-model transferable perturbations in a single forward pass. It combines an Identity Destruction Module to erase identifiable facial features with a dynamic, multi-task training scheme (including MGDA and KPI-based strategies) and a gradient prior perturbation to stabilize optimization. The approach delivers strong cross-model disruption across multiple open-source manipulators, effectively degrading identity information as measured by $L_{2}^{face}$ and ID similarity, while remaining inconspicuous within a perturbation bound $\|\delta\|_\infty \le \epsilon$. It also demonstrates practical utility as an adversarial training module and remains robust under lossy operations and in gray-box settings, offering a versatile tool for mitigating face stigmatization and enhancing resilience of downstream systems. Overall, ID-Guard provides a scalable, plug-and-play framework for proactive defense against facial manipulation with tangible societal and technical impact.

Abstract

The misuse of deep learning-based facial manipulation poses a significant threat to civil rights. To prevent this fraud at its source, proactive defense has been proposed to disrupt the manipulation process by adding invisible adversarial perturbations into images, making the forged output unconvincing to observers. However, the non-specific disruption against the output may lead to the retention of identifiable facial features, potentially resulting in the stigmatization of the individual. This paper proposes a universal framework for combating facial manipulation, termed ID-Guard. Specifically, this framework operates with a single forward pass of an encoder-decoder network to produce a cross-model transferable adversarial perturbation. A novel Identity Destruction Module (IDM) is introduced to degrade identifiable features in forged faces. We optimize the perturbation generation by framing the disruption of different facial manipulations as a multi-task learning problem, and a dynamic weight strategy is devised to enhance cross-model performance. Experimental results demonstrate that the proposed ID-Guard exhibits strong efficacy in defending against various facial manipulation models, effectively degrading identifiable regions in manipulated images. It also enables disrupted images to evade facial inpainting and image recognition systems. Additionally, ID-Guard can seamlessly function as a plug-and-play component, integrating with other tasks such as adversarial training.

ID-Guard: A Universal Framework for Combating Facial Manipulation via Breaking Identification

TL;DR

ID-Guard tackles the problem of facial manipulation by introducing a universal, proactive defense that generates cross-model transferable perturbations in a single forward pass. It combines an Identity Destruction Module to erase identifiable facial features with a dynamic, multi-task training scheme (including MGDA and KPI-based strategies) and a gradient prior perturbation to stabilize optimization. The approach delivers strong cross-model disruption across multiple open-source manipulators, effectively degrading identity information as measured by and ID similarity, while remaining inconspicuous within a perturbation bound . It also demonstrates practical utility as an adversarial training module and remains robust under lossy operations and in gray-box settings, offering a versatile tool for mitigating face stigmatization and enhancing resilience of downstream systems. Overall, ID-Guard provides a scalable, plug-and-play framework for proactive defense against facial manipulation with tangible societal and technical impact.

Abstract

The misuse of deep learning-based facial manipulation poses a significant threat to civil rights. To prevent this fraud at its source, proactive defense has been proposed to disrupt the manipulation process by adding invisible adversarial perturbations into images, making the forged output unconvincing to observers. However, the non-specific disruption against the output may lead to the retention of identifiable facial features, potentially resulting in the stigmatization of the individual. This paper proposes a universal framework for combating facial manipulation, termed ID-Guard. Specifically, this framework operates with a single forward pass of an encoder-decoder network to produce a cross-model transferable adversarial perturbation. A novel Identity Destruction Module (IDM) is introduced to degrade identifiable features in forged faces. We optimize the perturbation generation by framing the disruption of different facial manipulations as a multi-task learning problem, and a dynamic weight strategy is devised to enhance cross-model performance. Experimental results demonstrate that the proposed ID-Guard exhibits strong efficacy in defending against various facial manipulation models, effectively degrading identifiable regions in manipulated images. It also enables disrupted images to evade facial inpainting and image recognition systems. Additionally, ID-Guard can seamlessly function as a plug-and-play component, integrating with other tasks such as adversarial training.
Paper Structure (44 sections, 16 equations, 17 figures, 8 tables, 1 algorithm)

This paper contains 44 sections, 16 equations, 17 figures, 8 tables, 1 algorithm.

Figures (17)

  • Figure 1: Illustration of the impact of malicious propagation of facial manipulation samples. Fakes will lead to rumors spreading, and insufficient distortion of faces by traditional defense methods will cause face stigmatization. Our method disrupts the observer's identification of the identity in the sample and thus adequately protects the individual's rights.
  • Figure 2: Illustration of potential threats to the insufficiently disrupted facial example. Challenges come primarily from commercial face recognition systems and facial inpainting algorithms.
  • Figure 3: The publicly available pre-trained models can be easily downloaded from open-source platforms to implement forgeries. For a given image, the proposed ID-Guard can generate transferable perturbations for defense against multiple open-source facial manipulations through a single forward propagation of an image reconstruction network.
  • Figure 4: Illustration of the proposed ID-Guard framework. The perturbation generator takes a natural image $x$ as input and requires only one forward propagation to generate a cross-model adversarial perturbation dedicated to the input face that can be used to defend against multiple facial manipulations. In the training phase, ID-Guard consists of three modules, including the Identity Destruction Module, the dynamic weight strategy, and the gradient prior perturbation strategy. The notation descriptions are shown in the upper right corner for reference.
  • Figure 5: Visual example of $L_{2}^{face}$ metric design. (a) is a natural image, (b) is a forged image, and (c) and (d) are disrupted images in two different situations.
  • ...and 12 more figures