Table of Contents
Fetching ...

Rethinking Impersonation and Dodging Attacks on Face Recognition Systems

Fengfan Zhou, Qianyu Zhou, Bangjie Yin, Hui Zheng, Xuequan Lu, Lizhuang Ma, Hefei Ling

TL;DR

Face recognition systems can be deceived by adversarial examples, yet success in impersonation does not guarantee dodging in black-box FR due to open-set, multi-identity samples. The authors introduce Adv-Pruning, a three-stage attack (Priming, Pruning, Restoration) that uses Adversarial Priority Quantification to prune low-impact perturbations and Biased Gradient Adaptation to bias remaining perturbations toward dodging, while preserving impersonation. They formalize impersonation ($\mathcal{L}^{i}$) and dodging ($\mathcal{L}^{d}$) losses and show that a multi-task objective $\mathcal{L} = \lambda \mathcal{L}^{i} + \mathcal{L}^{d}$ suffers in black-box settings; Adv-Pruning mitigates this by freeing space for dodging-focused perturbations. Extensive experiments across datasets and models demonstrate significantly improved dodging ASR with minimal loss to impersonation, including under JPEG compression and on adversarially robust FR models, indicating practical impact for assessing FR security.

Abstract

Face Recognition (FR) systems can be easily deceived by adversarial examples that manipulate benign face images through imperceptible perturbations. Adversarial attacks on FR encompass two types: impersonation (targeted) attacks and dodging (untargeted) attacks. Previous methods often achieve a successful impersonation attack on FR, however, it does not necessarily guarantee a successful dodging attack on FR in the black-box setting. In this paper, our key insight is that the generation of adversarial examples should perform both impersonation and dodging attacks simultaneously. To this end, we propose a novel attack method termed as Adversarial Pruning (Adv-Pruning), to fine-tune existing adversarial examples to enhance their dodging capabilities while preserving their impersonation capabilities. Adv-Pruning consists of Priming, Pruning, and Restoration stages. Concretely, we propose Adversarial Priority Quantification to measure the region-wise priority of original adversarial perturbations, identifying and releasing those with minimal impact on absolute model output variances. Then, Biased Gradient Adaptation is presented to adapt the adversarial examples to traverse the decision boundaries of both the attacker and victim by adding perturbations favoring dodging attacks on the vacated regions, preserving the prioritized features of the original perturbations while boosting dodging performance. As a result, we can maintain the impersonation capabilities of original adversarial examples while effectively enhancing dodging capabilities. Comprehensive experiments demonstrate the superiority of our method compared with state-of-the-art adversarial attack methods.

Rethinking Impersonation and Dodging Attacks on Face Recognition Systems

TL;DR

Face recognition systems can be deceived by adversarial examples, yet success in impersonation does not guarantee dodging in black-box FR due to open-set, multi-identity samples. The authors introduce Adv-Pruning, a three-stage attack (Priming, Pruning, Restoration) that uses Adversarial Priority Quantification to prune low-impact perturbations and Biased Gradient Adaptation to bias remaining perturbations toward dodging, while preserving impersonation. They formalize impersonation () and dodging () losses and show that a multi-task objective suffers in black-box settings; Adv-Pruning mitigates this by freeing space for dodging-focused perturbations. Extensive experiments across datasets and models demonstrate significantly improved dodging ASR with minimal loss to impersonation, including under JPEG compression and on adversarially robust FR models, indicating practical impact for assessing FR security.

Abstract

Face Recognition (FR) systems can be easily deceived by adversarial examples that manipulate benign face images through imperceptible perturbations. Adversarial attacks on FR encompass two types: impersonation (targeted) attacks and dodging (untargeted) attacks. Previous methods often achieve a successful impersonation attack on FR, however, it does not necessarily guarantee a successful dodging attack on FR in the black-box setting. In this paper, our key insight is that the generation of adversarial examples should perform both impersonation and dodging attacks simultaneously. To this end, we propose a novel attack method termed as Adversarial Pruning (Adv-Pruning), to fine-tune existing adversarial examples to enhance their dodging capabilities while preserving their impersonation capabilities. Adv-Pruning consists of Priming, Pruning, and Restoration stages. Concretely, we propose Adversarial Priority Quantification to measure the region-wise priority of original adversarial perturbations, identifying and releasing those with minimal impact on absolute model output variances. Then, Biased Gradient Adaptation is presented to adapt the adversarial examples to traverse the decision boundaries of both the attacker and victim by adding perturbations favoring dodging attacks on the vacated regions, preserving the prioritized features of the original perturbations while boosting dodging performance. As a result, we can maintain the impersonation capabilities of original adversarial examples while effectively enhancing dodging capabilities. Comprehensive experiments demonstrate the superiority of our method compared with state-of-the-art adversarial attack methods.
Paper Structure (14 sections, 12 equations, 7 figures, 3 tables)

This paper contains 14 sections, 12 equations, 7 figures, 3 tables.

Figures (7)

  • Figure 1: Top: previous methods that achieve a successful impersonation attack on FR cannot guarantee a successful dodging attack on FR in the black-box setting. In contrast, we present Adv-Pruning, including Priming, Pruning, and Restoration Stages, to perform both impersonation and dodging attacks simultaneously. Bottom (left): benign Multi-identity Samples (MS). Bottom (right): the dodging Attack Success Rate (%) between the previous methods and Adv-Pruning on multiple models.
  • Figure 2: Overview of our Adv-Pruning attack framework, which consists of Priming, Pruning, and Restoration stages. (a) During the Priming stage, we optimize the adversarial examples to ensure they have sufficient attack performance. (b) In the Pruning stage, we propose Adversarial Priority Quantification to quantify the priority of adversarial perturbations. Subsequently, we sparsify the adversarial perturbations based on the quantified priorities. (c) In the Restoration stage, we present Biased Gradient Adaptation to introduce gradient perturbations biased to dodging attacks on the sparsified regions.
  • Figure 3: The ASR on victim models trained by multiple algorithms with IR152 as the surrogate model.
  • Figure 4: Comparisons of ASR (%) on LFW with adversarial robust models as victim models.
  • Figure 5: The Illustration of adversarial examples crafted by various attacks. First column: attacker images. Last column: the corresponding victim images. The second to fifth columns exhibit the corresponding adversarial examples of BPFA, BPFA + Ours, BSR, and BSR + Ours, respectively.
  • ...and 2 more figures