Enhancing Transferability of Adversarial Attacks with GE-AdvGAN+: A Comprehensive Framework for Gradient Editing

Zhibo Jin; Jiayu Zhang; Zhiyu Zhu; Chenyu Zhang; Jiahao Huang; Jianlong Zhou; Fang Chen

TL;DR

This paper proposes GE-AdvGAN+, a novel general framework for gradient editing-based transferable attacks that integrates nearly all mainstream attack methods to enhance transferability while significantly reducing computational resource consumption.

Abstract

Transferable adversarial attacks pose significant threats to deep neural networks, particularly in black-box scenarios where internal model information is inaccessible. Studying adversarial attack methods helps advance the performance of defense mechanisms and explore model vulnerabilities. These methods can uncover and exploit weaknesses in models, promoting the development of more robust architectures. However, current methods for transferable attacks often come with substantial computational costs, limiting their deployment and application, especially in edge computing scenarios. Adversarial generative models, such as Generative Adversarial Networks (GANs), can generate samples without retraining once an initial training phase is complete. GE-AdvGAN, a recent method for transferable adversarial attacks, is based on this principle. In this paper, we propose a novel general framework for gradient editing-based transferable attacks, named GE-AdvGAN+, which integrates nearly all mainstream attack methods to enhance transferability while significantly reducing computational resource consumption. Our experiments demonstrate the compatibility and effectiveness of our framework. Compared to the baseline AdvGAN, our best-performing method, GE-AdvGAN++, achieves an average attack success rate (ASR) improvement of 47.8. It also surpasses the latest competing algorithm, GE-AdvGAN, with an average ASR increase of 5.9. The framework further exhibits enhanced computational efficiency, achieving 2217.7 frames per second (FPS) and outperforming traditional methods such as BIM and MI-FGSM. The implementation code for our GE-AdvGAN+ framework is available at https://github.com/GEAdvGANP

Paper Structure (38 sections, 21 equations, 3 figures, 5 tables)

Introduction
Background
White-box Adversarial Attacks
Black-box Adversarial Attacks
Gradient Editing
Semantic Similarity
Target Modification
Methods
Definition of Adversarial Attack
Mathematical Definition of AdvGAN
GE-AdvGAN
Gradient Editing with Additional Information
Mathematical Definition of Gradient Editing
$GE_{BIM}$ & $GE_{PGD}$
$GE_{MIM}$
...and 23 more sections

Figures (3)

Figure 1: Flowchart of a transferability-based attack
Figure 2: Flowchart of the AdvGAN+ framework
Figure 3: Comparison of the transferability of different adversarial attack methods with and without integration into the GE-AdvGAN+ framework. The five subfigures represent experiments with different models as surrogate models. The three colors represent three types of adversarial attack methods, with solid bars representing the original attack methods and shaded bars representing the performance of the GE-AdvGAN+ framework using the corresponding gradient information.
