Boosting Adversarial Transferability via Commonality-Oriented Gradient Optimization

Yanting Gao; Yepeng Liu; Junming Liu; Qi Zhang; Hongyun Zhang; Duoqian Miao; Cairong Zhao

Boosting Adversarial Transferability via Commonality-Oriented Gradient Optimization

Yanting Gao, Yepeng Liu, Junming Liu, Qi Zhang, Hongyun Zhang, Duoqian Miao, Cairong Zhao

TL;DR

This work tackles the problem of adversarial overfitting in transfer-based attacks on Vision Transformers by proposing a twofold strategy: Commonality Enhancement (CE) and Individuality Suppression (IS). CE perturbs mid-to-low frequency components to emphasize features shared across ViTs trained on the same data, while IS adaptively suppresses surrogate-specific gradient directions, particularly in the qkv module. The resulting Commonality-Oriented Gradient Optimization (COGO) produces perturbations that align with shared model decision patterns and avoid surrogate biases, yielding substantial transferability gains over state-of-the-art methods across ViTs and CNNs. The approach is validated through extensive experiments, ablations, and gradient-dispersion analyses, demonstrating practical improvements for evaluating and potentially improving ViT robustness in black-box settings.

Abstract

Exploring effective and transferable adversarial examples is vital for understanding the characteristics and mechanisms of Vision Transformers (ViTs). However, adversarial examples generated from surrogate models often exhibit weak transferability in black-box settings due to overfitting. Existing methods improve transferability by diversifying perturbation inputs or applying uniform gradient regularization within surrogate models, yet they have not fully leveraged the shared and unique features of surrogate models trained on the same task, leading to suboptimal transfer performance. Therefore, enhancing perturbations of common information shared by surrogate models and suppressing those tied to individual characteristics offers an effective way to improve transferability. Accordingly, we propose a commonality-oriented gradient optimization strategy (COGO) consisting of two components: Commonality Enhancement (CE) and Individuality Suppression (IS). CE perturbs the mid-to-low frequency regions, leveraging the fact that ViTs trained on the same dataset tend to rely more on mid-to-low frequency information for classification. IS employs adaptive thresholds to evaluate the correlation between backpropagated gradients and model individuality, assigning weights to gradients accordingly. Extensive experiments demonstrate that COGO significantly improves the transfer success rates of adversarial attacks, outperforming current state-of-the-art methods.

Boosting Adversarial Transferability via Commonality-Oriented Gradient Optimization

TL;DR

Abstract

Boosting Adversarial Transferability via Commonality-Oriented Gradient Optimization

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)