Enhancing Adversarial Transferability through Block Stretch and Shrink
Quan Liu, Feng Ye, Chenhao Lu, Shuming Zhen, Guanliang Huang, Lunzhe Chen, Xudong Ke
TL;DR
The work tackles the challenge of transferring white-box adversarial perturbations to black-box models by proposing Block Stretch and Shrink (BSS), an input transformation that diversifies attention heatmaps through block-wise, constrained segmentation and local stretch/shrink while preserving global semantics. It introduces a unified number scale $N$ to standardize evaluation across methods and presents thorough experiments showing BSS outperforms state-of-the-art operators and combinations on CNNs and ViTs, under various defenses and ablations. Key contributions include the constrained segmentation mechanism, dual-dimension deformation, and an empirical framework that reveals the importance of both attention diversity and semantic preservation for high transferability. The results have significant implications for evaluating and understanding transferable adversarial attacks and highlight the need for standardized benchmarks in this research area.
Abstract
Adversarial attacks introduce small, deliberately crafted perturbations that mislead neural networks, and their transferability from white-box to black-box target models remains a critical research focus. Input transformation-based attacks are a subfield of adversarial attacks that enhance input diversity through input transformations to improve the transferability of adversarial examples. However, existing input transformation-based attacks tend to exhibit limited cross-model transferability. Previous studies have shown that high transferability is associated with diverse attention heatmaps and the preservation of global semantics in transformed inputs. Motivated by this observation, we propose Block Stretch and Shrink (BSS), a method that divides an image into blocks and applies stretch and shrink operations to these blocks, thereby diversifying attention heatmaps in transformed inputs while maintaining their global semantics. Empirical evaluations on a subset of ImageNet demonstrate that BSS outperforms existing input transformation-based attack methods in terms of transferability. Furthermore, we examine the impact of the number scale, defined as the number of transformed inputs, in input transformation-based attacks, and advocate evaluating these methods under a unified number scale to enable fair and comparable assessments.
