Boosting the Transferability of Adversarial Attacks with Global Momentum Initialization
Jiafeng Wang, Zhaoyu Chen, Kaixun Jiang, Dingkang Yang, Lingyi Hong, Pinxue Guo, Haijing Guo, Wenqiang Zhang
TL;DR
This work addresses the limited transferability of adversarial examples under defenses by analyzing gradient consistency and introducing Global Momentum Initialization (GI). GI employs gradient pre-convergence and a global search to mitigate gradient elimination and improve momentum convergence, enabling stronger transfer attacks that integrate with existing gradient-based methods and input-transformations. Empirical results show GI yields notable gains across image and video domains, achieving average attack successes up to 95.4% against advanced defenses and approaching white-box performance with ensembles, while remaining more time-efficient than prior high-cost approaches. Overall, GI provides a practical, versatile baseline that enhances black-box transfer attacks and offers a new lens on optimizing adversarial perturbations via gradient consistency.
Abstract
Deep Neural Networks (DNNs) are vulnerable to adversarial examples, which are crafted by adding human-imperceptible perturbations to the benign inputs. Simultaneously, adversarial examples exhibit transferability across models, enabling practical black-box attacks. However, existing methods are still incapable of achieving the desired transfer attack performance. In this work, focusing on gradient optimization and consistency, we analyse the gradient elimination phenomenon as well as the local momentum optimum dilemma. To tackle these challenges, we introduce Global Momentum Initialization (GI), providing global momentum knowledge to mitigate gradient elimination. Specifically, we perform gradient pre-convergence before the attack and a global search during this stage. GI seamlessly integrates with existing transfer methods, significantly improving the success rate of transfer attacks by an average of 6.4% under various advanced defense mechanisms compared to the state-of-the-art method. Ultimately, GI demonstrates strong transferability in both image and video attack domains. Particularly, when attacking advanced defense methods in the image domain, it achieves an average attack success rate of 95.4%. The code is available at $\href{https://github.com/Omenzychen/Global-Momentum-Initialization}{https://github.com/Omenzychen/Global-Momentum-Initialization}$.
