Adversarial Attacks of Vision Tasks in the Past 10 Years: A Survey
Chiyu Zhang, Lu Zhou, Xiaogang Xu, Jiafei Wu, Zhe Liu
TL;DR
This paper surveys adversarial attacks in vision over the past decade, contrasting traditional, single-modal attacks with emerging LVLM-era threats. It develops a unified framework around adversariality, transferability, and generalization, and presents detailed threat models, victim models, datasets, and evaluation metrics for both traditional and LVLM contexts. The authors categorize attacks by knowledge, purposes, and techniques, and highlight two LVLM-specific generalizations—Cross-Prompt and Cross-Corpus—along with multimodal attack strategies. They also review defenses across training and inference phases and outline future directions, emphasizing transferability, stealth, physical robustness, and efficiency to guide robust LVLM design and evaluation. Overall, the work offers a comprehensive, actionable synthesis to inform defenses and future exploration in visual adversarial attacks across modalities.
Abstract
With the advent of Large Vision-Language Models (LVLMs), new attack vectors, such as cognitive bias, prompt injection, and jailbreaking, have emerged. Understanding these attacks promotes system robustness improvement and neural networks demystification. However, existing surveys often target attack taxonomy and lack in-depth analysis like 1) unified insights into adversariality, transferability, and generalization; 2) detailed evaluations framework; 3) motivation-driven attack categorizations; and 4) an integrated perspective on both traditional and LVLM attacks. This article addresses these gaps by offering a thorough summary of traditional and LVLM adversarial attacks, emphasizing their connections and distinctions, and providing actionable insights for future research.
