Panda or not Panda? Understanding Adversarial Attacks with Interactive Visualization
Yuzhe You, Jarvis Tse, Jian Zhao
TL;DR
AdvEx addresses the challenge of understanding adversarial attacks in image classification by offering a web-based, multi-level interactive visualization tailored for novices. Its backend generates adversarial examples using both white-box FGSM and black-box ZOO attacks and computes embeddings via an Embedding Projector, while the frontend presents Data Projectors, an Instance-level Attack Explainer, Robustness Analyzers, a Perturbation Adjuster, and integrated tutorials. The authors derive design goals from learner and teacher feedback, implement a plug-and-play framework supporting multiple models and attacks (e.g., CIFAR-10 with VGG/ResNet pairs and TRADES), and validate effectiveness through a novice-user study and an expert interview study. Results indicate strong learning gains, high engagement, and perceived generalizability, with identified opportunities for extending to other datasets, attacks, and ML domains. Collectively, AdvEx bridges theory and practice in AML education and offers a scalable template for visualization-driven ML pedagogy and model robustness assessment.
Abstract
Adversarial machine learning (AML) studies attacks that can fool machine learning algorithms into generating incorrect outcomes as well as the defenses against worst-case attacks to strengthen model robustness. Specifically for image classification, it is challenging to understand adversarial attacks due to their use of subtle perturbations that are not human-interpretable, as well as the variability of attack impacts influenced by diverse methodologies, instance differences, and model architectures. Through a design study with AML learners and teachers, we introduce AdvEx, a multi-level interactive visualization system that comprehensively presents the properties and impacts of evasion attacks on different image classifiers for novice AML learners. We quantitatively and qualitatively assessed AdvEx in a two-part evaluation including user studies and expert interviews. Our results show that AdvEx is not only highly effective as a visualization tool for understanding AML mechanisms, but also provides an engaging and enjoyable learning experience, thus demonstrating its overall benefits for AML learners.
