A Comparative Analysis of Adversarial Robustness for Quantum and Classical Machine Learning Models
Maximilian Wendlinger, Kilian Tscharke, Pascal Debus
TL;DR
This work addresses the robustness gap between quantum and classical machine learning by systemically comparing adversarial attacks across PQCs and classical models on a handcrafted four-class dataset. It introduces a classical Fourier-network surrogate as a middle-ground between quantum and classical representations and analyzes the role of data encoding and regularization on adversarial susceptibility through transfer attacks and Lipschitz-bound estimation. The study demonstrates that regularization can reduce Lipschitz constants and improve robustness in quantum models, while transfer patterns reveal that the quantum-classical boundary is nuanced and highly architecture-dependent. Overall, the findings offer practical guidance for designing more robust QML systems and clarify when quantum models may or may not exhibit robustness advantages in realistic adversarial settings.
Abstract
Quantum machine learning (QML) continues to be an area of tremendous interest from research and industry. While QML models have been shown to be vulnerable to adversarial attacks much in the same manner as classical machine learning models, it is still largely unknown how to compare adversarial attacks on quantum versus classical models. In this paper, we show how to systematically investigate the similarities and differences in adversarial robustness of classical and quantum models using transfer attacks, perturbation patterns and Lipschitz bounds. More specifically, we focus on classification tasks on a handcrafted dataset that allows quantitative analysis for feature attribution. This enables us to get insight, both theoretically and experimentally, on the robustness of classification networks. We start by comparing typical QML model architectures such as amplitude and re-upload encoding circuits with variational parameters to a classical ConvNet architecture. Next, we introduce a classical approximation of QML circuits (originally obtained with Random Fourier Features sampling but adapted in this work to fit a trainable encoding) and evaluate this model, denoted Fourier network, in comparison to other architectures. Our findings show that this Fourier network can be seen as a "middle ground" on the quantum-classical boundary. While adversarial attacks successfully transfer across this boundary in both directions, we also show that regularization helps quantum networks to be more robust, which has direct impact on Lipschitz bounds and transfer attacks.
