Quantum Adversarial Learning for Kernel Methods
Giuseppe Montalbano, Leonardo Banchi
TL;DR
This work shows that quantum-kernel classifiers based on QSVMs are susceptible to evasion attacks produced by small input perturbations, paralleling vulnerabilities seen in classical and quantum neural approaches. It develops adversarial training via data augmentation to enhance robustness and demonstrates both simulation and a proof-of-principle hardware experiment on IBM Quantum hardware, including a compact and a large quantum embedding. The results indicate that adversarial training substantially improves resilience against evasion and can partly mitigate hardware noise, while kernel concentration in expressive embeddings remains a challenge. The study provides practical guidance for building more robust quantum kernel methods, highlights the role of kernel alignment in shaping generalization, and points to future work on embedding choice and direct links between robustness and generalization.
Abstract
We show that hybrid quantum classifiers based on quantum kernel methods and support vector machines are vulnerable against adversarial attacks, namely small engineered perturbations of the input data can deceive the classifier into predicting the wrong result. Nonetheless, we also show that simple defence strategies based on data augmentation with a few crafted perturbations can make the classifier robust against new attacks. Our results find applications in security-critical learning problems and in mitigating the effect of some forms of quantum noise, since the attacker can also be understood as part of the surrounding environment.
