A Numerical Gradient Inversion Attack in Variational Quantum Neural-Networks
Georgios Papadopoulos, Shaltiel Eloul, Yash Satsangi, Jamie Heredge, Niraj Kumar, Chun-Fu Chen, Marco Pistoia
TL;DR
This work demonstrates a practical gradient inversion attack on trainable Variational Quantum Neural Networks (VQNNs), showing that input data can be reconstructed from shared gradients despite challenging loss landscapes with many local minima. The authors introduce a numerical scheme that blends finite-difference gradient estimation with adaptive low-pass filtering to locate the global minimum, and further improve convergence using a Kalman-filter-based update. Across multiple datasets and VQNN architectures, the attack recovers inputs with extremely low mean-squared error (down to ~1e-10 in many cases), and is effective even in batch settings, though batch size can reduce recoverability. By comparing to classical neural nets and differential privacy mechanisms, the study provides a practical privacy benchmark for distributed learning with VQNNs and highlights privacy-performance trade-offs inherent to these quantum-classical models. The results underscore data-leakage risks in gradient-based federated or decentralized training and offer a framework for evaluating defenses and model designs against gradient-inversion attacks.
Abstract
The loss landscape of Variational Quantum Neural Networks (VQNNs) is characterized by local minima that grow exponentially with increasing qubits. Because of this, it is more challenging to recover information from model gradients during training compared to classical Neural Networks (NNs). In this paper we present a numerical scheme that successfully reconstructs input training, real-world, practical data from trainable VQNNs' gradients. Our scheme is based on gradient inversion that works by combining gradients estimation with the finite difference method and adaptive low-pass filtering. The scheme is further optimized with Kalman filter to obtain efficient convergence. Our experiments show that our algorithm can invert even batch-trained data, given the VQNN model is sufficiently over-parameterized.
