Quantum-Enhanced Weight Optimization for Neural Networks Using Grover's Algorithm
Stefan-Alexandru Jura, Mihai Udrescu
TL;DR
The paper addresses the challenge of weight optimization in neural networks by replacing gradient-based updates with a quantum-accelerated, gradient-free search using Grover's algorithm. By discretizing each weight's update space and applying amplitude amplification to select the best candidate, the method achieves a per-weight complexity of $O\left(\sqrt{N}\right)$ and reduces overall training time compared to backpropagation while maintaining or improving accuracy on small to medium datasets. Experiments on Wine and Digits demonstrate rapid convergence, high accuracy (up to 100% in some cases), and robustness to moderate noise, with a scalable architecture that can support deeper networks using a small number of qubits. The work highlights practical advantages for near-term quantum devices, offering a viable path toward quantum-enhanced optimization of classical neural networks and motivating further hardware-oriented refinements and larger-scale evaluations.
Abstract
The main approach to hybrid quantum-classical neural networks (QNN) is employing quantum computing to build a neural network (NN) that has quantum features, which is then optimized classically. Here, we propose a different strategy: to use quantum computing in order to optimize the weights of a classical NN. As such, we design an instance of Grover's quantum search algorithm to accelerate the search for the optimal parameters of an NN during the training process, a task traditionally performed using the backpropagation algorithm with the gradient descent method. Indeed, gradient descent has issues such as exploding gradient, vanishing gradient, or convexity problem. Other methods tried to address such issues with strategies like genetic searches, but they carry additional problems like convergence consistency. Our original method avoids these issues -- because it does not calculate gradients -- and capitalizes on classical architectures' robustness and Grover's quadratic speedup in high-dimensional search spaces to significantly reduce test loss (58.75%) and improve test accuracy (35.25%), compared to classical NN weight optimization, on small datasets. Unlike most QNNs that are trained on small datasets only, our method is also scalable, as it allows the optimization of deep networks; for an NN with 3 hidden layers, trained on the Digits dataset from scikit-learn, we obtained a mean accuracy of 97.7%. Moreover, our method requires a much smaller number of qubits compared to other QNN approaches, making it very practical for near-future quantum computers that will still deliver a limited number of logical qubits.
