Hamiltonian-based Quantum Reinforcement Learning for Neural Combinatorial Optimization
Georg Kruse, Rodrigo Coehlo, Andreas Rosskopf, Robert Wille, Jeanette Miriam Lorenz
TL;DR
This work tackles binary combinatorial optimization by marrying quantum computing with neural reinforcement learning through a Hamiltonian-based QRL framework. The method constructs variational quantum circuit ansatzes directly from the problem's QUBO Hamiltonian (notably the sge-sgv design), enabling application to a broad class of CO problems and addressing trainability via inequality encoding and correlated parameterization. Empirical results on Weighted-MaxCut, Unit Commitment, and Knapsack show that Hamiltonian-based QRL can outperform QAOA in identifying optimal or valid solutions, while maintaining feasibility through masking and problem-aligned rewards, albeit with higher training cost. Overall, the approach offers a scalable, generalizable quantum-classical learning paradigm with promising potential for hardware-efficient implementations and strong generalization to unseen problem instances.
Abstract
Advancements in Quantum Computing (QC) and Neural Combinatorial Optimization (NCO) represent promising steps in tackling complex computational challenges. On the one hand, Variational Quantum Algorithms such as QAOA can be used to solve a wide range of combinatorial optimization problems. On the other hand, the same class of problems can be solved by NCO, a method that has shown promising results, particularly since the introduction of Graph Neural Networks. Given recent advances in both research areas, we introduce Hamiltonian-based Quantum Reinforcement Learning (QRL), an approach at the intersection of QC and NCO. We model our ansatzes directly on the combinatorial optimization problem's Hamiltonian formulation, which allows us to apply our approach to a broad class of problems. Our ansatzes show favourable trainability properties when compared to the hardware efficient ansatzes, while also not being limited to graph-based problems, unlike previous works. In this work, we evaluate the performance of Hamiltonian-based QRL on a diverse set of combinatorial optimization problems to demonstrate the broad applicability of our approach and compare it to QAOA.
