Understanding the Nature of Depth-1 Equivariant Quantum Circuit
Jonathan Teo, Lee Xin Wei, Hoong Chuin Lau
TL;DR
This paper investigates the depth-1 Equivariant Quantum Circuit (EQC) for solving the Traveling Salesman Problem within Quantum Reinforcement Learning (QRL). It introduces Size-Invariant Grid Search (SIGS), a lightweight training optimization that leverages size-invariant properties to replicate depth-1 EQC performance while dramatically reducing runtime, enabling simulations up to 350-node TSP instances. The work provides both theoretical results—characterizing the favorable parameter region and constraining the search space—and extensive simulations showing SIGS matches RL-based training in performance with orders of magnitude faster evaluation. It further analyzes limitations of depth-1 EQCs, demonstrates limited gains from deeper EQCs in this setting, and positions SIGS as a practical benchmarking tool for the QRL community, with potential applicability to broader combinatorial optimization problems.
Abstract
The Equivariant Quantum Circuit (EQC) for the Travelling Salesman Problem (TSP) has been shown to achieve near-optimal performance in solving small TSP problems (up to 20 nodes) using only two parameters at depth 1. However, extending EQCs to larger TSP problem sizes remains challenging due to the exponential time and memory for quantum circuit simulation, as well as increasing noise and decoherence when running on actual quantum hardware. In this work, we propose the Size-Invariant Grid Search (SIGS), an efficient training optimization for Quantum Reinforcement Learning (QRL), and use it to simulate the outputs of a trained Depth-1 EQC up to 350-node TSP instances - well beyond previously tractable limits. At TSP with 100 nodes, we reduce total simulation times by 96.4%, when comparing to RL simulations with the analytical expression (151 minutes using RL to under 6 minutes using SIGS on TSP-100), while achieving a mean optimality gap within 0.005 of the RL trained model on the test set. SIGS provides a practical benchmarking tool for the QRL community, allowing us to efficiently analyze the performance of QRL algorithms on larger problem sizes. We provide a theoretical explanation for SIGS called the Size-Invariant Properties that goes beyond the concept of equivariance discussed in prior literature.
