QASER: Breaking the Depth vs. Accuracy Trade-Off for Quantum Architecture Search
Ioana Moflic, Alexandru Paler, Akash Kundu
TL;DR
The paper tackles the depth–accuracy trade-off in quantum circuit design by introducing QASER, an exponential, multi-objective reward for reinforcement-learning–based quantum architecture search. By tracking the historical maxima of depth and gate cost and coupling them with energy via an exponent, QASER steers the search toward circuits that are simultaneously shallow, resource-efficient, and accurate. Empirical results on quantum chemistry ground-state preparation show up to 20% fewer 2-qubit gates, reduced circuit depth, and up to 50% gains in accuracy compared to state-of-the-art RL-QAS methods, under both noisy and noiseless conditions, with further acceleration observed in warm-start TensorRL-QAS. The findings highlight reward engineering as a potent lever to improve hardware-aware quantum compilation, suggesting practical gains for post-NISQ implementations and scalable QAS workflows.
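As a rough illustration of the idea described above, the following Python sketch shows one way a reward could track the running maxima of depth and 2-qubit gate count and fold them into an exponent together with the energy error. It is not the paper's formula, which is not stated in this summary. The names `RewardTracker`, `sketch_reward`, and `energy_weight`, along with the specific functional form, are assumptions made purely for illustration.

```python
import math
from dataclasses import dataclass


@dataclass
class RewardTracker:
    """Hypothetical helper: remembers the largest depth and gate count seen so far."""
    max_depth: int = 1   # start at 1 to avoid division by zero
    max_gates: int = 1

    def update(self, depth: int, gates: int) -> None:
        # Keep the historical maxima over all circuits evaluated so far.
        self.max_depth = max(self.max_depth, depth)
        self.max_gates = max(self.max_gates, gates)


def sketch_reward(energy_error: float, depth: int, gates: int,
                  tracker: RewardTracker, energy_weight: float = 1.0) -> float:
    """Illustrative exponential multi-objective reward (assumed form, not QASER itself)."""
    tracker.update(depth, gates)
    # Normalised cost in [0, 1]: 1 means as deep and as costly as the worst circuit seen.
    cost = 0.5 * (depth / tracker.max_depth + gates / tracker.max_gates)
    # Energy error and cost share one exponent, so the reward approaches 1
    # only when the circuit is both accurate and cheap.
    return math.exp(-(energy_weight * energy_error + cost))


tracker = RewardTracker()
r_deep = sketch_reward(energy_error=0.1, depth=10, gates=5, tracker=tracker)
r_shallow = sketch_reward(energy_error=0.1, depth=5, gates=5, tracker=tracker)
r_accurate = sketch_reward(energy_error=0.01, depth=5, gates=5, tracker=tracker)
print(r_deep < r_shallow < r_accurate)
```

Under this sketch, a circuit that is shallower than the deepest one seen so far earns a higher reward at equal energy error, and lowering the energy error at equal cost raises the reward further, so neither objective can be improved by sacrificing the other for free.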
Abstract
Quantum computing faces a key challenge: balancing the need for low circuit depth (crucial for fault tolerance) with the high accuracy required for complex computations like quantum chemistry and error correction, which typically require deeper circuits. We overcome this trade-off by introducing a novel reinforcement learning approach featuring engineered reward functions, called QASER, that take into account seemingly contradictory optimization goals. This reward enables the compilation of circuits with lower depth and higher accuracy, significantly outperforming state-of-the-art techniques. Benchmarks on quantum chemistry state preparation circuits demonstrate stable compilations. We achieve up to 50% improved accuracy, while reducing 2-qubit gate counts and depths by 20%. This advancement enables more efficient and reliable quantum compilation.
