Gate Sequence Optimization for Parameterized Quantum Circuits using Reinforcement Learning
Tom R. Rieckmann, Stefan Scheel, A. Douglas K. Plato
TL;DR
The paper tackles noise from entangling gates on NISQ devices by optimizing entangling gate sequences for parameterized quantum circuits using reinforcement learning. It introduces a Double Deep Q-Network framework that sequentially designs gate sequences and tunes continuous gate parameters with fidelity-based rewards, without requiring a priori optimal baselines. Compared with fixed-layered hardware-efficient ansätze, the RL approach achieves higher state-preparation fidelities with fewer CNOTs across multiple connectivities and device simulations, including IBM backends. This flexible, tunable method offers practical improvements for near-term quantum algorithms and can be extended to incorporate real-device noise data.
Abstract
Current experimental quantum computing devices are limited by noise, mainly originating from entangling gates. If an efficient gate sequence for an operation is unknown, one often employs layered parameterized quantum circuits, especially hardware-efficient ansätze, with fixed entangling layer structures. We demonstrate a reinforcement learning algorithm to improve on these by optimizing the entangling gate sequence in the task of quantum state preparation. This allows us to restrict the required number of CNOT gates while taking the qubit connectivity architecture into account. Recent advancements using reinforcement learning have already demonstrated the power of this technique when optimizing the circuit for a sequence of non-parameterized gates. We extend this approach to parameterized gate sets by incorporating general single-qubit unitaries, thus allowing us to consistently reach higher state preparation fidelities at the same number of CNOT gates compared to a hardware-efficient ansatz.
