Quantum Circuit Discovery for Fault-Tolerant Logical State Preparation with Reinforcement Learning
Remmy Zen, Jan Olle, Luis Colmenarez, Matteo Puviani, Markus Müller, Florian Marquardt
TL;DR
The paper tackles the challenge of fault-tolerant quantum circuit design under hardware constraints by employing reinforcement learning to discover flag-based FT circuits for quantum error correction. It develops an RL framework that uses stabilizer-tableau representations and discrete Clifford gates, optimizing via proximal policy optimization to produce both logical state preparation circuits and their verification circuits, with an integrated approach (IFT-LSP) showing superior performance. Key findings include RL-generated circuits with smaller gate counts and ancilla overhead across several codes, effective transfer learning to accelerate adaptation to different connectivity, and successful FT circuit synthesis under restricted connectivity such as 2D grids and heavy-hex layouts. The work demonstrates the viability of RL for scalable FT circuit discovery and paves the way for applications to magic state preparation, syndrome measurement, and logical-gate synthesis, potentially impacting the practical realization of large-scale quantum computers.
Abstract
The realization of large-scale quantum computers requires not only quantum error correction (QEC) but also fault-tolerant operations to handle errors that propagate into harmful errors. Recently, flag-based protocols have been introduced that use ancillary qubits to flag harmful errors. However, there is no clear recipe for finding a fault-tolerant quantum circuit with flag-based protocols, especially when we consider hardware constraints, such as qubit connectivity and available gate set. In this work, we propose and explore reinforcement learning (RL) to automatically discover compact and hardware-adapted fault-tolerant quantum circuits. We show that in the task of fault-tolerant logical state preparation, RL discovers circuits with fewer gates and ancillary qubits than published results without and with hardware constraints of up to 15 physical qubits. Furthermore, RL allows for straightforward exploration of different qubit connectivities and the use of transfer learning to accelerate the discovery. More generally, our work opens the door towards the use of RL for the discovery of fault-tolerant quantum circuits for addressing tasks beyond state preparation, including magic state preparation, logical gate synthesis, or syndrome measurement.
