Machine learning for efficient generation of universal hybrid quantum computing resources
Amanuel Anteneh, Olivier Pfister
TL;DR
The paper demonstrates that deep reinforcement learning can optimally control a time-multiplexed, measurement-based optical circuit to generate squeezed cat states with an average success rate of $98\%$, providing a viable path to GKP precursors for fault-tolerant continuous-variable quantum computing. By modeling the circuit as a Markov decision process and using PPO, the authors extract state-generation strategies that are robust to measurement outcomes and leverage PNR detection. In addition to high-performance multi-step generation, they identify fixed-parameter single-step protocols, including a moving-target analysis that yields near-optimal fidelities and a practical GKP-breeding pathway achieving $0.99$ fidelity to standard GKP states. These results highlight a scalable, qubit-free optical route to non-Gaussian resources, with immediate implications for efficient resource generation in photonic quantum computing.
Abstract
We present numerical simulations of deep reinforcement learning on a measurement-based quantum processor--a time-multiplexed optical circuit sampled by photon-number-resolving detection--and find it generates squeezed cat states with an average success rate of 98%, far outperforming all other similar proposals.
