Reinforcement learning with learned gadgets to tackle hard quantum problems on real hardware

Akash Kundu; Leopoldo Sarra

Reinforcement learning with learned gadgets to tackle hard quantum problems on real hardware

Akash Kundu, Leopoldo Sarra

TL;DR

This work introduces gadget reinforcement learning (GRL), a framework that merges reinforcement learning with program synthesis to autonomously construct and incorporate composite circuit components, or gadgets, into the action space for variational quantum algorithms. By learning from simple TFIM instances and transferring those gadgets to harder regimes, GRL achieves faster convergence, higher accuracy, and hardware-compatible, more compact PQCs, including circuits that perform well on IBM hardware with reduced transpilation overhead. The approach demonstrates scalable gains, effective transfer across problem sizes, and resilience to noise when using gadget redundancy, indicating practical potential for hardware-aware quantum circuit design. Overall, GRL bridges algorithmic design and real hardware constraints, enabling more efficient exploration of PQCs under realistic budgets and paving the way for broader adoption of automated quantum circuit optimization.

Abstract

Designing quantum circuits for specific tasks is challenging due to the exponential growth of the state space. We introduce gadget reinforcement learning (GRL), which integrates reinforcement learning with program synthesis to automatically generate and incorporate composite gates (gadgets) into the action space. This enhances the exploration of parameterized quantum circuits (PQCs) for complex tasks like approximating ground states of quantum Hamiltonians, an NP-hard problem. We evaluate GRL using the transverse field Ising model under typical computational budgets (e.g., 2- 3 days of GPU runtime). Our results show improved accuracy, hardware compatibility and scalability. GRL exhibits robust performance as the size and complexity of the problem increases, even with constrained computational resources. By integrating gadget extraction, GRL facilitates the discovery of reusable circuit components tailored for specific hardware, bridging the gap between algorithmic design and practical implementation. This makes GRL a versatile framework for optimizing quantum circuits with applications in hardware-specific optimizations and variational quantum algorithms. The code is available at: https://github.com/Aqasch/Gadget_RL

Reinforcement learning with learned gadgets to tackle hard quantum problems on real hardware

TL;DR

Abstract

Reinforcement learning with learned gadgets to tackle hard quantum problems on real hardware

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (13)