Dynamic Inhomogeneous Quantum Resource Scheduling with Reinforcement Learning
Linsen Li, Pratyush Anand, Kaiming He, Dirk Englund
TL;DR
This work tackles the NP-hard problem of dynamic inhomogeneous quantum resource scheduling by embedding it in a digitized Monte Carlo environment and applying reinforcement learning with a Transformer-on-QuPairs architecture. The approach processes all qubit-pair links with self-attention to produce next-step scheduling decisions, yielding more than a 3× improvement in quantum-resource performance over rule-based baselines. The framework demonstrates scalability to larger qubit sets and varying inhomogeneity, and shows strong transfer learning potential across system sizes. These results support a path toward co-design of physical and control layers for quantum networks, computing, and communication, by enabling real-time, high-fidelity resource state construction under probabilistic Heralded entanglement dynamics.
Abstract
A central challenge in quantum information science and technology is achieving real-time estimation and feedforward control of quantum systems. This challenge is compounded by the inherent inhomogeneity of quantum resources, such as qubit properties and controls, and their intrinsically probabilistic nature. This leads to stochastic challenges in error detection and probabilistic outcomes in processes such as heralded remote entanglement. Given these complexities, optimizing the construction of quantum resource states is an NP-hard problem. In this paper, we address the quantum resource scheduling issue by formulating the problem and simulating it within a digitized environment, allowing the exploration and development of agent-based optimization strategies. We employ reinforcement learning agents within this probabilistic setting and introduce a new framework utilizing a Transformer model that emphasizes self-attention mechanisms for pairs of qubits. This approach facilitates dynamic scheduling by providing real-time, next-step guidance. Our method significantly improves the performance of quantum systems, achieving more than a 3$\times$ improvement over rule-based agents, and establishes an innovative framework that improves the joint design of physical and control systems for quantum applications in communication, networking, and computing.
