Hardware Co-Designed Optimal Control for Programmable Atomic Quantum Processors via Reinforcement Learning

Qian Ding; Dirk Englund

Hardware Co-Designed Optimal Control for Programmable Atomic Quantum Processors via Reinforcement Learning

Qian Ding, Dirk Englund

Abstract

Developing scalable, fault-tolerant atomic quantum processors requires precise control over large arrays of optical beams. This remains a major challenge due to inherent imperfections in classical control hardware, such as inter-channel crosstalk and beam leakage. In this work, we introduce a hardware co-designed intelligent quantum control framework to address these limitations. We construct a mathematical model of the photonic control hardware, integrate it into the quantum optimal control (QOC) framework, and apply reinforcement learning (RL) techniques to discover optimal control strategies. We demonstrate that the proposed framework enables robust, high-fidelity parallel single-qubit gate operations under realistic control conditions, where each atom is individually addressed by an optical beam. Specifically, we implement and benchmark three optimization strategies: a classical hybrid Self-Adaptive Differential Evolution-Adam (SADE-Adam) optimizer, a conventional RL approach based on Proximal Policy Optimization (PPO), and a novel end-to-end differentiable RL method. Using SADE-Adam as a baseline, we find that while PPO performance degrades as system complexity increases, the end-to-end differentiable RL consistently achieves gate fidelities above 99.9$\%$, exhibits faster convergence, and maintains robustness under varied channel crosstalk strength and randomized dynamic control imperfections.

Hardware Co-Designed Optimal Control for Programmable Atomic Quantum Processors via Reinforcement Learning

Abstract

Hardware Co-Designed Optimal Control for Programmable Atomic Quantum Processors via Reinforcement Learning

Abstract

Paper Structure

Table of Contents

Figures (3)