Benchmarking Quantum Reinforcement Learning

Nico Meyer; Christian Ufrecht; George Yammine; Georgios Kontes; Christopher Mutschler; Daniel D. Scherer

TL;DR

The paper tackles the lack of standardized benchmarking in reinforcement learning and quantum reinforcement learning by introducing a statistically grounded sample-complexity estimator and a rigorously defined outperformance criterion. It pairs this methodology with a flexible BeamManagement6G benchmark to enable scalable, reproducible comparisons between classical and hybrid quantum agents, evaluating both off-policy DDQN and on-policy PPO. Across extensive, 100-seed experiments, hybrid quantum models demonstrate competitive sample efficiency and occasional advantages over similarly sized classical networks, though no definitive quantum advantage emerges without scaling to larger, hardware-capable qubit counts. The work emphasizes the empirical nature of quantum advantage, calls for larger-scale studies, and provides open-source tools to promote rigorous, reproducible benchmarking in QRL.

Abstract

Benchmarking and establishing proper statistical validation metrics for reinforcement learning (RL) remain ongoing challenges, where no consensus has been established yet. The emergence of quantum computing and its potential applications in quantum reinforcement learning (QRL) further complicate benchmarking efforts. To enable valid performance comparisons and to streamline current research in this area, we propose a novel benchmarking methodology, which is based on a statistical estimator for sample complexity and a definition of statistical outperformance. Furthermore, considering QRL, our methodology casts doubt on some previous claims regarding its superiority. We conducted experiments on a novel benchmarking environment with flexible levels of complexity. While we still identify possible advantages, our findings are more nuanced overall. We discuss the potential limitations of these results and explore their implications for empirical research on quantum advantage in QRL.
