Reinforcement learning-based architecture search for quantum machine learning
Frederic Rapp, David A. Kreplin, Marco F. Huber, Marco Roth
TL;DR
The paper tackles the challenge of designing problem-specific encoding circuits for quantum machine learning by framing circuit generation as a model-based reinforcement learning problem. Using MuZero, it builds a layered encoding circuit search (MCS) that optimizes a cross-validation-based reward while respecting hardware constraints and circuit depth, leading to sample-efficient exploration. Across regression and classification tasks, MCS circuits consistently outperform literature reference circuits and genetic/random baselines, with QSVMs leveraging the projected quantum kernel for efficient evaluation. The findings demonstrate the practical impact of automated, data-driven circuit design for QML and point to future extensions to other QML models and symmetry-aware architectures.
Abstract
Quantum machine learning models use encoding circuits to map data into a quantum Hilbert space. While it is well known that the architecture of these circuits significantly influences core properties of the resulting model, they are often chosen heuristically. In this work, we present a novel approach using reinforcement learning techniques to generate problem-specific encoding circuits to improve the performance of quantum machine learning models. By specifically using a model-based reinforcement learning algorithm, we reduce the number of necessary circuit evaluations during the search, providing a sample-efficient framework. In contrast to previous search algorithms, our method uses a layered circuit structure that significantly reduces the search space. Additionally, our approach can account for multiple objectives such as solution quality, hardware restrictions and circuit depth. We benchmark our tailored circuits against various reference models, including models with problem-agnostic circuits and classical models. Our results highlight the effectiveness of problem-specific encoding circuits in enhancing QML model performance.
