Enhancing variational quantum state diagonalization using reinforcement learning techniques
Akash Kundu, Przemysław Bedełek, Mateusz Ostaszewski, Onur Danaci, Yash J. Patel, Vedran Dunjko, Jarosław A. Miszczak
TL;DR
This work addresses diagonalizing quantum states on NISQ hardware by augmenting Variational Quantum State Diagonalization (VQSD) with Reinforcement Learning (RL) to automate compact ansatz construction. It introduces a binary, depth-based RL-state encoding and a dense reward within a Double Deep Q-Network framework, achieving significantly shallower and fewer-gate circuits while preserving or improving eigenvalue accuracy. Across 2-, 3-, and 4-qubit test cases, RL-VQSD outperforms fixed-structure Layered Hardware Efficient Ansatz (LHEA) and exhibits strong scaling advantages, aided by an encoding and reward design that tightly couples the RL agent to the VQSD objective. The approach is readily adaptable to other variational quantum algorithms and demonstrates the potential of RL-based architecture search for quantum data processing on near-term devices.
Abstract
The variational quantum algorithms are crucial for the application of NISQ computers. Such algorithms require short quantum circuits, which are more amenable to implementation on near-term hardware, and many such methods have been developed. One of particular interest is the so-called variational quantum state diagonalization method, which constitutes an important algorithmic subroutine and can be used directly to work with data encoded in quantum states. In particular, it can be applied to discern the features of quantum states, such as entanglement properties of a system, or in quantum machine learning algorithms. In this work, we tackle the problem of designing a very shallow quantum circuit, required in the quantum state diagonalization task, by utilizing reinforcement learning (RL). We use a novel encoding method for the RL-state, a dense reward function, and an $ε$-greedy policy to achieve this. We demonstrate that the circuits proposed by the reinforcement learning methods are shallower than the standard variational quantum state diagonalization algorithm and thus can be used in situations where hardware capabilities limit the depth of quantum circuits. The methods we propose in the paper can be readily adapted to address a wide range of variational quantum algorithms.
