Density Matrix Emulation of Quantum Recurrent Neural Networks for Multivariate Time Series Prediction
José Daniel Viqueira, Daniel Faílde, Mariamo M. Juane, Andrés Gómez, David Mera
TL;DR
This paper tackles training and simulating Quantum Recurrent Neural Networks (QRNNs) for multivariate time series by introducing a density-matrix emulation framework that handles mid-circuit measurements via an operator-sum representation. It derives exact analytical first- and second-order derivatives with respect to trainable parameters using the Parameter Shift Rule, enabling gradient-based optimization even when outputs are noisy. The authors implement a hardware-efficient QRNN ansatz, test on four diverse datasets, and compare against classical RNNs and a Quantum Reservoir Computing (QRC) baseline, showing competitive predictive performance and feasible training with small qubit counts (around 5 qubits). The work provides a principled, scalable pathway to evaluate QRNNs in the near term, offering explicit tools for emulation, gradient computation, and performance benchmarking relevant to quantum hardware development and time-series prediction tasks.
Abstract
Quantum Recurrent Neural Networks (QRNNs) are robust candidates for modelling and predicting future values in multivariate time series. However, the effective implementation of some QRNN models is limited by the need for mid-circuit measurements. Those increase the requirements for quantum hardware, which in the current NISQ era does not allow reliable computations. Emulation arises as the main near-term alternative to explore the potential of QRNNs, but existing quantum emulators are not dedicated to circuits with multiple intermediate measurements. In this context, we design a specific emulation method that relies on density matrix formalism. Using a compact tensor notation, we provide the mathematical formulation of the operator-sum representation involved. This allows us to show how the present and past information from a time series is transmitted through the circuit, and how to reduce the computational cost in every time step of the emulated network. In addition, we derive the analytical gradient and the Hessian of the network outputs with respect to its trainable parameters, which are needed when the outputs have stochastic noise due to hardware errors and a finite number of circuit shots (sampling). We finally test the presented methods using a hardware-efficient ansatz and four diverse datasets that include univariate and multivariate time series, with and without sampling noise. In addition, we compare the model with other existing quantum and classical approaches. Our results show how QRNNs can be trained with numerical and analytical gradients to make accurate predictions of future values by capturing non-trivial patterns of input series with different complexities.
