Train on classical, deploy on quantum: scaling generative quantum machine learning to a thousand qubits
Erik Recio-Armengol, Shahnawaz Ahmed, Joseph Bowles
TL;DR
This work tackles the fundamental scalability bottleneck in variational quantum machine learning by proposing a scalable training paradigm for parameterised IQP circuits. Training is performed classically by rewriting the loss as a mixture of Pauli-$Z$ expectation values and optimising the squared maximum mean discrepancy $\text{MMD}^2(p, q_{\boldsymbol{\theta}})$ using automatic differentiation, enabling circuit sizes up to thousands of qubits. Crucially, sampling from the trained model remains a quantum bottleneck, offering potential quantum advantage when deployed on hardware. The authors show through extensive experiments on six binary datasets that the IQP-based models can learn high-dimensional distributions competitively with, and in some cases outperform, classical baselines, while attributing gains to the role of coherence and carefully designed initialisation and symmetry strategies. Overall, the paper demonstrates a viable path to scalable quantum generative learning that can be explored today at large scales, with practical implications for quantum data and beyond.
Abstract
We propose an approach to generative quantum machine learning that overcomes the fundamental scaling issues of variational quantum circuits. The core idea is to use a class of generative models based on instantaneous quantum polynomial circuits, which we show can be trained efficiently on classical hardware. Although training is classically efficient, sampling from these circuits is widely believed to be classically hard, and so computational advantages are possible when sampling from the trained model on quantum hardware. By combining our approach with a data-dependent parameter initialisation strategy, we do not encounter issues of barren plateaus and successfully circumvent the poor scaling of gradient estimation that plagues traditional approaches to quantum circuit optimisation. We investigate and evaluate our approach on a number of real and synthetic datasets, training models with up to one thousand qubits and hundreds of thousands of parameters. We find that the quantum models can successfully learn from high dimensional data, and perform surprisingly well compared to simple energy-based classical generative models trained with a similar amount of hyperparameter optimisation. Overall, our work demonstrates that a path to scalable quantum generative machine learning exists and can be investigated today at large scales.
