Sample-based training of quantum generative models
Maria Demidik, Cenk Tüysüz, Michele Grossi, Karl Jansen
TL;DR
The paper tackles the challenge of efficiently training quantum generative models, specifically quantum and semi-quantum Boltzmann machines, where gradient evaluation is costly. It develops a generalized contrastive-divergence (CD) framework that leverages conditional sampling circuits to realize CD-k with a constant forward-pass cost, enabling training directly on quantum hardware. A key theoretical result provides closed-form conditional probabilities $p(h^P\mid v)$ and $p(v\mid h^P)$ and corresponding quantum circuits, while only $p(v\mid h^P)$ requires quantum resources; classical sampling handles $p(h^P\mid v)$. Numerical experiments on Bars-and-Stripes and Gaussian data show that generalized CD achieves comparable accuracy to likelihood-based training with substantially fewer samples, and sqRBMs can be trained with fewer hidden units than RBMs. Overall, the framework offers a scalable, sample-efficient route to training expressive quantum generative models on quantum hardware, advancing practical quantum machine learning capabilities.
Abstract
Quantum computers can efficiently sample from probability distributions that are believed to be classically intractable, providing a foundation for quantum generative modeling. However, practical training of such models remains challenging, as gradient evaluation via the parameter-shift rule scales linearly with the number of parameters and requires repeated expectation-value estimation under finite-shot noise. We introduce a training framework that extends the principle of contrastive divergence to quantum models. By deriving the circuit structure and providing a general recipe for constructing it, we obtain quantum circuits that generate the samples required for parameter updates, yielding constant scaling with respect to the cost of a forward pass, analogous to backpropagation in classical neural networks. Numerical results demonstrate that it attains comparable accuracy to likelihood-based optimization while requiring substantially fewer samples. The framework thereby establishes a scalable route to training expressive quantum generative models directly on quantum hardware.
