Training-efficient density quantum machine learning
Brian Coyle, Snehal Raj, Natansh Mathur, El Amine Cherrat, Nishant Jain, Skander Kazdaghli, Iordanis Kerenidis
TL;DR
This work introduces density quantum neural networks (density QNNs), a framework that forms a density state $\rho(\boldsymbol{\theta},\boldsymbol{\alpha},\boldsymbol{x}) = \sum_{k=1}^K \alpha_k U_k(\boldsymbol{\theta}_k)\rho(\boldsymbol{x})U_k^{\dagger}(\boldsymbol{\theta}_k)$, enabling expressive yet trainable quantum models on depth-limited hardware. By incorporating data-dependent coefficients and two preparation modes (deterministic and randomized), the authors connect density QNNs to LCU and the Hastings-Campbell Mixing lemma, showing that randomised density QNNs can retain LCU benefits with shallower circuits. The paper formally relates density QNNs to classical mechanisms (dropout, mixture-of-experts) and to other QML frameworks (kernel methods, data reuploading, post-variational models), and provides theoretical gradient-extraction bounds and efficient training strategies via commuting-block circuits. Numerical experiments across equivariant XX/YY, orthogonal HW-preserving, and data reuploading density QNNs demonstrate improved trainability, robustness to overfitting, and practical potential for scalable quantum learning on near-term devices.
Abstract
Quantum machine learning (QML) requires powerful, flexible and efficiently trainable models to be successful in solving challenging problems. We introduce density quantum neural networks, a model family that prepares mixtures of trainable unitaries, with a distributional constraint over coefficients. This framework balances expressivity and efficient trainability, especially on quantum hardware. For expressivity, the Hastings-Campbell Mixing lemma converts benefits from linear combination of unitaries into density models with similar performance guarantees but shallower circuits. For trainability, commuting-generator circuits enable density model construction with efficiently extractable gradients. The framework connects to various facets of QML including post-variational and measurement-based learning. In classical settings, density models naturally integrate the mixture of experts formalism, and offer natural overfitting mitigation. The framework is versatile - we uplift several quantum models into density versions to improve model performance, or trainability, or both. These include Hamming weight-preserving and equivariant models, among others. Extensive numerical experiments validate our findings.
