Learning with Density Matrices and Random Features
Fabio A. González, Alejandro Gallego, Santiago Toledo-Cortés, Vladimir Vargas-Calderón
TL;DR
This work introduces a quantum-inspired framework that uses density matrices as building blocks for machine learning tasks. By coupling density matrices with random Fourier features, the authors show how arbitrary distributions over $\mathbb{R}^n$ can be represented and learned in a differentiable, GPU-friendly manner, enabling density estimation, classification, and regression. The paper develops four methods (DMKDE, DMKDC, QMC, QMR) with differentiable training strategies, including optimization-based (SGD) approaches and optimization-free estimators, and validates them across density estimation, classification, and ordinal regression benchmarks. Empirical results demonstrate competitive performance against standard KDE and SVM baselines, while offering favorable scalability and integration with deep architectures. The work opens avenues for future exploration of complex-valued density matrices and entanglement-inspired representations in quantum-inspired classical learning frameworks.
Abstract
A density matrix describes the statistical state of a quantum system. It is a powerful formalism to represent both the quantum and classical uncertainty of quantum systems and to express different statistical operations such as measurement, system combination and expectations as linear algebra operations. This paper explores how density matrices can be used as a building block for machine learning models exploiting their ability to straightforwardly combine linear algebra and probability. One of the main results of the paper is to show that density matrices coupled with random Fourier features could approximate arbitrary probability distributions over $\mathbb{R}^n$. Based on this finding the paper builds different models for density estimation, classification and regression. These models are differentiable, so it is possible to integrate them with other differentiable components, such as deep learning architectures and to learn their parameters using gradient-based optimization. In addition, the paper presents optimization-less training strategies based on estimation and model averaging. The models are evaluated in benchmark tasks and the results are reported and discussed.
