Statistical Complexity of Quantum Learning
Leonardo Banchi, Jason Luke Pereira, Sharu Theresa Jose, Osvaldo Simeone
TL;DR
The paper develops an information-theoretic framework to quantify the statistical complexity of quantum learning, introducing data complexity ($N$), training-copy complexity ($S$), and testing-copy complexity ($V$) alongside model complexity, and analyzes both supervised and unsupervised tasks. By connecting classical statistical learning theory with quantum state discrimination, the authors derive error decompositions into optimality gaps and generalization terms, and compare unconstrained versus constrained operation regimes (e.g., Helstrom discrimination, tomography, kernel methods, and parametric quantum circuits). Key contributions include scaling laws for generalization and knowledge gaps across regimes, principled use of transductive learning for unknown quantum states, and concrete applications to learning phases of matter and entanglement, as well as to classical-shadows and other architectures. The work clarifies how information-theoretic quantities such as trace distance, quantum mutual information, and Rényi entropies govern learnability under realistic resource constraints, offering guidance for designing quantum learning algorithms that balance data efficiency and model expressivity. Overall, the paper provides a unified foundation for assessing quantum learning performance and suggests directions for future research on quantum advantages and generalization guarantees in practical settings.
Abstract
Recent years have seen significant activity on the problem of using data for the purpose of learning properties of quantum systems or of processing classical or quantum data via quantum computing. As in classical learning, quantum learning problems involve settings in which the mechanism generating the data is unknown, and the main goal of a learning algorithm is to ensure satisfactory accuracy levels when only given access to data and, possibly, side information such as expert knowledge. This article reviews the complexity of quantum learning using information-theoretic techniques by focusing on data complexity, copy complexity, and model complexity. Copy complexity arises from the destructive nature of quantum measurements, which irreversibly alter the state to be processed, limiting the information that can be extracted about quantum data. For example, in a quantum system, unlike in classical machine learning, it is generally not possible to evaluate the training loss simultaneously on multiple hypotheses using the same quantum data. To make the paper self-contained and approachable by different research communities, we provide extensive background material on classical results from statistical learning theory, as well as on the distinguishability of quantum states. Throughout, we highlight the differences between quantum and classical learning by addressing both supervised and unsupervised learning, and we provide extensive pointers to the literature.
