The curse of random quantum data
Kaining Zhang, Junyu Liu, Liu Liu, Liang Jiang, Min-Hsiu Hsieh, Dacheng Tao
TL;DR
The paper identifies a fundamental curse of random quantum data: when quantum inputs are uniformly random, the learning performance of quantum kernel methods and wide quantum neural networks degrades exponentially with the Hilbert-space dimension. It develops a rigorous framework based on the quantum neural tangent kernel (QNTK) to quantify training dynamics and generalization, establishing bounds that show limited test-improvement for small-to-moderate training sets drawn from Haar-like distributions. It further shows that the QNTK spectrum collapses for random quantum data, but that carefully designed state distributions with biased Pauli-coefficient variance can restore meaningful spectra and enable efficient convergence. Numerical experiments on quantum dynamics learning and binary classification corroborate the theory, illustrating how data design can dramatically affect convergence speed and generalization, and highlighting a practical pathway to achieving robust quantum learning by engineering quantum datasets and encodings.
Abstract
Quantum machine learning, which involves running machine learning algorithms on quantum devices, may be one of the most significant flagship applications for these devices. Unlike its classical counterparts, the role of data in quantum machine learning has not been fully understood. In this work, we quantify the performances of quantum machine learning in the landscape of quantum data. Provided that the encoding of quantum data is sufficiently random, the performance, we find that the training efficiency and generalization capabilities in quantum machine learning will be exponentially suppressed with the increase in the number of qubits, which we call "the curse of random quantum data". Our findings apply to both the quantum kernel method and the large-width limit of quantum neural networks. Conversely, we highlight that through meticulous design of quantum datasets, it is possible to avoid these curses, thereby achieving efficient convergence and robust generalization. Our conclusions are corroborated by extensive numerical simulations.
