Exploiting the equivalence between quantum neural networks and perceptrons
Chris Mingard, Jessica Pointing, Charles London, Yoonsoo Nam, Ard A. Louis
TL;DR
The paper analyzes the expressivity of quantum neural networks (QNNs) by exploiting an exact mapping to a tensor-product perceptron (TPP) that acts on $h = x\otimes x$ (or $x\circ x$ for complex inputs), revealing strong encoding-dependent inductive biases and parity limitations that hinder QNNs as general-purpose learners on classical data. It demonstrates that training QNNs can be circumvented via the TPP mapping, enabling systematic study of function classes on the Boolean dataset and showing that amplitude encoding imposes severe expressivity constraints, while some encodings yield varying biases. The authors propose two pathways to go beyond standard QNNs: using QNNs to compute inner products for classical neural network kernels, and constructing layered non-linear quantum neural networks (DQNNs) with universal approximation capabilities, which exhibit improved inductive bias albeit with computational costs. Overall, the work cautions against claims of quantum advantage on classical data, highlights the potential of quantum approaches for quantum data or hybrid schemes, and underscores the need for careful, transparent benchmarking and scalable architectures.
Abstract
Quantum machine learning models based on parametrized quantum circuits, also called quantum neural networks (QNNs), are considered to be among the most promising candidates for applications on near-term quantum devices. Here we explore the expressivity and inductive bias of QNNs by exploiting an exact mapping from QNNs with inputs $x$ to classical perceptrons acting on $x \otimes x$ (generalised to complex inputs). The simplicity of the perceptron architecture allows us to provide clear examples of the shortcomings of current QNN models, and the many barriers they face to becoming useful general-purpose learning algorithms. For example, a QNN with amplitude encoding cannot express the Boolean parity function for $n\geq 3$, which is but one of an exponential number of data structures that such a QNN is unable to express. Mapping a QNN to a classical perceptron simplifies training, allowing us to systematically study the inductive biases of other, more expressive embeddings on Boolean data. Several popular embeddings primarily produce an inductive bias towards functions with low class balance, reducing their generalisation performance compared to deep neural network architectures which exhibit much richer inductive biases. We explore two alternate strategies that move beyond standard QNNs. In the first, we use a QNN to help generate a classical DNN-inspired kernel. In the second we draw an analogy to the hierarchical structure of deep neural networks and construct a layered non-linear QNN that is provably fully expressive on Boolean data, while also exhibiting a richer inductive bias than simple QNNs. Finally, we discuss characteristics of the QNN literature that may obscure how hard it is to achieve quantum advantage over deep learning algorithms on classical data.
