Quantum Convolutional Neural Networks are (Effectively) Classically Simulable
Pablo Bermejo, Paolo Braccia, Manuel S. Rudolph, Zoë Holmes, Lukasz Cincio, M. Cerezo
TL;DR
The paper argues that Quantum Convolutional Neural Networks (QCNNs) derive their apparent power from operating in a low-bodyness subspace and from being tested on locally-easy datasets, making their action classically simulable when augmented with Pauli shadows. It provides both a theoretical framework and extensive numerical demonstrations showing that random QCNN initializations and the use of shadow tomography enable efficient classical simulations across quantum- and classical-data tasks, including large-scale quantum datasets up to 1024 qubits. The results suggest that current QCNN benchmarks do not imply true quantum advantage and that non-trivial, hard datasets are essential to justify QCNNs, while offering a dequantization perspective and practical routes for quantum-measurement-enhanced learning. The authors also discuss implications for extending these ideas to other quantum neural architectures and advocate re-evaluating benchmarking tasks to avoid trivial datasets.
Abstract
Quantum Convolutional Neural Networks (QCNNs) are widely regarded as a promising model for Quantum Machine Learning (QML). In this work we tie their heuristic success to two facts. First, that when randomly initialized, they can only operate on the information encoded in low-bodyness measurements of their input states. And second, that they are commonly benchmarked on "locally-easy'' datasets whose states are precisely classifiable by the information encoded in these low-bodyness observables subspace. We further show that the QCNN's action on this subspace can be efficiently classically simulated by a classical algorithm equipped with Pauli shadows on the dataset. Indeed, we present a shadow-based simulation of QCNNs on up-to $1024$ qubits for phases of matter classification. Our results can then be understood as highlighting a deeper symptom of QML: Models could only be showing heuristic success because they are benchmarked on simple problems, for which their action can be classically simulated. This insight points to the fact that non-trivial datasets are a truly necessary ingredient for moving forward with QML. To finish, we discuss how our results can be extrapolated to classically simulate other architectures.
