On quantum backpropagation, information reuse, and cheating measurement collapse
Amira Abbas, Robbie King, Hsin-Yuan Huang, William J. Huggins, Ramis Movassagh, Dar Gilboa, Jarrod R. McClean
TL;DR
This work analyzes whether parameterized quantum models can achieve backpropagation-like training efficiency. It introduces an online shadow tomography framework to reuse information across layers, showing that single-copy access cannot yield backpropagation scaling in general, while multi-copy access with gentle measurements can approach the classical gradient scaling in quantum resources, albeit with potentially exponential classical overhead. A concrete quantum-efficient protocol reduces quantum gradient costs to $O(M\,\mathrm{polylog}(M))$ operations at the expense of storing a hypothesis state, and its connection to shadow tomography demonstrates fundamental limits tied to efficiently learning observables. The paper also proves that fully gentle strategies alone cannot suffice due to Grover-type bounds, and discusses approximate schemes (e.g., tensor networks) as practical avenues. Overall, it clarifies the nuanced landscape of training large quantum models and motivates targeted architectural or approximation-based approaches for scalable quantum learning.
Abstract
The success of modern deep learning hinges on the ability to train neural networks at scale. Through clever reuse of intermediate information, backpropagation facilitates training through gradient computation at a total cost roughly proportional to running the function, rather than incurring an additional factor proportional to the number of parameters - which can now be in the trillions. Naively, one expects that quantum measurement collapse entirely rules out the reuse of quantum information as in backpropagation. But recent developments in shadow tomography, which assumes access to multiple copies of a quantum state, have challenged that notion. Here, we investigate whether parameterized quantum models can train as efficiently as classical neural networks. We show that achieving backpropagation scaling is impossible without access to multiple copies of a state. With this added ability, we introduce an algorithm with foundations in shadow tomography that matches backpropagation scaling in quantum resources while reducing classical auxiliary computational costs to open problems in shadow tomography. These results highlight the nuance of reusing quantum information for practical purposes and clarify the unique difficulties in training large quantum models, which could alter the course of quantum machine learning.
