Arbitrary Polynomial Separations in Trainable Quantum Machine Learning
Eric R. Anschuetz, Xun Gao
TL;DR
The paper tackles the challenge of achieving large quantum advantages in machine learning without sacrificing trainability by proposing k-hypergraph recurrent neural networks (k-HRNNs) that are efficiently trainable and remain expressive. It constructs both continuous-variable (qumode) and qubit versions, embedding the network dynamics into low-dimensional Lie subgroups to avoid quantum trainability barriers, while leveraging hypergraph-stabilizer structures to realize powerful quantum contextuality-based expressivity. The authors formalize a sequence-modeling task, (ell,n,k)-HSMT, and prove that classical networks require memory scaling as $inom{n}{k}-1$ to approximate the task, whereas k-HRNNs can perform it with zero error, yielding an arbitrary polynomial memory separation in $n$ and $k$ (e.g., for $k=n/2$ the separation is exponential). They discuss the implications for inference-time advantages, potential hardware platforms, and future directions for exploiting semantic ambiguity and contextuality in real-world data, suggesting a natural setting where quantum learning can outperform classical counterparts during deployment.
Abstract
Recent theoretical results in quantum machine learning have demonstrated a general trade-off between the expressive power of quantum neural networks (QNNs) and their trainability; as a corollary of these results, practical exponential separations in expressive power over classical machine learning models are believed to be infeasible as such QNNs take a time to train that is exponential in the model size. We here circumvent these negative results by constructing a hierarchy of efficiently trainable QNNs that exhibit unconditionally provable, polynomial memory separations of arbitrary constant degree over classical neural networks -- including state-of-the-art models, such as Transformers -- in performing a classical sequence modeling task. This construction is also computationally efficient, as each unit cell of the introduced class of QNNs only has constant gate complexity. We show that contextuality -- informally, a quantitative notion of semantic ambiguity -- is the source of the expressivity separation, suggesting that other learning tasks with this property may be a natural setting for the use of quantum learning algorithms.
