Towards structure-preserving quantum encodings
Arthur J. Parzygnat, Tai-Danae Bradley, Andrew Vlasic, Anh Pham
TL;DR
The paper advocates using category theory to organize and design structure-preserving quantum encodings that map classical data $\mathcal{X}$ into quantum states $\mathcal{S}(\mathcal{H})$, via encodings $\rho: \mathcal{X}\to\mathcal{S}(\mathcal{H})$ or unitary encodings $U: \mathcal{X}\to\mathcal{U}(\mathcal{H})$. By pairing data with mathematical structures (e.g., symmetry, topology, metric) and viewing encodings as morphisms in categories, the authors show how forgetful functors lift problems to structure-rich contexts, thereby reducing the search space for good encodings. They illustrate this framework through concrete structure types: symmetry via $G$-equivariant encodings, continuity and smoothness via topologies, and distance notions via topological data analysis and metric learning, including exact distance-preserving results and semi-metric considerations when injectivity fails. The approach provides a rigorous, unified language to pose design questions, benchmarks, and open problems, with potential to guide principled encoding choices and clarify when quantum advantages survive through the encoding layer.
Abstract
Harnessing the potential computational advantage of quantum computers for machine learning tasks relies on the uploading of classical data onto quantum computers through what are commonly referred to as quantum encodings. The choice of such encodings may vary substantially from one task to another, and there exist only a few cases where structure has provided insight into their design and implementation, such as symmetry in geometric quantum learning. Here, we propose the perspective that category theory offers a natural mathematical framework for analyzing encodings that respect structure inherent in datasets and learning tasks. We illustrate this with pedagogical examples, which include geometric quantum machine learning, quantum metric learning, topological data analysis, and more. Moreover, our perspective provides a language in which to ask meaningful and mathematically precise questions for the design of quantum encodings and circuits for quantum machine learning tasks.
