Opportunities and limitations of explaining quantum machine learning
Elies Gil-Fuster, Jonas R. Naujoks, Grégoire Montavon, Thomas Wiegand, Wojciech Samek, Jens Eisert
TL;DR
Quantum machine learning (QML) faces a gap in explainability, motivating the proposed XQML framework. The authors introduce two PQC-specific explainability methods—Taylor-$\infty$ and quantum layerwise relevance propagation (QLRP)—and compare them with existing XAI approaches on synthetic tasks. They provide a structured analysis of PQC explainability, discuss fundamental quantum constraints (e.g., no cloning, exponential Hilbert space), and present initial numerical demonstrations to illustrate pipelines and trade-offs. The work outlines scalability- and encoding-related challenges and sketches future directions toward inherently interpretable PQC architectures and efficient, hardware-aware explanation techniques.
Abstract
A common trait of many machine learning models is that it is often difficult to understand and explain what caused the model to produce the given output. While the explainability of neural networks has been an active field of research in the last years, comparably little is known for quantum machine learning models. Despite a few recent works analyzing some specific aspects of explainability, as of now there is no clear big picture perspective as to what can be expected from quantum learning models in terms of explainability. In this work, we address this issue by identifying promising research avenues in this direction and lining out the expected future results. We additionally propose two explanation methods designed specifically for quantum machine learning models, as first of their kind to the best of our knowledge. Next to our pre-view of the field, we compare both existing and novel methods to explain the predictions of quantum learning models. By studying explainability in quantum machine learning, we can contribute to the sustainable development of the field, preventing trust issues in the future.
