Why is topology hard to learn?
D. O. Oriekhov, Stan Bergkamp, Guliuxin Jin, Juan Daniel Torres Luna, Badr Zouggari, Sibren van der Meer, Naoual El Yazidi, Eliska Greplova
TL;DR
The paper investigates why topology is hard to learn by constructing a hybrid tensor-neural network that exactly expresses the real-space winding number (RSWN) for SSH-like models (AIII class) and benchmarking it against physics-agnostic networks. It finds that while the exact-topology network can encode the invariant with high precision, it suffers from reduced trainability and sensitivity to initialization, whereas simpler, symmetry-informed or fully generic networks can achieve strong performance but may rely on dataset proxies rather than the invariant itself. Through weight analysis and SVD pruning, the authors show that topology classification often hinges on a compact set of edge-state features rather than the full global invariant, and that disorder erodes simple correlation-based learning. This work clarifies interpretable ML strategies for condensed-matter topology and suggests architectures that better generalize under real-world imperfections.
Abstract
Much attention has been devoted to the use of machine learning to approximate physical concepts. Yet, due to challenges in interpretability of machine learning techniques, the question of what physics machine learning models are able to learn remains open. Here we bridge the concept a physical quantity and its machine learning approximation in the context of the original application of neural networks in physics: topological phase classification. We construct a hybrid tensor-neural network object that exactly expresses real space topological invariant and rigorously assess its trainability and generalization. Specifically, we benchmark the accuracy and trainability of a tensor-neural network to multiple types of neural networks, thus exemplifying the differences in trainability and representational power. Our work highlights the challenges in learning topological invariants and constitutes a stepping stone towards more accurate and better generalizable machine learning representations in condensed matter physics.
