Learning Chern Numbers of Topological Insulators with Gauge Equivariant Neural Networks
Longde Huang, Oleksandr Balabanov, Hampus Linander, Mats Granath, Daniel Persson, Jan E. Gerken
TL;DR
This work develops gauge-equivariant networks that respect local $U(N)$ gauge symmetry to predict discretized Chern numbers $\tilde{C}$ in multiband topological insulators. By integrating new gauge-equivariant layers (GEBL, GEAct, GEConv) and a stabilization layer (TrNorm), the approach achieves robust learning where non-equivariant models fail, and it proves a universal approximation theorem for class functions on $G=\mathrm{U}(N)$. The authors demonstrate strong generalization from trivial to nontrivial topology and extend results to higher dimensions (4D Chern numbers), with detailed ablations and data-generation strategies based on Wilson loops. The framework provides a scalable, locality-preserving method to extract topological invariants from discretized band structures, offering a practical tool for exploring topological phases in complex, multi-band systems.
Abstract
Equivariant network architectures are a well-established tool for predicting invariant or equivariant quantities. However, almost all learning problems considered in this context feature a global symmetry, i.e. each point of the underlying space is transformed with the same group element, as opposed to a local ``gauge'' symmetry, where each point is transformed with a different group element, exponentially enlarging the size of the symmetry group. Gauge equivariant networks have so far mainly been applied to problems in quantum chromodynamics. Here, we introduce a novel application domain for gauge-equivariant networks in the theory of topological condensed matter physics. We use gauge equivariant networks to predict topological invariants (Chern numbers) of multiband topological insulators. The gauge symmetry of the network guarantees that the predicted quantity is a topological invariant. We introduce a novel gauge equivariant normalization layer to stabilize the training and prove a universal approximation theorem for our setup. We train on samples with trivial Chern number only but show that our models generalize to samples with non-trivial Chern number. We provide various ablations of our setup. Our code is available at https://github.com/sitronsea/GENet/tree/main.
