Graph Transformer Networks for Accurate Band Structure Prediction: An End-to-End Approach
Weiyi Gong, Tao Sun, Hexin Bai, Jeng-Yuan Tsai, Haibin Ling, Qimin Yan
TL;DR
This work addresses the challenge of predicting electronic band structures directly from crystal structures, an end-to-end task not previously solved at scale. It introduces Bandformer, a graph transformer–based framework that encodes crystal structures and translates them into band energies along a continuous high-symmetry k-path, using a graph2seq decoder and an $rFFT$–based frequency representation to efficiently capture band oscillations. Trained on $N_b=6$ near-Fermi bands and a fixed $N_k=128$ path length from $27{,}772$ Materials Project structures, Bandformer achieves a band-energy MAE of $0.304$ eV and derives derived properties such as band gaps with MAEs around $0.205$ eV, demonstrating strong predictive performance and generalizability. This end-to-end approach enables fast electronic-structure predictions for high-throughput materials screening and lays the groundwork for scalable transformer models in band-structure discovery and inverse design.
Abstract
Predicting electronic band structures from crystal structures is crucial for understanding structure-property correlations in materials science. First-principles approaches are accurate but computationally intensive. Recent years, machine learning (ML) has been extensively applied to this field, while existing ML models predominantly focus on band gap predictions or indirect band structure estimation via solving predicted Hamiltonians. An end-to-end model to predict band structure accurately and efficiently is still lacking. Here, we introduce a graph Transformer-based end-to-end approach that directly predicts band structures from crystal structures with high accuracy. Our method leverages the continuity of the k-path and treat continuous bands as a sequence. We demonstrate that our model not only provides accurate band structure predictions but also can derive other properties (such as band gap, band center, and band dispersion) with high accuracy. We verify the model performance on large and diverse datasets.
