GEARS H: Accurate machine-learned Hamiltonians for next-generation device-scale modeling
Anubhab Haldar, Ali K. Hamze, Nikhil Sivadas, Yongwoo Shin
TL;DR
GEARS H presents a compact, E(3)-equivariant, LCAO-based Hamiltonian learning framework that enables device-scale electronic structure simulations with DFT-like accuracy. By decomposing the Hamiltonian into atom-centered and bond-centered descriptors and employing scale-shift readouts, it achieves sub-meV Hamiltonian-element MAEs across diverse materials, including molecules, 2D materials, alloys, and amorphous solids. The authors demonstrate a large-scale, Ni-doped a-HfO$_2$/WSe$_2$ case study, performing a Bayesian analysis to quantify how Ni doping, density, and Se vacancies influence hole concentration, and show inference times sufficient to explore thousands of structures in hours. The work provides a production-ready pathway for predictive, large-scale device modeling and offers broad applicability across complex chemical environments, supported by public code, datasets, and pretrained models.
Abstract
We introduce GEARS H, a state-of-the-art machine-learning Hamiltonian framework for large-scale electronic structure simulations. Using GEARS H, we present a statistical analysis of the hole concentration induced in defective $\mathrm{WSe}_2$ interfaced with Ni-doped amorphous $\mathrm{HfO}_2$ as a function of the Ni doping rate, system density, and Se vacancy rate in 72 systems ranging from 3326 to 4160 atoms-a quantity and scale of interface electronic structure calculation beyond the reach of conventional density functional theory codes and other machine-learning-based methods. We further demonstrate the versatility of our architecture by training models for a molecular system, 2D materials with and without defects, solid solution crystals, and bulk amorphous systems with covalent and ionic bonds. The mean absolute error of the inferred Hamiltonian matrix elements from the validation set is below 2.4 meV for all of these models. GEARS H outperforms other proposed machine-learning Hamiltonian frameworks, and our results indicate that machine-learning Hamiltonian methods, starting with GEARS H, are now production-ready techniques for DFT-accuracy device-scale simulation.
