Enhancing the Scalability and Applicability of Kohn-Sham Hamiltonians for Molecular Systems
Yunyang Li, Zaishuo Xia, Lin Huang, Xinran Wei, Han Yang, Sam Harshe, Zun Wang, Chang Liu, Jia Zhang, Bin Shao, Mark B. Gerstein
TL;DR
This work tackles the scalability gap in predictive Hamiltonians for DFT by introducing Wavefunction Alignment Loss (WALoss) which aligns predicted and true eigenspaces, and WANet, a scalable SE(3)-aware architecture leveraging eSCN, a two-part Hamiltonian Head, and a mixture of long-short-range experts. The authors release PubChemQH, a large dataset with 40–100-atom molecules to study scaling and the SAD phenomenon, showing that elementwise Hamiltonian MAE alone poorly predicts ground-state energies for large systems. WALoss, combined with WANet, achieves unprecedented improvements in energy predictions and accelerates SCF convergence, while enabling accurate predictions of additional properties such as dipole moments and electronic extents. The work demonstrates substantial practical impact for large-scale quantum chemistry and materials science tasks, though it notes high computational cost for data generation and suggests directions for further efficiency and generalization enhancements.
Abstract
Density Functional Theory (DFT) is a pivotal method within quantum chemistry and materials science, with its core involving the construction and solution of the Kohn-Sham Hamiltonian. Despite its importance, the application of DFT is frequently limited by the substantial computational resources required to construct the Kohn-Sham Hamiltonian. In response to these limitations, current research has employed deep-learning models to efficiently predict molecular and solid Hamiltonians, with roto-translational symmetries encoded in their neural networks. However, the scalability of prior models may be problematic when applied to large molecules, resulting in non-physical predictions of ground-state properties. In this study, we generate a substantially larger training set (PubChemQH) than used previously and use it to create a scalable model for DFT calculations with physical accuracy. For our model, we introduce a loss function derived from physical principles, which we call Wavefunction Alignment Loss (WALoss). WALoss involves performing a basis change on the predicted Hamiltonian to align it with the observed one; thus, the resulting differences can serve as a surrogate for orbital energy differences, allowing models to make better predictions for molecular orbitals and total energies than previously possible. WALoss also substantially accelerates self-consistent-field (SCF) DFT calculations. Here, we show it achieves a reduction in total energy prediction error by a factor of 1347 and an SCF calculation speed-up by a factor of 18%. These substantial improvements set new benchmarks for achieving accurate and applicable predictions in larger molecular systems.
