Overcoming the Barrier of Orbital-Free Density Functional Theory for Molecular Systems Using Deep Learning
He Zhang, Siyuan Liu, Jiacheng You, Chang Liu, Shuxin Zheng, Ziheng Lu, Tong Wang, Nanning Zheng, Bin Shao
TL;DR
M-OFDFT presents a deep-learning kinetic energy density functional (KEDF) for orbital-free DFT that integrates non-local density interactions through an atomic-basis expansion and a Graphormer-based attention scheme. By training on multiple density states per structure (with energy and gradient labels) and enforcing SE(3) invariance via local frames, the method achieves KSDFT-competitive accuracy for molecular systems while delivering superior extrapolation to much larger molecules, such as QMugs and biomolecules. Empirically, M-OFDFT exhibits an $O(N^{1.46})$ scaling and substantial speedups over KSDFT (up to ~27x on protein-sized systems), enabling practical studies of large-scale molecular systems with improved accuracy over classical OFDFT. The work also introduces a suite of practical techniques—density optimization-on-manifold initializations, enhancement modules for large gradient ranges, and projection-based training data—that collectively stabilize learning and improve transfer to unseen chemical environments, with clear implications for biomolecular and material-scale quantum simulations.
Abstract
Orbital-free density functional theory (OFDFT) is a quantum chemistry formulation that has a lower cost scaling than the prevailing Kohn-Sham DFT, which is increasingly desired for contemporary molecular research. However, its accuracy is limited by the kinetic energy density functional, which is notoriously hard to approximate for non-periodic molecular systems. Here we propose M-OFDFT, an OFDFT approach capable of solving molecular systems using a deep learning functional model. We build the essential non-locality into the model, which is made affordable by the concise density representation as expansion coefficients under an atomic basis. With techniques to address unconventional learning challenges therein, M-OFDFT achieves a comparable accuracy with Kohn-Sham DFT on a wide range of molecules untouched by OFDFT before. More attractively, M-OFDFT extrapolates well to molecules much larger than those seen in training, which unleashes the appealing scaling of OFDFT for studying large molecules including proteins, representing an advancement of the accuracy-efficiency trade-off frontier in quantum chemistry.
