Long-range electrostatics for machine learning interatomic potentials is easier than we thought
Dongjin Kim, Bingqing Cheng
TL;DR
The paper tackles the difficulty of incorporating long-range electrostatics into machine-learning interatomic potentials (MLIPs). It introduces Latent Ewald Summation (LES), which adds a physically grounded Coulomb energy term using environment-dependent latent charges learned from standard energy/force data rather than DFT charges. The authors establish two design principles and show that LES improves energy/force accuracy across architectures, yields interpretable charges, and enables prediction of electrical response properties like dipoles and Born effective charges with low data requirements and modest overhead. LES is shown to be universal across short-range MLIPs and extensible with dipole/BEC fine-tuning, Qeq, and other extensions, offering a practical path to accurate, scalable simulations of interfaces, polar/ionic materials, and biomolecules. The work positions long-range electrostatics as simpler and more broadly applicable in MLIPs than previously believed, while outlining limitations and future directions such as interfacial systems and foundation models.
Abstract
The lack of long-range electrostatics is a key limitation of modern machine learning interatomic potentials (MLIPs), hindering reliable applications to interfaces, charge-transfer reactions, polar and ionic materials, and biomolecules. In this Perspective, we distill two design principles behind the Latent Ewald Summation (LES) framework, which can capture long-range interactions, charges, and electrical response just by learning from standard energy and force training data: (i) use a Coulomb functional form with environment-dependent charges to capture electrostatic interactions, and (ii) avoid explicit training on ambiguous density functional theory (DFT) partial charges. When both principles are satisfied, substantial flexibility remains: essentially any short-range MLIP can be augmented; charge equilibration schemes can be added when desired; dipoles and Born effective charges can be inferred or finetuned; and charge/spin-state embeddings or tensorial targets can be further incorporated. We also discuss current limitations and open challenges. Together, these minimal, physics-guided design rules suggest that incorporating long-range electrostatics into MLIPs is simpler and perhaps more broadly applicable than is commonly assumed.
