E(3)-Equivariant Graph Neural Networks for Data-Efficient and Accurate Interatomic Potentials
Simon Batzner, Albert Musaelian, Lixin Sun, Mario Geiger, Jonathan P. Mailoa, Mordechai Kornbluth, Nicola Molinari, Tess E. Smidt, Boris Kozinsky
TL;DR
NequIP introduces an E(3)-equivariant neural interatomic potential that learns from ab-initio data to enable accurate, energy-conserving molecular dynamics with exceptional data efficiency. By representing atomic features as geometric tensors and using filters built from radial functions and spherical harmonics, NequIP captures rotationally equivariant interactions up to a configurable rank $l_{ ext{max}}$, surpassing prior invariant and equivariant approaches in both accuracy and data efficiency. Across MD-17, CCSD/CCSD(T)-level training, water/ice, surface reactions, amorphous glass formation, and superionic conduction, NequIP achieves state-of-the-art performance with orders-of-magnitude less training data in many cases, and faithfully reproduces structural and kinetic properties. The work demonstrates the practical impact of enforcing $E(3)$-equivariance in learned interatomic potentials, enabling high-fidelity simulations at high levels of theory and broad applicability to chemistry, materials science, and condensed matter physics.
Abstract
This work presents Neural Equivariant Interatomic Potentials (NequIP), an E(3)-equivariant neural network approach for learning interatomic potentials from ab-initio calculations for molecular dynamics simulations. While most contemporary symmetry-aware models use invariant convolutions and only act on scalars, NequIP employs E(3)-equivariant convolutions for interactions of geometric tensors, resulting in a more information-rich and faithful representation of atomic environments. The method achieves state-of-the-art accuracy on a challenging and diverse set of molecules and materials while exhibiting remarkable data efficiency. NequIP outperforms existing models with up to three orders of magnitude fewer training data, challenging the widely held belief that deep neural networks require massive training sets. The high data efficiency of the method allows for the construction of accurate potentials using high-order quantum chemical level of theory as reference and enables high-fidelity molecular dynamics simulations over long time scales.
