Stability-Aware Training of Machine Learning Force Fields with Differentiable Boltzmann Estimators
Sanjeev Raja, Ishan Amin, Fabian Pedregosa, Aditi S. Krishnapriyan
TL;DR
<3-5 sentence high-level summary>StABlE Training addresses the instability challenges of neural network-based machine learning force fields (MLFFs) in molecular dynamics by coupling traditional quantum-mechanical (QM) supervision with system observables via a Boltzmann-gradient mechanism. The method introduces the Boltzmann Estimator and its localized variant to enable end-to-end differentiable learning without backpropagating through long MD trajectories, and it alternates simulation and learning phases to identify and rectify unstable regions. Demonstrations on aspirin, Ac-Ala3-NHMe, and all-atom water across multiple architectures show substantial gains in stability, data efficiency, and fidelity of observables not explicitly trained, including improved RDFs and diffusivity. The approach is self-contained, broadly applicable, and promises practical impact by enabling stable, long-timescale MD with MLFFs even when large QM datasets are unavailable.
Abstract
Machine learning force fields (MLFFs) are an attractive alternative to ab-initio methods for molecular dynamics (MD) simulations. However, they can produce unstable simulations, limiting their ability to model phenomena occurring over longer timescales and compromising the quality of estimated observables. To address these challenges, we present Stability-Aware Boltzmann Estimator (StABlE) Training, a multi-modal training procedure which leverages joint supervision from reference quantum-mechanical calculations and system observables. StABlE Training iteratively runs many MD simulations in parallel to seek out unstable regions, and corrects the instabilities via supervision with a reference observable. We achieve efficient end-to-end automatic differentiation through MD simulations using our Boltzmann Estimator, a generalization of implicit differentiation techniques to a broader class of stochastic algorithms. Unlike existing techniques based on active learning, our approach requires no additional ab-initio energy and forces calculations to correct instabilities. We demonstrate our methodology across organic molecules, tetrapeptides, and condensed phase systems, using three modern MLFF architectures. StABlE-trained models achieve significant improvements in simulation stability, data efficiency, and agreement with reference observables. The stability improvements cannot be matched by reducing the simulation timestep; thus, StABlE Training effectively allows for larger timesteps. By incorporating observables into the training process alongside first-principles calculations, StABlE Training can be viewed as a general semi-empirical framework applicable across MLFF architectures and systems. This makes it a powerful tool for training stable and accurate MLFFs, particularly in the absence of large reference datasets. Our code is available at https://github.com/ASK-Berkeley/StABlE-Training.
