Minimalist machine-learned interatomic potentials can predict complex structural behaviors accurately
Iñigo Robredo-Magro, Binayak Mukherjee, Hugo Aramberri, Jorge Íñiguez-González
TL;DR
This work questions the prevailing view that reliable interatomic potentials require large, meticulously curated datasets. It tests minimalist MLIPs—kernel GAP-SOAP and E(3)-equivariant Allegro nets—trained with tiny, on-the-fly datasets derived from known phases, across BaTiO$_3$, BiFeO$_3$, PbZrO$_3$, and HfO$_2$. The models interpolate well to DFT data and, crucially, extrapolate to predict non-trivial phenomena such as vortex-antivortex textures, polarization switching paths, and near-degenerate low-energy polymorphs, often with quantitative accuracy. These findings suggest that simple, low-cost potentials can serve as effective tools for discovering and predicting emergent behavior in complex ferroelectrics, potentially broadening the practical scope of MLIPs and guiding future explorations in domain-generalization and interpretability.
Abstract
The past decade has witnessed a spectacular development of machine-learned interatomic potentials (MLIPs), to the extent that they are already the approach of choice for most atomistic simulation studies not requiring an explicit treatment of electrons. Typical MLIP usage guidelines emphasize the need for exhaustive training sets and warn against applying the models to situations not considered in the corresponding training space. This restricts the scope of MLIPs to interpolative calculations, essentially denying the possibility of using them to discover new phenomena in a serendipitous way. While there are reasons to be cautious, here we adopt a more sanguine view and challenge the predictive power of two representative and widely available MLIP approaches. We work with minimalist training sets that rely on little prior knowledge of the investigated materials. We show that the resulting models -- for which we adopt modest/default choices of the defining hyperparameters -- are very successful in predicting non-trivial structural effects (competing polymorphs, energy barriers for structural transformations, occurrence of non-trivial topologies) in a way that is qualitatively and quasi-quantitatively correct. Our results thus suggest an expanded scope of modern MLIP approaches, evidencing that somewhat trivial -- and easy to compute -- models can be an effective tool for the discovery of novel and complex physical phenomena.
