Cross Learning between Electronic Structure Theories for Unifying Molecular, Surface, and Inorganic Crystal Foundation Force Fields
Ilyes Batatia, Chen Lin, Joseph Hart, Elliott Kasoar, Alin M. Elena, Sam Walton Norwood, Thomas Wolf, Gábor Csányi
TL;DR
This work tackles the challenge of unifying interatomic potentials across molecular, surface, and inorganic crystal chemistry by developing a foundation MLIP with cross-domain learning. It introduces a strengthened MACE backbone with non-linear tensor decomposition and a multi-head replay fine-tuning protocol that transfers knowledge across domains while mitigating forgetting. Through extensive benchmarks spanning materials, molecular crystals, surfaces, and molecules, the cross-domain model (mace-mh-1-omat) achieves state-of-the-art or competitive performance across domains, often surpassing specialised baselines and demonstrating robust cross-learning. The results suggest a viable path toward a single, transferable MLIP capable of accurately simulating multi-scale chemical phenomena with broad practical impact in catalysis, materials design, and biomolecular modeling.
Abstract
Creating a single unified interatomic potential capable of attaining ab initio accuracy across all chemistry remains a long-standing challenge in computational chemistry and materials science. This work introduces a training protocol for foundation machine-learning interatomic potentials (MLIPs) that bridge molecular, surface, and materials chemistry through cross-domain learning. First, we introduce enhancements to the MACE architecture that improve its performance on chemically diverse databases by increasing weight sharing across chemical elements and introducing non-linear factors into the tensor decomposition of the product basis. Second, we develop a multi-head replay post-training methodology that enables efficient knowledge transfer across diverse chemical domains. By fine-tuning on datasets at different levels of electronic structure theory, including inorganic crystals, molecular systems, surface chemistry, and reactive organic chemistry, we demonstrate that a single unified model achieves state-of-the-art performance across several chemical domains. Comprehensive benchmarking reveals superior cross-domain transferability compared with existing specialised and multi-task models, with notable improvements in molecular and surface properties while maintaining state-of-the-art performance in materials-property prediction.
