Cross-functional transferability in universal machine learning interatomic potentials
Xu Huang, Bowen Deng, Peichen Zhong, Aaron D. Kaplan, Kristin A. Persson, Gerbrand Ceder
TL;DR
The paper tackles cross-functional transferability of universal MLIPs across DFT functionals (GGA/GGA+U and r^2SCAN) and identifies large energy-scale shifts as a key obstacle. It introduces a CHGNet-based transfer-learning approach that uses energy referencing via atomic reference energies (AtomRef) to align labels between functionals. The study finds that refitting AtomRef to the target functional dramatically improves inter-functional correlation and model accuracy, enabling data-efficient transfer even with sub-million high-fidelity data, and shows TL outperforms training from scratch across energy, forces, and stability predictions. It also highlights the need for multi-fidelity benchmarking when advancing uMLIPs to higher-accuracy functionals, outlining practical strategies for integrating diverse datasets to build next-generation interatomic potentials.
Abstract
The rapid development of universal machine learning interatomic potentials (uMLIPs) has demonstrated the possibility for generalizable learning of the universal potential energy surface. In principle, the accuracy of uMLIPs can be further improved by bridging the model from lower-fidelity datasets to high-fidelity ones. In this work, we analyze the challenge of this transfer learning problem within the CHGNet framework. We show that significant energy scale shifts and poor correlations between GGA and r$^2$SCAN pose challenges to cross-functional data transferability in uMLIPs. By benchmarking different transfer learning approaches on the MP-r$^2$SCAN dataset of 0.24 million structures, we demonstrate the importance of elemental energy referencing in the transfer learning of uMLIPs. By comparing the scaling law with and without the pre-training on a low-fidelity dataset, we show that significant data efficiency can still be achieved through transfer learning, even with a target dataset of sub-million structures. We highlight the importance of proper transfer learning and multi-fidelity learning in creating next-generation uMLIPs on high-fidelity data.
