Optimizing Cross-Domain Transfer for Universal Machine Learning Interatomic Potentials
Jaesun Kim, Jinmu You, Yutack Park, Yunsung Lim, Yujin Kang, Jisu Kim, Haekwan Jeon, Suyeon Ju, Deokgi Hong, Seung Yul Lee, Saerom Choi, Yongdeok Kim, Jae W. Lee, Seungwu Han
TL;DR
SevenNet-Omni addresses the core transferability challenge of universal MLIPs by training across 15 heterogeneous databases with a two-component parameterization: a shared PES represented by $\theta_C$ and task-specific corrections $\theta_T$. The framework employs selective regularization on $\theta_T$ together with a domain-bridging set (DBS) of cross-domain evaluations to align energy surfaces and enhance cross-domain generalization. Across diverse benchmarks—from torsion barriers to adsorption on metal surfaces and MOFs—the method achieves state-of-the-art cross-domain accuracy, including sub-0.1 eV adsorption energies and the ability to reproduce $r^2$SCAN energetics despite limited $r^2$SCAN data. The curriculum-driven training and energy-shift alignment enable effective cross-functional transfer from large PBE datasets to hybrid-functionals, offering a scalable path toward universal, transferable MLIPs that bridge quantum-mechanical fidelity and chemical diversity.
Abstract
Accurate yet transferable machine-learning interatomic potentials (MLIPs) are essential for accelerating materials and chemical discovery. However, most universal MLIPs overfit to narrow datasets or computational protocols, limiting their reliability across chemical and functional domains. We introduce a transferable multi-domain training strategy that jointly optimizes universal and task-specific parameters through selective regularization, coupled with a domain-bridging set (DBS) that aligns potential-energy surfaces across datasets. Systematic ablation experiments show that small DBS fractions (0.1%) and targeted regularization synergistically enhance out-of-distribution generalization while preserving in-domain fidelity. Trained on fifteen open databases spanning molecules, crystals, and surfaces, our model, SevenNet-Omni, achieves state-of-the-art cross-domain accuracy, including adsorption-energy errors below 0.06 eV on metallic surfaces and 0.1 eV on metal-organic frameworks. Despite containing only 0.5% r$^2$SCAN data, SevenNet-Omni reproduces high-fidelity r$^2$SCAN energetics, demonstrating effective cross-functional transfer from large PBE datasets. This framework offers a scalable route toward universal, transferable MLIPs that bridge quantum-mechanical fidelities and chemical domains.
