AdsorbML: A Leap in Efficiency for Adsorption Energy Calculations using Generalizable Machine Learning Potentials
Janice Lan, Aini Palizhati, Muhammed Shuaibi, Brandon M. Wood, Brook Wander, Abhishek Das, Matt Uyttendaele, C. Lawrence Zitnick, Zachary W. Ulissi
TL;DR
AdsorbML tackles the computational bottleneck of locating global minimum adsorption energies by combining generalizable ML potentials with an AdsorbML search strategy that ranks and refines multiple initial configurations. The Open Catalyst OC20-Dense dataset is introduced as a standardized benchmark, enabling rigorous evaluation across diverse adsorbates and surfaces. The results demonstrate substantial speedups (up to ~2,300×) with competitive success rates (~87%), revealing a practical path for high-throughput catalyst screening and highlighting opportunities for further optimization and generalization. Collectively, the work provides a scalable framework for accurate, efficient adsorption-energy estimation and points to future directions in global optimization and broader chemical-space applicability.
Abstract
Computational catalysis is playing an increasingly significant role in the design of catalysts across a wide range of applications. A common task for many computational methods is the need to accurately compute the adsorption energy for an adsorbate and a catalyst surface of interest. Traditionally, the identification of low energy adsorbate-surface configurations relies on heuristic methods and researcher intuition. As the desire to perform high-throughput screening increases, it becomes challenging to use heuristics and intuition alone. In this paper, we demonstrate machine learning potentials can be leveraged to identify low energy adsorbate-surface configurations more accurately and efficiently. Our algorithm provides a spectrum of trade-offs between accuracy and efficiency, with one balanced option finding the lowest energy configuration 87.36% of the time, while achieving a 2000x speedup in computation. To standardize benchmarking, we introduce the Open Catalyst Dense dataset containing nearly 1,000 diverse surfaces and 100,000 unique configurations.
