Minimising the Demand for High-Fidelity Training Data towards Chemically Accurate Adsorption Energy Predictions
Zhihao Zhang, Xiao-Ming Cao
TL;DR
DOTA is presented, a functional-independent deep learning model established on the map between local density of states (LDOS) and adsorption energy that provides a robust framework for efficient catalyst and electrode screening, bridging the gap between computational and experimental data.
Abstract
Adsorption energy is a critical descriptor for high-throughput screening of heterogeneous catalysts and electrode materials. However, precise experimental data are scarce due to the complexity of experiments, while high-fidelity density functional theory (DFT) calculations remain computationally expensive for large-scale material screening. Machine learning models trained on DFT data have emerged as a promising alternative but face challenges such as functional dependency and limited high-fidelity labelled data. Herein, we present DOS Transformer for Adsorption (DOTA), a functional-independent deep learning model established on the map between local density of states (LDOS) and adsorption energy. DOTA integrates multi-head self-attention mechanisms with LDOS feature engineering to capture latent orbital interaction patterns, enabling it to unify multi-fidelity and multi-source data. This minimises the demand for high-fidelity training data. Consequently, the predictive adsorption energy could reach chemical accuracy, requiring less than five high-fidelity experimental adsorption energies for model training. DOTA also resolves long-standing challenges, such as the "CO puzzle", and outperforms traditional theories, including the d-band centre and Fermi softness models. It provides a robust framework for efficient catalyst and electrode screening, bridging the gap between computational and experimental data.
