Teaching a Transformer to Think Like a Chemist: Predicting Nanocluster Stability

João Marcos T. Palheta; Octavio Rodrigues Filho; Mohammad Soleymanibrojeni; Alexandre Cavalheiro Dias; Diego Guedes-Sobrinho; Wolfgang Wenzel; Roland Aydin; Celso R. C. Rêgo; Maurício Jeomar Piotrowski

Teaching a Transformer to Think Like a Chemist: Predicting Nanocluster Stability

João Marcos T. Palheta, Octavio Rodrigues Filho, Mohammad Soleymanibrojeni, Alexandre Cavalheiro Dias, Diego Guedes-Sobrinho, Wolfgang Wenzel, Roland Aydin, Celso R. C. Rêgo, Maurício Jeomar Piotrowski

TL;DR

This work tackles the design of bimetallic nanoclusters by combining density functional theory with a physics-guided transformer to predict formation energies and in/out stability in 13-atom ICO X12TM clusters. A FTTransformer is pretrained on a large unary dataset from the Quantum Cluster Database and then fine-tuned on a small bimetallic set, achieving mean absolute errors around $0.67$ eV with calibrated uncertainty, and showing rapid transfer to unseen Fe-host domains. DFT reveals systematic trends in core-shell versus surface motifs linked to $d$-band center, ECN, bond lengths, and HOMO-LUMO gaps, while the transformer captures these physics-informed patterns via attention and SHAP explanations, yielding interpretable design rules. The approach is openly shared under FAIR/TRUE principles, enabling reproducible, interpretable screening of unexplored nanocluster chemistries for catalysis and energy conversion, with an emphasis on transferable knowledge across host–dopant combinations.

Abstract

Atomically precise metal nanoclusters bridge the molecular and bulk regimes, but designing bimetallic motifs with targeted stability and reactivity remains challenging. Here we combine density functional theory (DFT) and physics-grounded predictive artificial intelligence to map the configurational landscape of 13-atom icosahedral nanoclusters X$_{12}$TM, with hosts X = (Ti, Zr, Hf), and Fe and a single transition--metal dopant spanning the 3$d$-5$d$ series. Spin-polarized DFT calculations on 240 bimetallic clusters reveal systematic trends in binding and formation energies, distortion penalties, effective coordination number, d-band centre, and HOMO-LUMO gap that govern the competition between core-shell (in) and surface-segregated (out) arrangements. We then pretrain a transformer architecture on a curated set of 2968 unary clusters from the Quantum Cluster Database and fine-tune it on bimetallic data to predict formation energies and in/out preference, achieving mean absolute errors of about $0.6-0.7$eV and calibrated uncertainty intervals. The resulting model rapidly adapts to an unseen Fe-host domain with only a handful of labelled examples. At the same time, attention patterns and Shapley attributions highlight size mismatch, $d$-electron count, and coordination environment as key descriptors. All data, code, and workflows follow FAIR/TRUE principles, enabling reproducible, interpretable screening of unexplored nanocluster chemistries for catalysis and energy conversion.

Teaching a Transformer to Think Like a Chemist: Predicting Nanocluster Stability

TL;DR

eV with calibrated uncertainty, and showing rapid transfer to unseen Fe-host domains. DFT reveals systematic trends in core-shell versus surface motifs linked to

-band center, ECN, bond lengths, and HOMO-LUMO gaps, while the transformer captures these physics-informed patterns via attention and SHAP explanations, yielding interpretable design rules. The approach is openly shared under FAIR/TRUE principles, enabling reproducible, interpretable screening of unexplored nanocluster chemistries for catalysis and energy conversion, with an emphasis on transferable knowledge across host–dopant combinations.

Abstract

TM, with hosts X = (Ti, Zr, Hf), and Fe and a single transition--metal dopant spanning the 3

-5

series. Spin-polarized DFT calculations on 240 bimetallic clusters reveal systematic trends in binding and formation energies, distortion penalties, effective coordination number, d-band centre, and HOMO-LUMO gap that govern the competition between core-shell (in) and surface-segregated (out) arrangements. We then pretrain a transformer architecture on a curated set of 2968 unary clusters from the Quantum Cluster Database and fine-tune it on bimetallic data to predict formation energies and in/out preference, achieving mean absolute errors of about

eV and calibrated uncertainty intervals. The resulting model rapidly adapts to an unseen Fe-host domain with only a handful of labelled examples. At the same time, attention patterns and Shapley attributions highlight size mismatch,

-electron count, and coordination environment as key descriptors. All data, code, and workflows follow FAIR/TRUE principles, enabling reproducible, interpretable screening of unexplored nanocluster chemistries for catalysis and energy conversion.

Teaching a Transformer to Think Like a Chemist: Predicting Nanocluster Stability

TL;DR

Abstract

Teaching a Transformer to Think Like a Chemist: Predicting Nanocluster Stability

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)