Machine learning intermolecular transfer integrals with compact atomic cluster representations
Keerati Keeratikarn, Christoph Ortner, Jarvist Moore Frost
TL;DR
This work develops a symmetry-aware machine learning surrogate for intermolecular transfer integrals $J_{ab}$ in organic semiconductors by extending the Atomic Cluster Expansion (ACE) to model transfer with a linear, locality-based representation. It introduces two molecular representations—heavy-atom and Bead coarse-grained—within ACE and demonstrates data-efficient performance on ethylene, thiophene, and naphthalene dimers, achieving sub-μeV to meV accuracy with modest training sets. The results show heavy-atom representations outperform Bead for small to medium conjugated dimers, with naphthalene achieving robust predictions under both schemes, underscoring ACE’s potential for high-throughput charge-transfer simulations. The approach provides a principled, symmetry-respecting framework for rapid prediction of electronic couplings, enabling scalable modeling of organic electronic materials and guiding material design.
Abstract
Calculating intermolecular charge transfer integrals in organic semiconductors requires substantial computer resource for each individual calculation. We might alternatively construct a machine learning model for transfer integrals, which model the full six-degrees of freedom for the relative position of dimer pairs, trained on representative calculations for the molecules of interest. Recent developments have produced effective machine learning force fields, which model the total energy of atomic assemblies. We extend the Atomic Cluster Expansion (ACE) with the correct symmetries for transfer (kinetic-energy) integrals. Combined with a spherical harmonic basis makes, this forms a strong inductive bias and makes for a data efficient model. We introduce coarse-grained and heavy-atom representations, and assess the methodology on representative conjugated semiconductors: ethylene, thiophene, and naphthalene.
