Accurate molecular polarizabilities with coupled-cluster theory and machine learning

David M. Wilkins; Andrea Grisafi; Yang Yang; Ka Un Lao; Robert A. DiStasio; Michele Ceriotti

TL;DR

The paper addresses the accurate prediction of the molecular polarizability $\boldsymbol{\alpha}$, a tensor that governs induction and dispersion and that standard electronic-structure methods struggle to describe reliably. The authors compute LR-CCSD polarizabilities for the QM7b dataset with the $d$-aug-cc-pVDZ basis set and compare them to DFT. They then introduce ALPHA-ML, a symmetry-adapted Gaussian process regression model built on $\lambda$-SOAP descriptors, which predicts the full tensor $\boldsymbol{\alpha}$ with LR-CCSD-level accuracy at a fraction of the cost. Delta-learning from a DFT baseline improves performance, and an atom-centered decomposition adds interpretability; the model reaches near-CCSD accuracy on validation and extrapolates successfully to 52 larger molecules. The approach offers a scalable route to accurate polarizable force fields and spectroscopy-informed predictions in large systems.
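The delta-learning idea mentioned above can be illustrated with a minimal sketch: instead of learning the expensive target directly, a model learns the difference between the expensive target and a cheap baseline, and the prediction is baseline plus learned correction. Everything in this example is an assumption made for illustration. The functions `expensive` and `cheap` are hypothetical one-dimensional stand-ins for LR-CCSD and DFT values, the input is a scalar rather than a $\lambda$-SOAP descriptor, and plain kernel ridge regression is used in place of the symmetry-adapted model (its prediction coincides with the posterior mean of a Gaussian process with the same kernel).

```python
import numpy as np

def gaussian_kernel(a, b, length_scale=0.3):
    """Squared-exponential kernel between two 1D arrays of inputs."""
    d2 = (a[:, None] - b[None, :]) ** 2
    return np.exp(-0.5 * d2 / length_scale**2)

def fit_krr(x_train, y_train, reg=1e-6):
    """Solve (K + reg * I) w = y for the kernel ridge regression weights."""
    K = gaussian_kernel(x_train, x_train)
    return np.linalg.solve(K + reg * np.eye(len(x_train)), y_train)

def predict_krr(x_train, weights, x_new):
    """Evaluate the fitted model at new inputs."""
    return gaussian_kernel(x_new, x_train) @ weights

# Hypothetical toy functions (NOT real quantum-chemistry data):
# "expensive" plays the role of the reference method, "cheap" the baseline.
def expensive(x):
    return np.sin(2 * np.pi * x) + 0.3 * x**2

def cheap(x):
    return 0.9 * np.sin(2 * np.pi * x)

x_train = np.linspace(0.0, 1.0, 15)
x_test = np.linspace(0.02, 0.98, 40)

# Delta-learning: fit only the correction (expensive - cheap).
delta_train = expensive(x_train) - cheap(x_train)
weights = fit_krr(x_train, delta_train)

# Prediction = cheap baseline + learned correction.
prediction = cheap(x_test) + predict_krr(x_train, weights, x_test)

mae_baseline = np.mean(np.abs(expensive(x_test) - cheap(x_test)))
mae_delta = np.mean(np.abs(expensive(x_test) - prediction))
print(f"baseline MAE: {mae_baseline:.4f}")
print(f"delta-learning MAE: {mae_delta:.4f}")
```

The correction is usually smoother and smaller in magnitude than the target itself, which is why a baseline can reduce the amount of training data needed; how much it helps in practice depends on the quality of the baseline.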

Abstract

The molecular polarizability describes the tendency of a molecule to deform or polarize in response to an applied electric field. As such, this quantity governs key intra- and inter-molecular interactions such as induction and dispersion, plays a key role in determining the spectroscopic signatures of molecules, and is an essential ingredient in polarizable force fields and other empirical models for collective interactions. Compared to other ground-state properties, an accurate and reliable prediction of the molecular polarizability is considerably more difficult as this response quantity is quite sensitive to the description of the underlying molecular electronic structure. In this work, we present state-of-the-art quantum mechanical calculations of the static dipole polarizability tensors of 7,211 small organic molecules computed using linear-response coupled-cluster singles and doubles theory (LR-CCSD). Using a symmetry-adapted machine-learning based approach, we demonstrate that it is possible to predict the molecular polarizability with LR-CCSD accuracy at a negligible computational cost. The employed model is quite robust and transferable, yielding molecular polarizabilities for a diverse set of 52 larger molecules (which includes challenging conjugated systems, carbohydrates, small drugs, amino acids, nucleobases, and hydrocarbon isomers) at an accuracy that exceeds that of hybrid density functional theory (DFT). The atom-centered decomposition implicit in our machine-learning approach offers some insight into the shortcomings of DFT in the prediction of this fundamental quantity of interest.
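To make the tensor character of the polarizability concrete, the sketch below uses the standard linear-response relation $\mu_i = \sum_j \alpha_{ij} E_j$ and splits a symmetric $3 \times 3$ tensor into its rotation-invariant scalar part (the isotropic polarizability, one third of the trace) and a measure of its traceless anisotropic part. The numerical tensor is a made-up example, not a value from the paper, and the function names are hypothetical. The point of the rotation check is that both quantities are unchanged when the molecule is rotated, while the tensor components themselves are not, which is the kind of behavior a symmetry-adapted model has to respect.

```python
import numpy as np

def isotropic_polarizability(alpha):
    """Scalar part of the tensor: one third of the trace."""
    return np.trace(alpha) / 3.0

def anisotropy(alpha):
    """Anisotropy of a symmetric tensor: sqrt(3/2) times the Frobenius
    norm of its traceless (deviatoric) part."""
    deviator = alpha - isotropic_polarizability(alpha) * np.eye(3)
    return np.sqrt(1.5 * np.sum(deviator**2))

def induced_dipole(alpha, field):
    """Linear response: mu_i = sum_j alpha_ij * E_j."""
    return alpha @ field

# Made-up symmetric polarizability tensor in its principal-axis frame
# (illustrative numbers only, arbitrary units).
alpha = np.diag([1.0, 2.0, 3.0])

# Rotate the "molecule" by an arbitrary angle about the z axis.
theta = 0.7
c, s = np.cos(theta), np.sin(theta)
rotation = np.array([[c, -s, 0.0],
                     [s,  c, 0.0],
                     [0.0, 0.0, 1.0]])
alpha_rotated = rotation @ alpha @ rotation.T

field = np.array([0.0, 0.0, 1.0])
print("induced dipole:", induced_dipole(alpha, field))
print("isotropic part:", isotropic_polarizability(alpha),
      isotropic_polarizability(alpha_rotated))
print("anisotropy:", anisotropy(alpha), anisotropy(alpha_rotated))
```

For a symmetric tensor, the scalar part and the traceless part correspond to the $\lambda = 0$ and $\lambda = 2$ irreducible spherical components, respectively.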
