Are neural scaling laws leading quantum chemistry astray?
Siwoo Lee, Adji Bousso Dieng
TL;DR
This work tests whether neural scaling laws can yield physically faithful quantum-chemical predictions by scaling model capacity and data on large QM datasets and evaluating on the H$_2$ bond-dissociation curve. Despite improvements in predictive error with more data and larger models, the models fail to reproduce the H$_2$ energy curve and, strikingly, cannot learn basic Coulomb’s law, even when explicit inductive biases are present in foundation models. Foundation-model approaches may capture the H$_2$ curve but lack transferability to other diatomics, revealing a fundamental limitation of scaling-based strategies for learning electronic structure. The study argues that scaling alone does not guarantee physical generalization and calls for physics-informed, data-efficient methods to achieve reliable, extrapolatable quantum-chemical predictions. The results highlight a critical boundary for current large-scale ML in quantum chemistry and motivate alternative strategies beyond pure scaling.
Abstract
Neural scaling laws are driving the machine learning community toward training ever-larger foundation models across domains, assuring high accuracy and transferable representations for extrapolative tasks. We test this promise in quantum chemistry by scaling model capacity and training data from quantum chemical calculations. As a generalization task, we evaluate the resulting models' predictions of the bond dissociation energy of neutral H$_2$, the simplest possible molecule. We find that, regardless of dataset size or model capacity, models trained only on stable structures fail dramatically to even qualitatively reproduce the H$_2$ energy curve. Only when compressed and stretched geometries are explicitly included in training do the predictions roughly resemble the correct shape. Nonetheless, the largest foundation models trained on the largest and most diverse datasets containing dissociating diatomics exhibit serious failures on simple diatomic molecules. Most strikingly, they cannot reproduce the trivial repulsive energy curve of two bare protons, revealing their failure to learn the basic Coulomb's law involved in electronic structure theory. These results suggest that scaling alone is insufficient for building reliable quantum chemical models.
