Fast and Uncertainty-Aware Directional Message Passing for Non-Equilibrium Molecules
Johannes Gasteiger, Shankari Giri, Johannes T. Margraf, Stephan Günnemann
TL;DR
The paper tackles molecular property prediction in non-equilibrium regimes by introducing DimeNet++ to accelerate directional message passing without sacrificing accuracy, and by creating COLL, a large non-equilibrium dataset to stress-test models. It systematically analyzes uncertainty quantification methods, finding ensembles provide better force uncertainty at a cost and that mean-variance estimation struggles for forces. Empirically, DimeNet++ delivers substantial speedups and accuracy gains over DimeNet and outperforms SchNet on COLL, with code and data released for community use. Together, these contributions push ML for molecules beyond equilibrium dynamics toward reliable, uncertainty-aware predictions in reactive systems.
Abstract
Many important tasks in chemistry revolve around molecules during reactions. This requires predictions far from the equilibrium, while most recent work in machine learning for molecules has been focused on equilibrium or near-equilibrium states. In this paper we aim to extend this scope in three ways. First, we propose the DimeNet++ model, which is 8x faster and 10% more accurate than the original DimeNet on the QM9 benchmark of equilibrium molecules. Second, we validate DimeNet++ on highly reactive molecules by developing the challenging COLL dataset, which contains distorted configurations of small molecules during collisions. Finally, we investigate ensembling and mean-variance estimation for uncertainty quantification with the goal of accelerating the exploration of the vast space of non-equilibrium structures. Our DimeNet++ implementation as well as the COLL dataset are available online.
