Transferable Boltzmann Generators
Leon Klein, Frank Noé
TL;DR
The paper tackles the enduring challenge of sampling equilibrium molecular ensembles by introducing transferable Boltzmann Generators (TBGs) built on continuous normalizing flows and flow matching. These models learn a transfer strategy across chemical space, enabling zero-shot Boltzmann sampling and efficient reweighting for unseen dipeptides, with architecture that encodes topology through equivariant graph networks. Empirical results on alanine dipeptide and 2AA dipeptides show superior effective sample sizes, accurate free-energy projections, and reliable metastable-state coverage for unseen systems, often with data-efficient training. This work suggests a path toward scalable, transferable, and fast Boltzmann sampling for small molecules, with potential extensions to larger systems and more expensive force fields while acknowledging limitations such as control over convergence and the need for topology validation during inference.
Abstract
The generation of equilibrium samples of molecular systems has been a long-standing problem in statistical physics. Boltzmann Generators are a generative machine learning method that addresses this issue by learning a transformation via a normalizing flow from a simple prior distribution to the target Boltzmann distribution of interest. Recently, flow matching has been employed to train Boltzmann Generators for small molecular systems in Cartesian coordinates. We extend this work and propose a first framework for Boltzmann Generators that are transferable across chemical space, such that they predict zero-shot Boltzmann distributions for test molecules without being retrained for these systems. These transferable Boltzmann Generators allow approximate sampling from the target distribution of unseen systems, as well as efficient reweighting to the target Boltzmann distribution. The transferability of the proposed framework is evaluated on dipeptides, where we show that it generalizes efficiently to unseen systems. Furthermore, we demonstrate that our proposed architecture enhances the efficiency of Boltzmann Generators trained on single molecular systems.
