Enhancing Diffusion-Based Sampling with Molecular Collective Variables
Juno Nam, Bálint Máté, Artur P. Toshev, Manasa Kaniselvan, Rafael Gómez-Bombarelli, Ricky T. Q. Chen, Brandon Wood, Guan-Horng Liu, Benjamin Kurt Miller
TL;DR
The paper addresses the difficulty of sampling Boltzmann ensembles for molecular systems with diffusion-based samplers, which often miss low-population but thermodynamically important states. It introduces WT-ASBS, a Well-Tempered Adjoint Schrödinger Bridge Sampler that injects a well-tempered bias along selected collective variables via an online repulsive potential, enabling broader exploration while preserving Boltzmann-consistent reweighting. Key contributions include a convergence guarantee to the well-tempered target, a practical training recipe (pretraining, CV selection, restraints, and sampling/refinement), and demonstrations on Ala2, Ala4, SN2, and post-TS bifurcation tasks, achieving accurate PMFs and shorter wall-clock times than standard enhanced sampling baselines. The approach bridges diffusion-based sampling with classical enhanced-sampling ideas, extending practical diffusion samplers to complex molecular landscapes and reactive energy surfaces using both classical force fields and near-DFT accuracy interatomic potentials. Overall, WT-ASBS enables efficient, Boltzmann-consistent exploration of molecular configurations and reactive pathways, offering a scalable path toward broader adoption of diffusion-based samplers in molecular sciences.
Abstract
Diffusion-based samplers learn to sample complex, high-dimensional distributions using energies or log densities alone, without training data. Yet, they remain impractical for molecular sampling because they are often slower than molecular dynamics and miss thermodynamically relevant modes. Inspired by enhanced sampling, we encourage exploration by introducing a sequential bias along bespoke, information-rich, low-dimensional projections of atomic coordinates known as collective variables (CVs). We introduce a repulsive potential centered on the CVs from recent samples, which pushes future samples towards novel CV regions and effectively increases the temperature in the projected space. Our resulting method improves efficiency, mode discovery, enables the estimation of free energy differences, and retains independent sampling from the approximate Boltzmann distribution via reweighting by the bias. On standard peptide conformational sampling benchmarks, the method recovers diverse conformational states and accurate free energy profiles. We are the first to demonstrate reactive sampling using a diffusion-based sampler, capturing bond breaking and formation with universal interatomic potentials at near-first-principles accuracy. The approach resolves reactive energy landscapes at a fraction of the wall-clock time of standard sampling methods, advancing diffusion-based sampling towards practical use in molecular sciences.
