Stochastic Optimal Control for Collective Variable Free Sampling of Molecular Transition Paths
Lars Holdijk, Yuanqi Du, Ferry Hooft, Priyank Jaini, Bernd Ensing, Max Welling
TL;DR
The paper tackles sampling molecular transition paths between metastable states under high-energy barriers where conventional MD struggles. It proposes PIPS, a CV-free approach that casts the problem as a Schrödinger Bridge and solves it via stochastic optimal control, specifically using Path Integral Control and the PICE framework to learn a bias that guides trajectories toward target states. The method is adapted for molecular dynamics with bias potential or force representations, smoothed loss functions, and integration with OpenMM, and it is demonstrated on Alanine Dipeptide, Polyproline, and Chignolin, showing CV-free success and competitive alignment with CV-based baselines. This CV-agnostic approach enables scalable transition-path sampling without relying on expert CV selection, potentially accelerating exploration of conformational changes in larger biomolecular systems.
Abstract
We consider the problem of sampling transition paths between two given metastable states of a molecular system, e.g. a folded and unfolded protein or products and reactants of a chemical reaction. Due to the existence of high energy barriers separating the states, these transition paths are unlikely to be sampled with standard Molecular Dynamics (MD) simulation. Traditional methods to augment MD with a bias potential to increase the probability of the transition rely on a dimensionality reduction step based on Collective Variables (CVs). Unfortunately, selecting appropriate CVs requires chemical intuition and traditional methods are therefore not always applicable to larger systems. Additionally, when incorrect CVs are used, the bias potential might not be minimal and bias the system along dimensions irrelevant to the transition. Showing a formal relation between the problem of sampling molecular transition paths, the Schrödinger bridge problem and stochastic optimal control with neural network policies, we propose a machine learning method for sampling said transitions. Unlike previous non-machine learning approaches our method, named PIPS, does not depend on CVs. We show that our method successful generates low energy transitions for Alanine Dipeptide as well as the larger Polyproline and Chignolin proteins.
