From Biased to Unbiased Dynamics: An Infinitesimal Generator Approach
Timothée Devergne, Vladimir Kostic, Michele Parrinello, Massimiliano Pontil
TL;DR
The work addresses the problem of extracting spectral properties of Langevin-type dynamics when only biased simulations are affordable, by learning the infinitesimal generator through its resolvent. It develops a debiasing framework that uses the Radon-Nikodym relationship between the biased and unbiased measures, and it optimizes a regularized energy kernel to recover the leading generator eigenpairs, which are computed from the ridge regression estimator $G=(W+\gamma I)^{-1}C$. A neural-network extension learns expressive dictionaries $z^\theta$ to capture slow modes, and is supported by a theoretical guarantee: under boundedness and sufficient approximation capacity, the leading eigenpairs converge with high probability. Empirical evaluations on one- and two-dimensional benchmarks and a small biomolecule suite show better performance than transfer-operator methods and results competitive with recent generator-learning approaches, even when biasing yields only a few transitions. The method is aimed at uncovering transition mechanisms and timescales in complex molecular systems, and it leaves open extensions to time-dependent bias and large-scale applications.
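To make the estimator $G=(W+\gamma I)^{-1}C$ more concrete, the following is a minimal NumPy sketch rather than the authors' implementation. Everything beyond the formula itself is an assumption made for illustration: the function names, the reweighting of biased samples by weights proportional to $\exp(\beta V_{\mathrm{bias}})$, the choice of $C$ as a reweighted feature covariance, the choice of $W=\eta C + D/\beta$ with $D$ the reweighted covariance of feature gradients (an overdamped Langevin setting), and the mapping $\lambda = \eta - 1/\mu$ from eigenvalues $\mu$ of $G$ to generator eigenvalues.

```python
import numpy as np


def importance_weights(bias_potential, beta):
    """Normalized weights proportional to exp(beta * V_bias(x_i)).

    Assumption: samples come from a simulation with potential V + V_bias,
    so these weights reweight biased averages toward the unbiased measure.
    """
    logw = beta * np.asarray(bias_potential, dtype=float)
    logw = logw - logw.max()  # subtract the max to avoid overflow in exp
    w = np.exp(logw)
    return w / w.sum()


def ridge_estimator(features, feature_grads, weights, beta, eta, gamma):
    """Return G = (W + gamma * I)^{-1} C from reweighted samples.

    features:      (n, m) dictionary values at each sample
    feature_grads: (n, m, d) gradients of each dictionary function
    weights:       (n,) normalized importance weights
    Assumption: W = eta * C + D / beta, an energy-type matrix for
    overdamped Langevin dynamics; the source only states the final formula.
    """
    C = np.einsum("n,ni,nj->ij", weights, features, features)
    D = np.einsum("n,nik,njk->ij", weights, feature_grads, feature_grads)
    W = eta * C + D / beta
    m = C.shape[0]
    # Solve the linear system instead of forming an explicit inverse.
    return np.linalg.solve(W + gamma * np.eye(m), C)


def generator_eigenvalues(G, eta):
    """Map eigenvalues mu of G to generator eigenvalues eta - 1 / mu.

    Assumption: G approximates the resolvent (eta * I - L)^{-1}.
    """
    mu = np.linalg.eigvals(G).real
    mu = mu[np.abs(mu) > 1e-12]  # drop numerically null directions
    return np.sort(eta - 1.0 / mu)[::-1]
```

In this sketch, `gamma` plays the usual ridge role: larger values stabilize the linear solve at the cost of shrinking the eigenvalues of `G`, which in turn shifts the recovered generator eigenvalues.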
Abstract
We investigate learning the eigenfunctions of evolution operators for time-reversal invariant stochastic processes, a prime example being the Langevin equation used in molecular dynamics. Many physical or chemical processes described by this equation involve transitions between metastable states separated by high potential barriers that can hardly be crossed during a simulation. To overcome this bottleneck, data are collected via biased simulations that explore the state space more rapidly. We propose a framework for learning from biased simulations rooted in the infinitesimal generator of the process and the associated resolvent operator. We contrast our approach to more common ones based on the transfer operator, showing that it can provably learn the spectral properties of the unbiased system from biased data. In experiments, we highlight the advantages of our method over transfer operator approaches and recent developments based on generator learning, demonstrating its effectiveness in estimating eigenfunctions and eigenvalues. Importantly, we show that even with datasets containing only a few relevant transitions due to sub-optimal biasing, our approach recovers relevant information about the transition mechanism.
