Computing Transition Pathways for the Study of Rare Events Using Deep Reinforcement Learning

Bo Lin; Yangzheng Zhong; Weiqing Ren

Computing Transition Pathways for the Study of Rare Events Using Deep Reinforcement Learning

Bo Lin, Yangzheng Zhong, Weiqing Ren

TL;DR

This work tackles the challenge of computing transition pathways for rare events in high-dimensional systems with rough energy landscapes by reframing the problem as a cost-minimization task over a path space and introducing an effective-force extension of the Freidlin-Wentzell functional. It then solves the resulting optimization with an actor-critic reinforcement learning method based on deep deterministic policy gradient, incorporating physical invariances via a transformation $\mathcal{T}$ and using a continuous, stochastic policy to explore transition regions. The authors demonstrate the approach on a 2D system, a 10D extended Mueller potential, and a Lennard-Jones cluster, showing convergence to globally optimal pathways, robustness to landscape roughness, and agreement with established reference pathways. The method offers a scalable, exploration-driven alternative to traditional path-finding methods and holds promise for enabling accurate transition-path analyses in complex molecular and materials systems.

Abstract

Understanding the transition events between metastable states in complex systems is an important subject in the fields of computational physics, chemistry and biology. The transition pathway plays an important role in characterizing the mechanism underlying the transition, for example, in the study of conformational changes of bio-molecules. In fact, computing the transition pathway is a challenging task for complex and high-dimensional systems. In this work, we formulate the path-finding task as a cost minimization problem over a particular path space. The cost function is adapted from the Freidlin-Wentzell action functional so that it is able to deal with rough potential landscapes. The path-finding problem is then solved using a actor-critic method based on the deep deterministic policy gradient algorithm (DDPG). The method incorporates the potential force of the system in the policy for generating episodes and combines physical properties of the system with the learning process for molecular systems. The exploitation and exploration nature of reinforcement learning enables the method to efficiently sample the transition events and compute the globally optimal transition pathway. We illustrate the effectiveness of the proposed method using three benchmark systems including an extended Mueller system and the Lennard-Jones system of seven particles.

Computing Transition Pathways for the Study of Rare Events Using Deep Reinforcement Learning

TL;DR

and using a continuous, stochastic policy to explore transition regions. The authors demonstrate the approach on a 2D system, a 10D extended Mueller potential, and a Lennard-Jones cluster, showing convergence to globally optimal pathways, robustness to landscape roughness, and agreement with established reference pathways. The method offers a scalable, exploration-driven alternative to traditional path-finding methods and holds promise for enabling accurate transition-path analyses in complex molecular and materials systems.

Abstract

Paper Structure (14 sections, 1 theorem, 65 equations, 11 figures, 1 table, 1 algorithm)

This paper contains 14 sections, 1 theorem, 65 equations, 11 figures, 1 table, 1 algorithm.

Introduction
Method
Formulation of the Problem
Reinforcement Learning Method
Numerical Examples
A Two-dimensional System
Extended Mueller System
Lennard-Jones System
Conclusions
The path space $\mathbb{C}_{\gamma}$.
Numerical quadratures for approximating the integral in problem \ref{['min_Sr']}.
The parameters in Algorithm \ref{['alg2']} used for the numerical examples in Section \ref{['Examples']}.
Computing the transition tube for the Mueller system.
Construction of the transformation function $\mathcal{T}(x)$ for the Lennard-Jones system.

Key Result

Theorem 1.1

The set $\bigcup_{\gamma>0}\mathbb{C}_{\gamma}$ is a dense subset of $\bigcup_{T>0}\mathbb{C}_{[0,T]}$.

Figures (11)

Figure 1: Schematic illustration of an absolute continuous path (black solid line) connecting the two metastable states $A$ and $B$, which is approximated by a path $\varphi(t)$ (green dashed line) in $\mathbb{C}_{\gamma}$ represented by a polygonal chain with a sequence of states $(z_0,\dots,z_N)$.
Figure 2: Plots of the MEPs computed using the string method and the transition pathway computed from the actor network $\mu_{\theta}(s)$ ( Upper). Plots of the potential function $V(x)$ along the three paths ( Lower). The two MEPs are referred to as the upper/lower MEPs. The contour lines in the upper panel indicate the potential function $V(x)$ in Eq. \ref{['VV']}.
Figure 3: Plots of the computed transition pathway $\varphi_{\theta}(\alpha)$ between the metastable states $A$ and $B$ and the minimum energy path (MEP) $\varphi(\alpha)$ which are projected on the $(x_1,x_2)$ plane. The inset plot shows the potential function along the two paths. The contour lines indicate the potential function $V(x)$ in Eq. \ref{['Example1_V']}.
Figure 4: Plots of the temporal-difference (TD) loss function $L_Q$ and the average return $J_{\mu}$ versus the training steps in Algorithm \ref{['alg2']}.
Figure 5: Plot of the relative error for the path $\varphi_{\theta}$ computed from the actor network $\mu_{\theta}(s)$ versus the training steps in Algorithm \ref{['alg2']}. The error is defined in Eq. \ref{['error']}.
...and 6 more figures

Theorems & Definitions (2)

Theorem 1.1
proof

Computing Transition Pathways for the Study of Rare Events Using Deep Reinforcement Learning

TL;DR

Abstract

Computing Transition Pathways for the Study of Rare Events Using Deep Reinforcement Learning

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (11)

Theorems & Definitions (2)