Flow Matching for Optimal Reaction Coordinates of Biomolecular System
Mingyuan Zhang, Zhicheng Zhang, Hao Wu, Yong Wang
TL;DR
The paper tackles the challenge of identifying optimal reaction coordinates (RCs) for reversible biomolecular dynamics without relying on pre-defined state labels or explicit transfer-operator eigenfunctions. It introduces Flow Matching for Reaction Coordinates (FMRC), which rephrases lumpability and decomposability as conditional-probability targets and implements them with an encoder and continuous normalizing-flow decoders trained via a simulation-free flow-matching objective. Across three biomolecular systems (CLN025, Trp-Cage, NTL-9), FMRC consistently preserves more of the system’s slow dynamics in a 2D RC space than VAC-based methods and exhibits lower training variance, while enabling deeper insights into metastable networks via MSMs and PCCA+ analyses. The study also demonstrates FMRC’s practical utility for enhanced sampling by bias deposition in Ala2, indicating potential for improving MSM construction and CV-based sampling with minimal prior knowledge and robust performance. Overall, FMRC offers a scalable, principled framework to obtain informative low-dimensional RCs that faithfully reflect the underlying transfer dynamics and facilitate downstream applications.
Abstract
We present flow matching for reaction coordinates (FMRC), a novel deep learning algorithm designed to identify optimal reaction coordinates (RC) in biomolecular reversible dynamics. FMRC is based on the mathematical principles of lumpability and decomposability, which we reformulate into a conditional probability framework for efficient data-driven optimization using deep generative models. While FMRC does not explicitly learn the well-established transfer operator or its eigenfunctions, it can effectively encode the dynamics of leading eigenfunctions of the system transfer operator into its low-dimensional RC space. We further quantitatively compare its performance with several state-of-the-art algorithms by evaluating the quality of Markov state models (MSM) constructed in their respective RC spaces, demonstrating the superiority of FMRC in three increasingly complex biomolecular systems. In addition, we successfully demonstrated the efficacy of FMRC for bias deposition in the enhanced sampling of a simple model system. Finally, we discuss its potential applications in downstream applications such as enhanced sampling methods and MSM construction.
