Table of Contents
Fetching ...

Optimal Low-dimensional Approximation of Transfer Operators via Flow Matching: Computation and Error Analysis

Zhicheng Zhang, Ling Guo, Hao Wu

TL;DR

The paper addresses the challenge of extracting low-dimensional reaction coordinates that preserve long-term dynamics by connecting lumpability and decomposability to reduced transfer operators. It introduces flow matching (FM) and its RC-augmented variant FMRC to learn RCs from data, with a loss that upper-bounds the discrepancy between reduced and full transfer operators. Theoretical results establish bounds on operator errors in terms of Wasserstein-2 distance and $\dot{H}^{-1}$ norms, showing how RC quality controls global dynamics accuracy; a practical SGD-based training procedure is provided. Numerical experiments on a drift-diffusion system with a seven-well structure demonstrate that the learned RCs separate metastable states while preserving transition dynamics, validating FMRC as a scalable, principled approach for non-equilibrium data. Overall, FMRC offers a rigorous, data-driven path to obtain reduced-order operators and robust RCs for complex high-dimensional systems.

Abstract

Reaction coordinates (RCs) are low-dimensional representations of complex dynamical systems that capture their long-term dynamics. In this work, we focus on the criteria of lumpability and decomposability, previously established for assessing RCs, and propose a new flow matching approach for the analysis and optimization of reaction coordinates based on these criteria. This method effectively utilizes data to quantitatively determine whether a given RC satisfies these criteria and enables end-to-end optimization of the reaction coordinate mapping model. Furthermore, we provide a theoretical analysis of the relationship between the loss function used in our approach and the operator error induced by dimension reduction.

Optimal Low-dimensional Approximation of Transfer Operators via Flow Matching: Computation and Error Analysis

TL;DR

The paper addresses the challenge of extracting low-dimensional reaction coordinates that preserve long-term dynamics by connecting lumpability and decomposability to reduced transfer operators. It introduces flow matching (FM) and its RC-augmented variant FMRC to learn RCs from data, with a loss that upper-bounds the discrepancy between reduced and full transfer operators. Theoretical results establish bounds on operator errors in terms of Wasserstein-2 distance and norms, showing how RC quality controls global dynamics accuracy; a practical SGD-based training procedure is provided. Numerical experiments on a drift-diffusion system with a seven-well structure demonstrate that the learned RCs separate metastable states while preserving transition dynamics, validating FMRC as a scalable, principled approach for non-equilibrium data. Overall, FMRC offers a rigorous, data-driven path to obtain reduced-order operators and robust RCs for complex high-dimensional systems.

Abstract

Reaction coordinates (RCs) are low-dimensional representations of complex dynamical systems that capture their long-term dynamics. In this work, we focus on the criteria of lumpability and decomposability, previously established for assessing RCs, and propose a new flow matching approach for the analysis and optimization of reaction coordinates based on these criteria. This method effectively utilizes data to quantitatively determine whether a given RC satisfies these criteria and enables end-to-end optimization of the reaction coordinate mapping model. Furthermore, we provide a theoretical analysis of the relationship between the loss function used in our approach and the operator error induced by dimension reduction.
Paper Structure (16 sections, 7 theorems, 47 equations, 2 figures)

This paper contains 16 sections, 7 theorems, 47 equations, 2 figures.

Key Result

Lemma 3.4

\newlabellem:simple-decomp0 A system is decomposable with respect to $r$ if and only if there exists a function $p_{-\tau}^D\colon \mathbb{Z} \times \mathbb{X} \to \mathbb{R}^+$ such that

Figures (2)

  • Figure 1: (a) Energy landscape: circular potential with seven wells. (b) Clustering of 2D data, generated by potential $V'(x_1,x_2)$. (c) Corresponding 3D data generated by full potential $V(x_1,x_2,x_3)$. (d) Swiss roll data, by applying a nonlinear transformation to data in (c).
  • Figure 2: (a) Plot RC $r(x)$ onto 2D projected data. The values of the reaction coordinate distinguish the different wells. (b) Apply PCCA+ to the projected data to obtain clustering results and analyze the values of the reaction coordinate within each cluster. (c) The histogram of the RC values, categorized by the PCCA+ results, shows clear gaps between different clusters.

Theorems & Definitions (22)

  • Definition 2.1
  • Definition 3.1: Lumpability
  • Definition 3.2: Decomposability
  • Remark 3.3
  • Lemma 3.4
  • Proof 1
  • Theorem 4.1
  • Proof 2
  • Remark 4.2
  • Definition 5.1: Reduced-Order Transfer Operators
  • ...and 12 more