Table of Contents
Fetching ...

Generative Modeling through Spectral Analysis of Koopman Operator

Yuanchao Xu, Fengyi Li, Masahiro Fujisawa, Youssef Marzouk, Isao Ishikawa

TL;DR

The paper tackles the challenge of generative modeling without access to explicit target densities by proposing KSWGD, a training-free method that marries Koopman spectral analysis with Wasserstein gradient flows. By estimating a finite-rank surrogate for the inverse Langevin generator from trajectory data, KSWGD achieves fast, linearly convergent sampling and provides explicit error bounds tied to spectral truncation and data quality. The framework is grounded in a Feynman-Kac interpretation and validated across diverse tasks, including manifold sampling, metastable systems, MNIST latent generation, and SPDEs, often outperforming data-driven baselines that rely on neural networks. The work offers a scalable alternative to neural training for high-quality sample generation and opens avenues for conditional sampling via nonzero killing rates.

Abstract

We propose Koopman Spectral Wasserstein Gradient Descent (KSWGD), a generative modeling framework that combines operator-theoretic spectral analysis with optimal transport. The novel insight is that the spectral structure required for accelerated Wasserstein gradient descent can be directly estimated from trajectory data via Koopman operator approximation which can eliminate the need for explicit knowledge of the target potential or neural network training. We provide rigorous convergence analysis and establish connection to Feynman-Kac theory that clarifies the method's probabilistic foundation. Experiments across diverse settings, including compact manifold sampling, metastable multi-well systems, image generation, and high dimensional stochastic partial differential equation, demonstrate that KSWGD consistently achieves faster convergence than other existing methods while maintaining high sample quality.

Generative Modeling through Spectral Analysis of Koopman Operator

TL;DR

The paper tackles the challenge of generative modeling without access to explicit target densities by proposing KSWGD, a training-free method that marries Koopman spectral analysis with Wasserstein gradient flows. By estimating a finite-rank surrogate for the inverse Langevin generator from trajectory data, KSWGD achieves fast, linearly convergent sampling and provides explicit error bounds tied to spectral truncation and data quality. The framework is grounded in a Feynman-Kac interpretation and validated across diverse tasks, including manifold sampling, metastable systems, MNIST latent generation, and SPDEs, often outperforming data-driven baselines that rely on neural networks. The work offers a scalable alternative to neural training for high-quality sample generation and opens avenues for conditional sampling via nonzero killing rates.

Abstract

We propose Koopman Spectral Wasserstein Gradient Descent (KSWGD), a generative modeling framework that combines operator-theoretic spectral analysis with optimal transport. The novel insight is that the spectral structure required for accelerated Wasserstein gradient descent can be directly estimated from trajectory data via Koopman operator approximation which can eliminate the need for explicit knowledge of the target potential or neural network training. We provide rigorous convergence analysis and establish connection to Feynman-Kac theory that clarifies the method's probabilistic foundation. Experiments across diverse settings, including compact manifold sampling, metastable multi-well systems, image generation, and high dimensional stochastic partial differential equation, demonstrate that KSWGD consistently achieves faster convergence than other existing methods while maintaining high sample quality.

Paper Structure

This paper contains 31 sections, 3 theorems, 59 equations, 12 figures, 2 algorithms.

Key Result

Proposition 4.3

Assume that Assumptions ass:regularity and ass:spec_tail_bd holds. Let $(\mu_t)_{t\geq 0}$ be the solution to the truncated LAWGD dynamics with exact eigenpairs and initial distribution $\mu_0$: where $\mathcal{K}_r = \sum_{i=1}^r \frac{1}{\lambda_i}\langle \cdot, \phi_i\rangle_\pi\phi_i$ defined in eq:Kr_def satisfies eq:main_proj. Then

Figures (12)

  • Figure 1: 1-Sphere $S^1$ example using KSWGD with Kernel-EDMD.
  • Figure 2: Quadruple potential well example using KSWGD with SDMD.
  • Figure 3: MNIST example using KSWGD with EDMD (Dictionary Learning).
  • Figure 4: Allen-Cahn equation example using KSWGD with EDMD (Polynomial): $\epsilon = 0.01$
  • Figure 5: 1-Sphere $S^1$ example using DMPS with different step size $h$.
  • ...and 7 more figures

Theorems & Definitions (14)

  • Proposition 4.3: Convergence with spectral truncation
  • proof
  • Remark 4.4
  • Remark 4.5
  • Remark 4.7
  • Theorem 4.8: Error Bound Analysis
  • proof
  • Remark 4.9
  • Corollary 4.10: Discrete-Time Linear Convergence
  • proof
  • ...and 4 more