Table of Contents
Fetching ...

Variational Optimality of Föllmer Processes in Generative Diffusions

Yifan Chen, Eric Vanden-Eijnden

TL;DR

This work formulates a variational view of generative diffusions that transport a point mass to a target distribution by fixing time-marginal evolution via stochastic interpolants. By allowing a posteriori tuning of the diffusion coefficient, the authors derive a KL-based criterion whose minimizer yields a Föllmer process relative to a reference diffusion determined solely by the interpolation schedules. This yields a new variational characterization of Föllmer processes, linking them to Schrödinger bridges and stochastic control without fixing the reference dynamics a priori. A key result is schedule invariance: once the diffusion is optimally tuned, the path-space KL divergence depends only on score estimation accuracy across noise scales, making different interpolation schedules statistically equivalent in this variational sense. The framework provides a principled approach to design and analyze diffusion-based samplers for probabilistic forecasting and data assimilation, with simulation-free drift estimation and robust marginal matching.

Abstract

We construct and analyze generative diffusions that transport a point mass to a prescribed target distribution over a finite time horizon using the stochastic interpolant framework. The drift is expressed as a conditional expectation that can be estimated from independent samples without simulating stochastic processes. We show that the diffusion coefficient can be tuned \emph{a~posteriori} without changing the time-marginal distributions. Among all such tunings, we prove that minimizing the impact of estimation error on the path-space Kullback--Leibler divergence selects, in closed form, a Föllmer process -- a diffusion whose path measure minimizes relative entropy with respect to a reference process determined by the interpolation schedules alone. This yields a new variational characterization of Föllmer processes, complementing classical formulations via Schrödinger bridges and stochastic control. We further establish that, under this optimal diffusion coefficient, the path-space Kullback--Leibler divergence becomes independent of the interpolation schedule, rendering different schedules statistically equivalent in this variational sense.

Variational Optimality of Föllmer Processes in Generative Diffusions

TL;DR

This work formulates a variational view of generative diffusions that transport a point mass to a target distribution by fixing time-marginal evolution via stochastic interpolants. By allowing a posteriori tuning of the diffusion coefficient, the authors derive a KL-based criterion whose minimizer yields a Föllmer process relative to a reference diffusion determined solely by the interpolation schedules. This yields a new variational characterization of Föllmer processes, linking them to Schrödinger bridges and stochastic control without fixing the reference dynamics a priori. A key result is schedule invariance: once the diffusion is optimally tuned, the path-space KL divergence depends only on score estimation accuracy across noise scales, making different interpolation schedules statistically equivalent in this variational sense. The framework provides a principled approach to design and analyze diffusion-based samplers for probabilistic forecasting and data assimilation, with simulation-free drift estimation and robust marginal matching.

Abstract

We construct and analyze generative diffusions that transport a point mass to a prescribed target distribution over a finite time horizon using the stochastic interpolant framework. The drift is expressed as a conditional expectation that can be estimated from independent samples without simulating stochastic processes. We show that the diffusion coefficient can be tuned \emph{a~posteriori} without changing the time-marginal distributions. Among all such tunings, we prove that minimizing the impact of estimation error on the path-space Kullback--Leibler divergence selects, in closed form, a Föllmer process -- a diffusion whose path measure minimizes relative entropy with respect to a reference process determined by the interpolation schedules alone. This yields a new variational characterization of Föllmer processes, complementing classical formulations via Schrödinger bridges and stochastic control. We further establish that, under this optimal diffusion coefficient, the path-space Kullback--Leibler divergence becomes independent of the interpolation schedule, rendering different schedules statistically equivalent in this variational sense.
Paper Structure (34 sections, 16 theorems, 172 equations)

This paper contains 34 sections, 16 theorems, 172 equations.

Key Result

Theorem 1.1

The SDE eq:intro:sde with $g_t = \sigma_t$ and drift where $z \sim \mathsf{N}(0,\mathrm{I})$ is independent of $x_\star$, satisfies $\mathrm{Law}(X_t) = \mathrm{Law}(I_t)$ for all $t \in [0,1]$, and in particular $X_1 \sim \mu_\star$.

Theorems & Definitions (38)

  • Theorem 1.1: Baseline generative diffusion; informal, see Theorem \ref{['thm:1:b']} for the precise statement
  • Theorem 1.2: KL-optimal diffusion coefficient; informal, see Theorem \ref{['prop-min-KL']} for the precise statement
  • Theorem 1.3: Variational characterization of Föllmer processes; informal, see Theorem \ref{['th:foll:app']} for the precise statement
  • Theorem 1.4: Schedule invariance; informal, see Theorem \ref{['thm:schedule-invariance']} for the precise statement
  • Remark 2.2
  • Definition 2.3
  • Theorem 2.4
  • Remark 2.5
  • proof : Proof of Theorem \ref{['thm:1:b']}
  • Theorem 3.1
  • ...and 28 more