Operator-Informed Score Matching for Markov Diffusion Models
Zheyang Shen, Huihui Wang, Marina Riabiz, Chris J. Oates
TL;DR
The paper argues that Markov diffusion operators offer theoretical and practical advantages for score-based generative modeling by exploiting their explicit spectral structure. It introduces operator-informed score matching (oism), which expresses the score across all noise levels as a linear combination of eigenfunctions of the infinitesimal generator $\mathscr{L}$ and uses $P_t\phi_n = e^{\lambda_n t}\phi_n$ to compute scores without forward-simulation. This yields a quadratic form in eigen-coefficients with a closed-form minimizer $\hat{\boldsymbol{\alpha}} = \mathbf{A}_t^{-1} \mathbf{b}_t$, producing a score estimator $\tilde{\mathbf{s}}_t(x)$. Empirically, oism accelerates training on low-dimensional tasks and complements neural estimators in high dimensions via a residual approach, though higher-order eigenfunctions do not always improve performance. The work highlights a practical path to integrating spectral diffusion-operator insights with standard diffusion models for faster, potentially more data-efficient generative modeling.
Abstract
Diffusion models are typically trained using score matching, a learning objective agnostic to the underlying noising process that guides the model. This paper argues that Markov noising processes enjoy an advantage over alternatives, as the Markov operators that govern the noising process are well-understood. Specifically, by leveraging the spectral decomposition of the infinitesimal generator of the Markov noising process, we obtain parametric estimates of the score functions simultaneously for all marginal distributions, using only sample averages with respect to the data distribution. The resulting operator-informed score matching provides both a standalone approach to sample generation for low-dimensional distributions, as well as a recipe for better informed neural score estimators in high-dimensional settings.
