A Malliavin calculus approach to score functions in diffusion generative models
Ehsan Mirafzali, Frank Proske, Utkarsh Gupta, Daniele Venturi, Razvan Marinescu
TL;DR
The paper addresses the challenge of obtaining a tractable score function for nonlinear diffusion processes used in score-based diffusion models. It develops a closed-form, Malliavin-calculus–based representation of the score $\nabla_y \log p_T(y)$ as $-\mathbb{E}[\delta(u_k)\mid X_T=y]$, where $\delta(u_k)$ is a Skorokhod integral expressed entirely through first and second variation processes, thus eliminating explicit Malliavin derivatives from the final formula. The main contribution is a novel BEL-type formula that converts abstract Malliavin objects into pathwise, computable quantities, plus a state-independent diffusion simplification relevant for common diffusion schedulers. This framework provides a principled foundation for designing new sampling and score-estimation algorithms for nonlinear SDEs, with potential impact on high-fidelity data generation and applications requiring nonlinear diffusion dynamics. The results extend to broader diffusion classes and offer a robust, implementable route to accurate score estimation in nonlinear diffusion-based generative modelling.
Abstract
Score-based diffusion generative models have recently emerged as a powerful tool for modelling complex data distributions. These models aim at learning the score function, which defines a map from a known probability distribution to the target data distribution via deterministic or stochastic differential equations (SDEs). The score function is typically estimated from data using a variety of approximation techniques, such as denoising or sliced score matching, Hyvärien's method, or Schrödinger bridges. In this paper, we derive an exact, closed-form, expression for the score function for a broad class of nonlinear diffusion generative models. Our approach combines modern stochastic analysis tools such as Malliavin derivatives and their adjoint operators (Skorokhod integrals or Malliavin Divergence) with a new Bismut-type formula. The resulting expression for the score function can be written entirely in terms of the first and second variation processes, with all Malliavin derivatives systematically eliminated, thereby enhancing its practical applicability. The theoretical framework presented in this work offers a principled foundation for advancing score estimation methods in generative modelling, enabling the design of new sampling algorithms for complex probability distributions. Our results can be extended to broader classes of stochastic differential equations, opening new directions for the development of score-based diffusion generative models.
