The Score-Difference Flow for Implicit Generative Modeling
Romann M. Weber
TL;DR
The paper introduces Score-Difference Flow (SD flow) as the optimal deterministic trajectory for aligning a source distribution $q$ with a target distribution $p$ by following the score difference $\nabla\log p - \nabla\log q$. It derives this flow from probability-flow dynamics and stochastic differential equations, showing that small perturbations along the SD direction minimize the KL divergence $\mathbb{D}_{\mathrm{KL}}(q||p)$ and relate to the Fisher divergence. To enable practical use, it replaces intractable $p$ and $q$ with noise- corrupted proxy distributions, and proves that aligning the proxies suffices to align the originals; it also presents a denoiser-based and kernel-based formulation, connecting SD flow to denoising diffusion models and kernel-based methods. The work further reveals that SD flow naturally emerges in GAN training under certain loss formulations, offering a unified view that links diffusion models and GANs. Comprehensive particle- and model-optimization algorithms demonstrate robustness on low-dimensional toy data, arguing that SD flow can address high sample quality, mode coverage, and fast sampling without restricting priors, paving the way for unified, efficient generative modeling approaches.
Abstract
Implicit generative modeling (IGM) aims to produce samples of synthetic data matching the characteristics of a target data distribution. Recent work (e.g. score-matching networks, diffusion models) has approached the IGM problem from the perspective of pushing synthetic source data toward the target distribution via dynamical perturbations or flows in the ambient space. In this direction, we present the score difference (SD) between arbitrary target and source distributions as a flow that optimally reduces the Kullback-Leibler divergence between them. We apply the SD flow to convenient proxy distributions, which are aligned if and only if the original distributions are aligned. We demonstrate the formal equivalence of this formulation to denoising diffusion models under certain conditions. We also show that the training of generative adversarial networks includes a hidden data-optimization sub-problem, which induces the SD flow under certain choices of loss function when the discriminator is optimal. As a result, the SD flow provides a theoretical link between model classes that individually address the three challenges of the "generative modeling trilemma" -- high sample quality, mode coverage, and fast sampling -- thereby setting the stage for a unified approach.
