Target Score Matching
Valentin De Bortoli, Michael Hutchinson, Peter Wirnsberger, Arnaud Doucet
TL;DR
This work tackles the poor low-noise performance of denoising score matching by introducing Target Score Identity (TSI), which uses the known score of the clean target $\nabla \log p_X(x)$ to derive Target Score Matching (TSM) losses that exhibit favorable low-noise behavior. For additive noise, TSI yields $\nabla \log p_Y(y)=\int \nabla \log p_X(x)\, p_{X|Y}(x|y)\, \mathrm{d}x$, while DSM relies on $\nabla \log p_{Y|X}(y|x)$; the paper proves a relationship $\ell_{\text{TSM}}(\theta)=\ell_{\text{DSM}}(\theta)+$ constants and shows substantial variance reductions in practice. The authors extend these ideas to non-additive noise, Lie groups, and Bridge Matching, deriving corresponding score identities and practical estimators that can be implemented via MCMC or regression targets. Experiments on analytic targets and trained models demonstrate reduced estimator variance at low noise and faster convergence, validating the approach for physics-informed and manifold-valued applications. Overall, target-informed score matching provides a principled pathway to more stable, low-noise score estimation in diffusion-like models with broad applicability.
Abstract
Denoising Score Matching estimates the score of a noised version of a target distribution by minimizing a regression loss and is widely used to train the popular class of Denoising Diffusion Models. A well known limitation of Denoising Score Matching, however, is that it yields poor estimates of the score at low noise levels. This issue is particularly unfavourable for problems in the physical sciences and for Monte Carlo sampling tasks for which the score of the clean original target is known. Intuitively, estimating the score of a slightly noised version of the target should be a simple task in such cases. In this paper, we address this shortcoming and show that it is indeed possible to leverage knowledge of the target score. We present a Target Score Identity and corresponding Target Score Matching regression loss which allows us to obtain score estimates admitting favourable properties at low noise levels.
