Latent Target Score Matching, with an application to Simulation-Based Inference
Joohwan Ko, Tomas Geffner
TL;DR
This work addresses high-variance diffusion-based score matching when latent variables are present by introducing Latent Target Score Matching (LTSM), which leverages the latent-target identity $\nabla_{\theta_t} \log p_t(\theta_t) = \frac{1}{\alpha(t)} \mathbb{E}_{\theta_0,z|\theta_t}[\nabla_{\theta_0} \log p(\theta_0,z)]$ to provide low-variance supervision of the marginal score. It extends Target Score Matching to latent-variable settings and couples LTSM with a diffusion-time dependent DSM mixture, enabling robust performance across noise scales. The authors derive a variance-minimizing optimal mixture weight $w_t^*$ and also train a learned schedule to approximate it, forming a MIX objective $y_{MIX} = w_t y_{DSM} + (1-w_t) y_{LTSM}$. Across simulation-based inference tasks with gray-box simulators, MIX consistently improves score accuracy and posterior quality, particularly under limited simulator budgets, demonstrating improved sample efficiency for diffusion-based SBI.
Abstract
Denoising score matching (DSM) for training diffusion models may suffer from high variance at low noise levels. Target Score Matching (TSM) mitigates this when clean data scores are available, providing a low-variance objective. In many applications clean scores are inaccessible due to the presence of latent variables, leaving only joint signals exposed. We propose Latent Target Score Matching (LTSM), an extension of TSM to leverage joint scores for low-variance supervision of the marginal score. While LTSM is effective at low noise levels, a mixture with DSM ensures robustness across noise scales. Across simulation-based inference tasks, LTSM consistently improves variance, score accuracy, and sample quality.
