Likelihood Matching for Diffusion Models
Lei Qian, Wu Su, Yanqi Huang, Song Xi Chen
TL;DR
Likelihood Matching (LM) reframes diffusion model training as direct likelihood optimization by leveraging the equivalence between the data distribution likelihood and the reverse-time path likelihood, and it uses a quasi-maximum likelihood approach with Gaussian proxies for reverse transitions. Score and Hessian functions are learned via neural nets to perform joint score and covariance matching, forming a LM objective that directly targets the data likelihood. The authors establish consistency and non-asymptotic convergence guarantees for the LM sampler and demonstrate empirical gains over score matching on synthetic mixtures and image datasets, with ablations highlighting the benefit of Hessian information and practical rank settings. Overall, LM provides a principled, likelihood-based alternative to score matching that improves sample fidelity and offers a scalable training paradigm for diffusion models.
Abstract
We propose a Likelihood Matching approach for training diffusion models by first establishing an equivalence between the likelihood of the target data distribution and a likelihood along the sample path of the reverse diffusion. To efficiently compute the reverse sample likelihood, a quasi-likelihood is considered to approximate each reverse transition density by a Gaussian distribution with matched conditional mean and covariance, respectively. The score and Hessian functions for the diffusion generation are estimated by maximizing the quasi-likelihood, ensuring a consistent matching of both the first two transitional moments between every two time points. A stochastic sampler is introduced to facilitate computation that leverages both the estimated score and Hessian information. We establish consistency of the quasi-maximum likelihood estimation, and provide non-asymptotic convergence guarantees for the proposed sampler, quantifying the rates of the approximation errors due to the score and Hessian estimation, dimensionality, and the number of diffusion steps. Empirical and simulation evaluations demonstrate the effectiveness of the proposed Likelihood Matching and validate the theoretical results.
