Diffusion Density Estimators
Akhil Premkumar
TL;DR
The paper investigates diffusion models as neural density estimators for $p_d$ and highlights limitations of the standard Probability Flow ODE approach. It introduces a parallelizable path-integral Monte Carlo method to estimate log-densities without solving a flow, moving derivatives to a Gaussian transition term and enabling vectorized computation. Through experiments on Gaussian mixtures, it studies how training parameters ($N$, $n_t$, $n_{ep}$) and the choice of entropy matching vs score matching influence accuracy and efficiency, finding entropy matching to be faster and more memory-efficient. The work demonstrates a viable, scalable diffusion-based density estimator and highlights directions for applying to higher-dimensional data and inference tasks.
Abstract
We investigate the use of diffusion models as neural density estimators. The current approach to this problem involves converting the generative process to a smooth flow, known as the Probability Flow ODE. The log density at a given sample can be obtained by solving the ODE with a black-box solver. We introduce a new, highly parallelizable method that computes log densities without the need to solve a flow. Our approach is based on estimating a path integral by Monte Carlo, in a manner identical to the simulation-free training of diffusion models. We also study how different training parameters affect the accuracy of the density calculation, and offer insights into how these models can be made more scalable and efficient.
