Improving Adversarial Energy-Based Model via Diffusion Process
Cong Geng, Tian Han, Peng-Tao Jiang, Hao Zhang, Jinwei Chen, Søren Hauberg, Bo Li
TL;DR
This work addresses the difficulty of training energy-based models (EBMs) with expensive MCMC by embedding adversarial EBMs into a denoising diffusion process, breaking long generation into smaller conditional steps. It introduces a generator-driven variational distribution and a symmetric Jeffrey divergence to stabilize training and better match distributions, along with a gradient-penalty term to stabilize energy optimization. The approach yields substantial gains in sample quality and density estimation over existing adversarial EBMs, performs competitively with diffusion-based models on image generation, and demonstrates useful out-of-distribution detection via energy scores. The framework offers a scalable, MCMC-free pathway to joint generation and density estimation, with practical impact on tasks requiring both high-fidelity samples and tractable likelihoods.
Abstract
Generative models have shown strong generation ability while efficient likelihood estimation is less explored. Energy-based models~(EBMs) define a flexible energy function to parameterize unnormalized densities efficiently but are notorious for being difficult to train. Adversarial EBMs introduce a generator to form a minimax training game to avoid expensive MCMC sampling used in traditional EBMs, but a noticeable gap between adversarial EBMs and other strong generative models still exists. Inspired by diffusion-based models, we embedded EBMs into each denoising step to split a long-generated process into several smaller steps. Besides, we employ a symmetric Jeffrey divergence and introduce a variational posterior distribution for the generator's training to address the main challenges that exist in adversarial EBMs. Our experiments show significant improvement in generation compared to existing adversarial EBMs, while also providing a useful energy function for efficient density estimation.
