Energy-based generator matching: A neural sampler for general state space
Dongyeop Woo, Minsu Kim, Minkyu Kim, Kiyoung Seong, Sungsoo Ahn
TL;DR
Energy-based generator matching (EGM) tackles sampling from energy-based targets $p_{target}(x) \propto \exp(-\\mathcal{E}(x))$ without equilibrium samples by learning neural samplers for general CTMPs. It extends generator matching to energy-driven training, using self-normalized importance sampling (SNIS) to estimate the marginal generator and a bootstrapping scheme with intermediate energies to reduce variance. The framework supports diffusion, flow, and discrete jumps across continuous, discrete, and mixed state spaces, and is validated on Ising models and multimodal discrete-continuous tasks, showing robust mode coverage and competitive energy-based Wasserstein metrics. This approach broadens neural-sampler applicability beyond diffusion models, enabling efficient, simulation-free training for complex energy landscapes with practical impact in physics-inspired learning and multimodal generation.
Abstract
We propose Energy-based generator matching (EGM), a modality-agnostic approach to train generative models from energy functions in the absence of data. Extending the recently proposed generator matching, EGM enables training of arbitrary continuous-time Markov processes, e.g., diffusion, flow, and jump, and can generate data from continuous, discrete, and a mixture of two modalities. To this end, we propose estimating the generator matching loss using self-normalized importance sampling with an additional bootstrapping trick to reduce variance in the importance weight. We validate EGM on both discrete and multimodal tasks up to 100 and 20 dimensions, respectively.
