Sample and Map from a Single Convex Potential: Generation using Conjugate Moment Measures
Nina Vesseron, Louis Béthune, Marco Cuturi
TL;DR
This work proposes a shift from the standard two-block generative paradigm by introducing conjugate moment measures, which express a target density ρ as ρ = ∇ w^* # 𝔓_w with 𝔓_w ∝ e^{−w}. The authors establish a theoretical existence result via a Schauder fixed-point argument and provide practical learning and sampling procedures, including an ICNN-based parameterization of w and two methods CMFGen and CMFMA for learning from samples or energies. They derive Monge–Ampère based relations linking ρ to the conjugate potential and demonstrate the approach on univariate, 2D, and high-dimensional tasks such as MNIST and Cartoon image generation and inpainting, where conjugate-based sampling outperforms standard generative ICNN baselines. The results suggest that aligning the Gibbs factor with the target distribution improves sampling quality and opens avenues for using 𝔓_w as a pre-trained noise model within broader generative pipelines.
Abstract
The canonical approach in generative modeling is to split model fitting into two blocks: define first how to sample noise (e.g. Gaussian) and choose next what to do with it (e.g. using a single map or flows). We explore in this work an alternative route that ties sampling and mapping. We find inspiration in moment measures, a result that states that for any measure $ρ$, there exists a unique convex potential $u$ such that $ρ=\nabla u \sharp e^{-u}$. While this does seem to tie effectively sampling (from log-concave distribution $e^{-u}$) and action (pushing particles through $\nabla u$), we observe on simple examples (e.g., Gaussians or 1D distributions) that this choice is ill-suited for practical tasks. We study an alternative factorization, where $ρ$ is factorized as $\nabla w^*\sharp e^{-w}$, where $w^*$ is the convex conjugate of a convex potential $w$. We call this approach conjugate moment measures, and show far more intuitive results on these examples. Because $\nabla w^*$ is the Monge map between the log-concave distribution $e^{-w}$ and $ρ$, we rely on optimal transport solvers to propose an algorithm to recover $w$ from samples of $ρ$, and parameterize $w$ as an input-convex neural network. We also address the common sampling scenario in which the density of $ρ$ is known only up to a normalizing constant, and propose an algorithm to learn $w$ in this setting.
