Generative Learning of Densities on Manifolds
Dimitris G. Giovanis, Ellis Crabtree, Roger G. Ghanem, Ioannis G. Kevrekidis
TL;DR
This work addresses sampling high-dimensional data densities constrained to unknown low-dimensional manifolds. It combines diffusion-model-based generative methods with manifold learning by using Diffusion Maps to uncover latent coordinates and Double Diffusion Maps to lift samples back to the ambient space. Two latent-space sampling strategies are proposed: a score-based diffusion model (m-SGM) and a probabilistic Itô SDE/PLoM approach (m-PLoM). Through a synthetic S-shaped dataset and a multiscale material system, the framework demonstrates efficient, manifold-respecting density estimation and realistic realizations suitable for applications in digital twins and uncertainty quantification.
Abstract
A generative modeling framework is proposed that combines diffusion models and manifold learning to efficiently sample data densities on manifolds. The approach utilizes Diffusion Maps to uncover possible low-dimensional underlying (latent) spaces in the high-dimensional data (ambient) space. Two approaches for sampling from the latent data density are described. The first is a score-based diffusion model, which is trained to map a standard normal distribution to the latent data distribution using a neural network. The second one involves solving an Itô stochastic differential equation in the latent space. Additional realizations of the data are generated by lifting the samples back to the ambient space using Double Diffusion Maps, a recently introduced technique typically employed in studying dynamical system reduction; here the focus lies in sampling densities rather than system dynamics. The proposed approaches enable sampling high dimensional data densities restricted to low-dimensional, a priori unknown manifolds. The efficacy of the proposed framework is demonstrated through a benchmark problem and a material with multiscale structure.
