Graph Generation with Diffusion Mixture
Jaehyeong Jo, Dongki Kim, Sung Ju Hwang
TL;DR
This work tackles the challenge of generating graphs with accurate topology using diffusion models. It introduces GruM, a diffusion framework that explicitly models graph topology by predicting the final graph through a graph mixture, implemented as a weighted mean of endpoints from endpoint-conditioned OU bridge processes. The training objective derives from a likelihood bound via the Girsanov theorem, enabling simulation-free learning of the graph mixture with an efficient objective that maximizes data likelihood. GruM demonstrates strong performance across general graphs and 2D/3D molecular generation tasks, outperforming previous diffusion-based and graph-generative baselines in topology accuracy and molecule stability. The approach supports both discrete and continuous features, achieves fast convergence of the graph mixture, and offers practical benefits such as early stopping to reduce generation time, with potential impact on drug design and related domains.
Abstract
Generation of graphs is a major challenge for real-world tasks that require understanding the complex nature of their non-Euclidean structures. Although diffusion models have achieved notable success in graph generation recently, they are ill-suited for modeling the topological properties of graphs since learning to denoise the noisy samples does not explicitly learn the graph structures to be generated. To tackle this limitation, we propose a generative framework that models the topology of graphs by explicitly learning the final graph structures of the diffusion process. Specifically, we design the generative process as a mixture of endpoint-conditioned diffusion processes which is driven toward the predicted graph that results in rapid convergence. We further introduce a simple parameterization of the mixture process and develop an objective for learning the final graph structure, which enables maximum likelihood training. Through extensive experimental validation on general graph and 2D/3D molecule generation tasks, we show that our method outperforms previous generative models, generating graphs with correct topology with both continuous (e.g. 3D coordinates) and discrete (e.g. atom types) features. Our code is available at https://github.com/harryjo97/GruM.
