Efficient and Scalable Graph Generation through Iterative Local Expansion
Andreas Bergmeister, Karolis Martinkus, Nathanaël Perraudin, Roger Wattenhofer
TL;DR
This work tackles the scalability gap in graph generation by introducing iterative local expansion, which grows graphs from a single node and refines local structure via diffusion. It combines a coarsening-inversion framework with a diffusion-based denoiser, a locally expressive Local PPGN, and spectral conditioning to preserve global properties while maintaining subquadratic runtime for sparse graphs. The method achieves state-of-the-art or competitive results on standard benchmarks, scales to graphs with thousands of nodes, and uniquely extrapolates to out-of-distribution sizes while retaining key structural characteristics. These contributions enable robust, scalable graph generation with practical applicability to large, real-world graphs.
Abstract
In the realm of generative models for graphs, extensive research has been conducted. However, most existing methods struggle with large graphs due to the complexity of representing the entire joint distribution across all node pairs and capturing both global and local graph structures simultaneously. To overcome these issues, we introduce a method that generates a graph by progressively expanding a single node to a target graph. In each step, nodes and edges are added in a localized manner through denoising diffusion, building first the global structure, and then refining the local details. The local generation avoids modeling the entire joint distribution over all node pairs, achieving substantial computational savings with subquadratic runtime relative to node count while maintaining high expressivity through multiscale generation. Our experiments show that our model achieves state-of-the-art performance on well-established benchmark datasets while successfully scaling to graphs with at least 5000 nodes. Our method is also the first to successfully extrapolate to graphs outside of the training distribution, showcasing a much better generalization capability over existing methods.
