LGDC: Latent Graph Diffusion via Spectrum-Preserving Coarsening
Nagham Osman, Keyue Jiang, Davide Buffelli, Xiaowen Dong, Laura Toni
TL;DR
LGDC introduces a hybrid graph generation framework that combines spectrum-preserving coarsening with latent discrete diffusion. By operating diffusion in a compact latent space and performing a single expansion–refinement decoding, LGDC achieves strong global coherence and fine-grained local fidelity while reducing computational cost compared to pure diffusion or autoregressive methods. Empirical results on Tree, Planar, and Community-20 graphs show LGDC matches autoregressive performance on locally structured data and diffusion performance on globally structured data, validating the local-global trade-off in a single pipeline. The approach offers practical efficiency and scalability advantages for structure-aware graph generation across domains, with potential extensions to richer attributes and larger heterogeneous graphs.
Abstract
Graph generation is a critical task across scientific domains. Existing methods fall broadly into two categories: autoregressive models, which iteratively expand graphs, and one-shot models, such as diffusion, which generate the full graph at once. In this work, we provide an analysis of these two paradigms and reveal a key trade-off: autoregressive models stand out in capturing fine-grained local structures, such as degree and clustering properties, whereas one-shot models excel at modeling global patterns, such as spectral distributions. Building on this, we propose LGDC (latent graph diffusion via spectrum-preserving coarsening), a hybrid framework that combines strengths of both approaches. LGDC employs a spectrum-preserving coarsening-decoarsening to bidirectionally map between graphs and a latent space, where diffusion efficiently generates latent graphs before expansion restores detail. This design captures both local and global properties with improved efficiency. Empirically, LGDC matches autoregressive models on locally structured datasets (Tree) and diffusion models on globally structured ones (Planar, Community-20), validating the benefits of hybrid generation.
