Graph Diffusion that can Insert and Delete
Matteo Ninniri, Marco Podda, Davide Bacciu
TL;DR
GrIDDD introduces a size-adaptive discrete graph diffusion model that can monotically insert or delete nodes during generation, addressing the fixed graph-size limitation of prior graph DDPMs. It achieves this by modeling forward deletions with DEL/DEL* states and forward insertions with marginal-label sampling, plus a logistic-based timing schedule for insertion/deletion steps and an auxiliary predictor for reinsertions. During training, GrIDDD learns to predict node/edge types, activation times, and the number of DEL* reinforcements, while sampling alternates insertions and deletions to produce a valid target graph G^0 from a latent G^T. Empirically, GrIDDD matches or surpasses state-of-the-art in property targeting on QM9 and ZINC-250k and demonstrates competitive performance in property optimization, particularly when MW is a target, underscoring the practical impact of size-adaptive graph diffusion for molecular design.
Abstract
Generative models of graphs based on discrete Denoising Diffusion Probabilistic Models (DDPMs) offer a principled approach to molecular generation by systematically removing structural noise through iterative atom and bond adjustments. However, existing formulations are fundamentally limited by their inability to adapt the graph size (that is, the number of atoms) during the diffusion process, severely restricting their effectiveness in conditional generation scenarios such as property-driven molecular design, where the targeted property often correlates with the molecular size. In this paper, we reformulate the noising and denoising processes to support monotonic insertion and deletion of nodes. The resulting model, which we call GrIDDD, dynamically grows or shrinks the chemical graph during generation. GrIDDD matches or exceeds the performance of existing graph diffusion models on molecular property targeting despite being trained on a more difficult problem. Furthermore, when applied to molecular optimization, GrIDDD exhibits competitive performance compared to specialized optimization models. This work paves the way for size-adaptive molecular generation with graph diffusion.
