Dynamic Generation of Multi-LLM Agents Communication Topologies with Graph Diffusion Models
Eric Hanchen Jiang, Guancheng Wan, Sophia Yin, Mengting Li, Yuchen Wu, Xiao Liang, Xinfeng Li, Yizhou Sun, Wei Wang, Kai-Wei Chang, Ying Nian Wu
TL;DR
This work tackles the dynamic design of communication topologies for LLM-driven multi-agent systems by introducing Guided Topology Diffusion (GTD), a conditional discrete graph-diffusion framework that generates task-specific topologies conditioned on context $C$. GTD integrates a lightweight surrogate reward model with a diffusion-based graph generator and employs zeroth-order guidance to steer per-step denoising toward high-reward topologies, balancing multi-objective criteria such as Utility, Token Cost, Sparsity, and Robustness. Empirically, GTD achieves state-of-the-art or highly competitive task performance across GSM8K, MATH, MultiArith, and SVAMP, while significantly reducing communication tokens and exhibiting strong robustness to agent failures. The combination of diffusion-based topology synthesis and gradient-free, proxy-guided optimization offers a scalable, data-efficient approach to Pareto-optimal topology design, with practical implications for energy-efficient and reliable multi-agent reasoning with LLMs.
Abstract
The efficiency of multi-agent systems driven by large language models (LLMs) largely hinges on their communication topology. However, designing an optimal topology is a non-trivial challenge, as it requires balancing competing objectives such as task performance, communication cost, and robustness. Existing frameworks often rely on static or hand-crafted topologies, which inherently fail to adapt to diverse task requirements, leading to either excessive token consumption for simple problems or performance bottlenecks for complex ones. To address this challenge, we introduce a novel generative framework called \textit{Guided Topology Diffusion (GTD)}. Inspired by conditional discrete graph diffusion models, GTD formulates topology synthesis as an iterative construction process. At each step, the generation is steered by a lightweight proxy model that predicts multi-objective rewards (e.g., accuracy, utility, cost), enabling real-time, gradient-free optimization towards task-adaptive topologies. This iterative, guided synthesis process distinguishes GTD from single-step generative frameworks, enabling it to better navigate complex design trade-offs. We validated GTD across multiple benchmarks, and experiments show that this framework can generate highly task-adaptive, sparse, and efficient communication topologies, significantly outperforming existing methods in LLM agent collaboration.
