Learning Individual Behavior in Agent-Based Models with Graph Diffusion Networks
Francesco Cozzi, Marco Pangallo, Alan Perotti, André Panisson, Corrado Monti
TL;DR
The paper tackles the challenge of aligning agent-based models (ABMs) with real data by learning a differentiable surrogate that preserves micro-level, decentralized dynamics. It introduces Graph Diffusion Network (GDN), which fuses a graph neural network for local interactions with a conditional diffusion model to learn per-agent transition distributions from ABM data. The approach enables gradient-based calibration and direct estimation of agent states, demonstrated on Schelling and Predator-Prey to reproduce both micro transitions and emergent macro behavior beyond training. This data-driven framework paves the way for flexible, interpretable ABM surrogates with potential applications across economics, epidemiology, urban science, and ecology.
Abstract
Agent-Based Models (ABMs) are powerful tools for studying emergent properties in complex systems. In ABMs, agent behaviors are governed by local interactions and stochastic rules. However, these rules are, in general, non-differentiable, limiting the use of gradient-based methods for optimization, and thus integration with real-world data. We propose a novel framework to learn a differentiable surrogate of any ABM by observing its generated data. Our method combines diffusion models to capture behavioral stochasticity and graph neural networks to model agent interactions. Distinct from prior surrogate approaches, our method introduces a fundamental shift: rather than approximating system-level outputs, it models individual agent behavior directly, preserving the decentralized, bottom-up dynamics that define ABMs. We validate our approach on two ABMs (Schelling's segregation model and a Predator-Prey ecosystem) showing that it replicates individual-level patterns and accurately forecasts emergent dynamics beyond training. Our results demonstrate the potential of combining diffusion models and graph learning for data-driven ABM simulation.
