MDG: Masked Denoising Generation for Multi-Agent Behavior Modeling in Traffic Environments
Zhiyu Huang, Zewei Zhou, Tianhui Cai, Yun Zhang, Jiaqi Ma
TL;DR
MDG reframes multi-agent trajectory generation as masked denoising of structured spatiotemporal tensors, replacing diffusion steps and token-based generation with per-element noise masks and a Transformer denoiser. It supports one-step or few-step reconstruction, with flexible inference modes that enable open-loop prediction, closed-loop simulation, and planning within a single model. Trained on large real-world driving data, MDG achieves competitive closed-loop performance on Waymo Sim Agents and nuPlan benchmarks while offering efficient, controllable open-loop generation. The framework unifies diverse traffic-behavior tasks in a simple, scalable approach with strong implications for reusable, data-driven autonomous systems.
Abstract
Modeling realistic and interactive multi-agent behavior is critical to autonomous driving and traffic simulation. However, existing diffusion and autoregressive approaches are limited by iterative sampling, sequential decoding, or task-specific designs, which hinder efficiency and reuse. We propose Masked Denoising Generation (MDG), a unified generative framework that reformulates multi-agent behavior modeling as the reconstruction of independently noised spatiotemporal tensors. Instead of relying on diffusion time steps or discrete tokenization, MDG applies continuous, per-agent and per-timestep noise masks that enable localized denoising and controllable trajectory generation in a single or few forward passes. This mask-driven formulation generalizes across open-loop prediction, closed-loop simulation, motion planning, and conditional generation within one model. Trained on large-scale real-world driving datasets, MDG achieves competitive closed-loop performance on the Waymo Sim Agents and nuPlan Planning benchmarks, while providing efficient, consistent, and controllable open-loop multi-agent trajectory generation. These results position MDG as a simple yet versatile paradigm for multi-agent behavior modeling.
