MotionDiffuser: Controllable Multi-Agent Motion Prediction using Diffusion
Chiyu Max Jiang, Andre Cornman, Cheolho Park, Ben Sapp, Yin Zhou, Dragomir Anguelov
TL;DR
MotionDiffuser introduces a diffusion-model-based framework for joint multi-agent trajectory prediction that is permutation-invariant and capable of modeling highly multimodal futures. It combines a transformer-based set denoiser with PCA-augmented latent diffusion to efficiently represent trajectories and enable exact log-probability inference. A flexible constrained sampling scheme using differentiable costs (attractor and repeller) allows controllable trajectory synthesis, making it suitable for enforcing rules, priors, and custom scenarios. The method delivers state-of-the-art results on the Waymo Open Motion Dataset Interactive split and demonstrates robust ablations and controllable generation capabilities. Overall, MotionDiffuser advances probabilistic, interactive, and controllable motion forecasting for multi-agent systems in autonomous driving contexts.
Abstract
We present MotionDiffuser, a diffusion based representation for the joint distribution of future trajectories over multiple agents. Such representation has several key advantages: first, our model learns a highly multimodal distribution that captures diverse future outcomes. Second, the simple predictor design requires only a single L2 loss training objective, and does not depend on trajectory anchors. Third, our model is capable of learning the joint distribution for the motion of multiple agents in a permutation-invariant manner. Furthermore, we utilize a compressed trajectory representation via PCA, which improves model performance and allows for efficient computation of the exact sample log probability. Subsequently, we propose a general constrained sampling framework that enables controlled trajectory sampling based on differentiable cost functions. This strategy enables a host of applications such as enforcing rules and physical priors, or creating tailored simulation scenarios. MotionDiffuser can be combined with existing backbone architectures to achieve top motion forecasting results. We obtain state-of-the-art results for multi-agent motion prediction on the Waymo Open Motion Dataset.
