Tractable Joint Prediction and Planning over Discrete Behavior Modes for Urban Driving
Adam Villaflor, Brian Yang, Huangyuan Su, Katerina Fragkiadaki, John Dolan, Jeff Schneider
TL;DR
The paper tackles the challenge of integrating multimodal trajectory forecasting with planning in interaction-rich urban driving. It leverages anchor embeddings to parameterize discrete latent modes and performs fully reactive closed-loop planning over these modes, without retraining the forecasting model, enabling tractable autoregressive rollouts. Key contributions include a scalable, mode-space planning approach that scales linearly with horizon and agents, and strong empirical results showing proactive behavior and state-of-the-art performance on challenging CARLA benchmarks, including dynamic merging and Longest6 at realistic speeds. The work advances practical autonomous driving by enabling proactive, interaction-aware decisions in dense traffic, reducing the frozen robot problem and improving safety and efficiency in real-world scenarios.
Abstract
Significant progress has been made in training multimodal trajectory forecasting models for autonomous driving. However, effectively integrating these models with downstream planners and model-based control approaches is still an open problem. Although these models have conventionally been evaluated for open-loop prediction, we show that they can be used to parameterize autoregressive closed-loop models without retraining. We consider recent trajectory prediction approaches which leverage learned anchor embeddings to predict multiple trajectories, finding that these anchor embeddings can parameterize discrete and distinct modes representing high-level driving behaviors. We propose to perform fully reactive closed-loop planning over these discrete latent modes, allowing us to tractably model the causal interactions between agents at each step. We validate our approach on a suite of more dynamic merging scenarios, finding that our approach avoids the $\textit{frozen robot problem}$ which is pervasive in conventional planners. Our approach also outperforms the previous state-of-the-art in CARLA on challenging dense traffic scenarios when evaluated at realistic speeds.
