Driving is a Game: Combining Planning and Prediction with Bayesian Iterative Best Response
Aron Distelzweig, Yiwei Wang, Faris Janjoš, Marcel Hallgarten, Mihai Dobre, Alexander Langmann, Joschka Boedecker, Johannes Betz
TL;DR
BIBeR introduces a principled framework that unifies state-of-the-art motion prediction with game-theoretic planning through Bayesian Iterative Best Response. By re-weighting a diverse set of ego and surrounding trajectories within an IBR loop and modulating updates with a Bayesian confidence score, BIBeR achieves interaction-aware planning that can both react to and influence other drivers. The approach integrates marginals from modern predictors (LAformer) with a sampling-based planner (SPDM) and demonstrates strong gains on interactive benchmarks like interPlan lane-change, as well as robust performance on standard nuPlan tasks. Empirical results show BIBeR and its CV-filtered variant outperform baselines in highly interactive scenarios, with notable improvements in safety-related metrics and planning efficiency, while ablations highlight the importance of update order and the value of early iterations. The work also provides a thorough analysis of predictor choices (marginal vs joint) and runtime implications, arguing for a flexible, modular, and principled interaction-aware planning paradigm with practical potential for real-time deployment.
Abstract
Autonomous driving planning systems perform nearly perfectly in routine scenarios using lightweight, rule-based methods but still struggle in dense urban traffic, where lane changes and merges require anticipating and influencing other agents. Modern motion predictors offer highly accurate forecasts, yet their integration into planning is mostly rudimental: discarding unsafe plans. Similarly, end-to-end models offer a one-way integration that avoids the challenges of joint prediction and planning modeling under uncertainty. In contrast, game-theoretic formulations offer a principled alternative but have seen limited adoption in autonomous driving. We present Bayesian Iterative Best Response (BIBeR), a framework that unifies motion prediction and game-theoretic planning into a single interaction-aware process. BIBeR is the first to integrate a state-of-the-art predictor into an Iterative Best Response (IBR) loop, repeatedly refining the strategies of the ego vehicle and surrounding agents. This repeated best-response process approximates a Nash equilibrium, enabling bidirectional adaptation where the ego both reacts to and shapes the behavior of others. In addition, our proposed Bayesian confidence estimation quantifies prediction reliability and modulates update strength, more conservative under low confidence and more decisive under high confidence. BIBeR is compatible with modern predictors and planners, combining the transparency of structured planning with the flexibility of learned models. Experiments show that BIBeR achieves an 11% improvement over state-of-the-art planners on highly interactive interPlan lane-change scenarios, while also outperforming existing approaches on standard nuPlan benchmarks.
