PPAD: Iterative Interactions of Prediction and Planning for End-to-end Autonomous Driving
Zhili Chen, Maosheng Ye, Shuangjie Xu, Tongyi Cao, Qifeng Chen
TL;DR
PPAD addresses the challenge of integrating prediction and planning in end-to-end autonomous driving by introducing an iterative, timestep-wise interaction between ego and surrounding agents under an autoregressive framework. It interleaves prediction and planning at every future step and employs hierarchical dynamic key objects attention to model ego-agent-environment interactions, including BEV features, map elements, and agent queries. The method trains with noisy trajectories and end-to-end losses, achieving state-of-the-art performance on nuScenes and Argoverse2 with improved L2 accuracy and reduced collision rates. This work highlights the importance of multi-step, bidirectional interaction for safer, more reliable autonomous driving and offers a scalable framework for future BEV-based end-to-end systems.
Abstract
We present a new interaction mechanism of prediction and planning for end-to-end autonomous driving, called PPAD (Iterative Interaction of Prediction and Planning Autonomous Driving), which considers the timestep-wise interaction to better integrate prediction and planning. An ego vehicle performs motion planning at each timestep based on the trajectory prediction of surrounding agents (e.g., vehicles and pedestrians) and its local road conditions. Unlike existing end-to-end autonomous driving frameworks, PPAD models the interactions among ego, agents, and the dynamic environment in an autoregressive manner by interleaving the Prediction and Planning processes at every timestep, instead of a single sequential process of prediction followed by planning. Specifically, we design ego-to-agent, ego-to-map, and ego-to-BEV interaction mechanisms with hierarchical dynamic key objects attention to better model the interactions. The experiments on the nuScenes benchmark show that our approach outperforms state-of-the-art methods.
