CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous Driving

Enhui Ma; Lijun Zhou; Tao Tang; Jiahuan Zhang; Junpeng Jiang; Zhan Zhang; Dong Han; Kun Zhan; Xueyang Zhang; XianPeng Lang; Haiyang Sun; Xia Zhou; Di Lin; Kaicheng Yu

CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous Driving

Enhui Ma, Lijun Zhou, Tao Tang, Jiahuan Zhang, Junpeng Jiang, Zhan Zhang, Dong Han, Kun Zhan, Xueyang Zhang, XianPeng Lang, Haiyang Sun, Xia Zhou, Di Lin, Kaicheng Yu

TL;DR

This work tackles robustness gaps in end-to-end autonomous driving caused by long-tail failure cases. It presents CorrectAD, a self-correcting agentic system that couples PM-Agent for failure analysis with DriveSora for BEV-conditioned diffusion-based video generation to augment training data and strengthen planners. Demonstrated across nuScenes and a large in-house dataset, CorrectAD yields substantial improvements in L2 error and collision rates while maintaining model-agnostic applicability to different E2E planners. The approach shows that targeted, multimodal data generation guided by failure analysis can provide sustainable, scalable improvements to end-to-end planning in autonomous driving.

Abstract

End-to-end planning methods are the de facto standard of the current autonomous driving system, while the robustness of the data-driven approaches suffers due to the notorious long-tail problem (i.e., rare but safety-critical failure cases). In this work, we explore whether recent diffusion-based video generation methods (a.k.a. world models), paired with structured 3D layouts, can enable a fully automated pipeline to self-correct such failure cases. We first introduce an agent to simulate the role of product manager, dubbed PM-Agent, which formulates data requirements to collect data similar to the failure cases. Then, we use a generative model that can simulate both data collection and annotation. However, existing generative models struggle to generate high-fidelity data conditioned on 3D layouts. To address this, we propose DriveSora, which can generate spatiotemporally consistent videos aligned with the 3D annotations requested by PM-Agent. We integrate these components into our self-correcting agentic system, CorrectAD. Importantly, our pipeline is an end-to-end model-agnostic and can be applied to improve any end-to-end planner. Evaluated on both nuScenes and a more challenging in-house dataset across multiple end-to-end planners, CorrectAD corrects 62.5% and 49.8% of failure cases, reducing collision rates by 39% and 27%, respectively.

CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous Driving

TL;DR

Abstract

CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous Driving

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (21)