Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving
Jianbiao Mei, Yukai Ma, Xuemeng Yang, Licheng Wen, Xinyu Cai, Xin Li, Daocheng Fu, Bo Zhang, Pinlong Cai, Min Dou, Botian Shi, Liang He, Yong Liu, Yu Qiao
TL;DR
LeapAD introduces a dual-process closed-loop autonomous driving framework that mimics human cognition by coupling a scene-focused Vision-Language Model with a slow Analytic Process and a fast Heuristic Process. A transferable memory bank and a reflection mechanism enable continuous self-improvement in a CARLA-based environment, achieving data-efficient learning and strong performance improvements over camera-only baselines. The Analytic Process leverages world knowledge through an LLM to accumulate high-quality driving experiences, which are distilled into the Heuristic Process via supervised fine-tuning and few-shot prompting to enable rapid edge-deployed decisions. Empirical results show LeapAD surpasses several baselines with only $11{,}000$ fine-tuning examples for VLM and a memory bank of up to $18{,}000$ samples, achieving a driving score of $DS=83.11$ in Town05 while demonstrating robust cross-town generalization and continuous improvement through reflection.
Abstract
Autonomous driving has advanced significantly due to sensors, machine learning, and artificial intelligence improvements. However, prevailing methods struggle with intricate scenarios and causal relationships, hindering adaptability and interpretability in varied environments. To address the above problems, we introduce LeapAD, a novel paradigm for autonomous driving inspired by the human cognitive process. Specifically, LeapAD emulates human attention by selecting critical objects relevant to driving decisions, simplifying environmental interpretation, and mitigating decision-making complexities. Additionally, LeapAD incorporates an innovative dual-process decision-making module, which consists of an Analytic Process (System-II) for thorough analysis and reasoning, along with a Heuristic Process (System-I) for swift and empirical processing. The Analytic Process leverages its logical reasoning to accumulate linguistic driving experience, which is then transferred to the Heuristic Process by supervised fine-tuning. Through reflection mechanisms and a growing memory bank, LeapAD continuously improves itself from past mistakes in a closed-loop environment. Closed-loop testing in CARLA shows that LeapAD outperforms all methods relying solely on camera input, requiring 1-2 orders of magnitude less labeled data. Experiments also demonstrate that as the memory bank expands, the Heuristic Process with only 1.8B parameters can inherit the knowledge from a GPT-4 powered Analytic Process and achieve continuous performance improvement. Project page: https://pjlab-adg.github.io/LeapAD.
