AgentsCoDriver: Large Language Model Empowered Collaborative Driving with Lifelong Learning
Senkang Hu, Zhengru Fang, Zihan Fang, Yiqin Deng, Xianhao Chen, Yuguang Fang
TL;DR
This work tackles interpretability, generalization, and lifelong learning in connected autonomous driving by introducing AgentsCoDriver, an LLM-powered framework that enables multi-vehicle collaboration and negotiation. It formulates the driving problem as a $D$-POMDP and integrates five modules—observation, cognitive memory, reasoning, reinforcement reflection, and communication—together with an evaluator/reflector loop to support lifelong learning. The approach achieves superior performance over baselines, demonstrates improved learning over time, and enables inter-vehicle negotiation to enhance safety and efficiency in complex traffic. These results highlight the practical potential of LLM-driven, memory-augmented, multi-agent collaboration for future autonomous driving systems.
Abstract
Connected and autonomous driving is developing rapidly in recent years. However, current autonomous driving systems, which are primarily based on data-driven approaches, exhibit deficiencies in interpretability, generalization, and continuing learning capabilities. In addition, the single-vehicle autonomous driving systems lack of the ability of collaboration and negotiation with other vehicles, which is crucial for the safety and efficiency of autonomous driving systems. In order to address these issues, we leverage large language models (LLMs) to develop a novel framework, AgentsCoDriver, to enable multiple vehicles to conduct collaborative driving. AgentsCoDriver consists of five modules: observation module, reasoning engine, cognitive memory module, reinforcement reflection module, and communication module. It can accumulate knowledge, lessons, and experiences over time by continuously interacting with the environment, thereby making itself capable of lifelong learning. In addition, by leveraging the communication module, different agents can exchange information and realize negotiation and collaboration in complex traffic environments. Extensive experiments are conducted and show the superiority of AgentsCoDriver.
