Personalized Autonomous Driving with Large Language Models: Field Experiments
Can Cui, Zichong Yang, Yupeng Zhou, Yunsheng Ma, Juanwu Lu, Lingxi Li, Yaobin Chen, Jitesh Panchal, Ziran Wang
TL;DR
Talk2Drive addresses the need for natural-language control and long-term personalization in autonomous driving. It uses cloud-based LLMs to translate verbal commands into executable Language Model Programs (LMPs), drawing on real-time context and a memory module that stores past interactions for personalization. Field experiments in highway, intersection, and parking scenarios with a Lexus RX450h show reduced takeover rates while maintaining safety and comfort; the memory module lowers the takeover rate by up to a further 65.2% compared with the system without memory. The work demonstrates effective interpretation of both direct and indirect commands and outlines future work to reduce LLM latency, e.g., via model distillation.
Abstract
Integrating large language models (LLMs) into autonomous vehicles lets occupants converse with the AI system that drives the vehicle. However, it also requires such systems to comprehend commands accurately and to achieve a higher level of personalization, adapting to the preferences of drivers or passengers over an extended period. In this paper, we introduce an LLM-based framework, Talk2Drive, capable of translating natural verbal commands into executable controls and learning to satisfy personal preferences for safety, efficiency, and comfort with a proposed memory module. This is the first-of-its-kind multi-scenario field experiment that deploys LLMs on a real-world autonomous vehicle. Experiments show that the proposed system can comprehend human intentions at different intuition levels, ranging from direct commands like "can you drive faster" to indirect commands like "I am really in a hurry now". Additionally, we use the takeover rate to quantify the trust of human drivers in the LLM-based autonomous driving system; Talk2Drive significantly reduces the takeover rate in highway, intersection, and parking scenarios. We also validate that the proposed memory module accounts for personalized preferences and further reduces the takeover rate by up to 65.2% compared with the system without a memory module. The experiment video can be watched at https://www.youtube.com/watch?v=4BWsfPaq1Ro
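The flow summarized above (a verbal command, real-time context, and remembered interactions combined into an LLM prompt, with the takeover rate as the evaluation metric) can be illustrated with a minimal Python sketch. It is an illustration only, not the Talk2Drive implementation: `MemoryModule`, `build_prompt`, and `relative_reduction` are hypothetical names, the prompt layout is an assumption, and no actual LLM call or vehicle control is made.

```python
from dataclasses import dataclass, field


@dataclass
class MemoryModule:
    """Hypothetical per-user store of past (command, generated program) pairs."""

    history: dict = field(default_factory=dict)

    def remember(self, user_id: str, command: str, program: str) -> None:
        # Append the interaction to this user's history.
        self.history.setdefault(user_id, []).append((command, program))

    def recall(self, user_id: str, k: int = 3) -> list:
        # Return up to the k most recent interactions for this user.
        return self.history.get(user_id, [])[-k:]


def build_prompt(command: str, context: dict, past: list) -> str:
    """Assemble an LLM prompt from command, context, and memory (assumed layout)."""
    lines = ["Context:"]
    lines += [f"  {key}: {value}" for key, value in sorted(context.items())]
    lines.append("Past interactions:")
    lines += [f"  {cmd!r} -> {prog}" for cmd, prog in past]
    lines.append(f"Command: {command}")
    lines.append("Respond with an executable driving program.")
    return "\n".join(lines)


def relative_reduction(baseline_rate: float, new_rate: float) -> float:
    """Relative reduction of a takeover rate with respect to a baseline."""
    if baseline_rate <= 0:
        raise ValueError("baseline_rate must be positive")
    return (baseline_rate - new_rate) / baseline_rate
```

In this sketch, a caller would send the string returned by `build_prompt` to an LLM and store the returned program with `remember`, so that later prompts for the same user carry their history. `relative_reduction` shows one way to read a figure such as "reduces the takeover rate by up to 65.2%": as a reduction relative to the no-memory baseline. That reading is an assumption here, since the abstract does not define the computation.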
