Enhancing End-to-End Autonomous Driving Systems Through Synchronized Human Behavior Data
Yiqun Duan, Zhuoli Zhuang, Jinzhao Zhou, Yu-Cheng Chang, Yu-Kai Wang, Chin-Teng Lin
TL;DR
The paper tackles the problem of limited generalization and trust in End-to-End autonomous driving by introducing synchronized human-machine driving data, including eye-tracking and EEG signals, to guide learning. It proposes a Hybrid Fusion Transformer Encoder with Monotonic-to-BEV Translation and a Decision Transformer, augmented with human-guidance headers for eye-tracking and intention signals, trained via a joint loss that combines perception, planning, and supervision terms. Experimental results in CARLA show that human eye-tracking guidance improves Driving Score, while brainwave-based intention guidance does not yield immediate gains due to signal noise and alignment issues, illustrating both the potential and current challenges of human-guided autonomy. The approach highlights a concrete pathway toward more robust and trustworthy autonomous systems through multimodal human supervision, while identifying key technical hurdles to overcome for scalable adoption.
Abstract
This paper presents a pioneering exploration into the integration of fine-grained human supervision within the autonomous driving domain to enhance system performance. The current advances in End-to-End autonomous driving normally are data-driven and rely on given expert trials. However, this reliance limits the systems' generalizability and their ability to earn human trust. Addressing this gap, our research introduces a novel approach by synchronously collecting data from human and machine drivers under identical driving scenarios, focusing on eye-tracking and brainwave data to guide machine perception and decision-making processes. This paper utilizes the Carla simulation to evaluate the impact brought by human behavior guidance. Experimental results show that using human attention to guide machine attention could bring a significant improvement in driving performance. However, guidance by human intention still remains a challenge. This paper pioneers a promising direction and potential for utilizing human behavior guidance to enhance autonomous systems.
