AEGIS: Human Attention-based Explainable Guidance for Intelligent Vehicle Systems

Zhuoli Zhuang; Cheng-You Lu; Yu-Cheng Fred Chang; Yu-Kai Wang; Thomas Do; Chin-Teng Lin

AEGIS: Human Attention-based Explainable Guidance for Intelligent Vehicle Systems

Zhuoli Zhuang, Cheng-You Lu, Yu-Cheng Fred Chang, Yu-Kai Wang, Thomas Do, Chin-Teng Lin

TL;DR

AEGIS addresses interpretability and data efficiency in reinforcement learning for autonomous driving by injecting human attention as guidance into the RL policy via a self-attention module. It leverages a large VR eye-tracking dataset (1.2 million frames from 20 participants across six scenarios) to pre-train a human-attention predictor and applies KL-based regularization to align machine attention with human attention during the first phase of RL training, with an auxiliary TTC-based loss for safety. Empirical results in CARLA across car-following, left-turn, and occlusion scenarios show faster convergence, improved generalization to unseen towns, and higher alignment between machine and human attention, with a user study confirming enhanced interpretability and perceived safety. The work contributes a large, immersive eye-tracking dataset, a human-attention-guided RL framework, and evidence that attention guidance can yield more robust, explainable AIVs with practical impact for safer autonomous systems.

Abstract

Improving decision-making capabilities in Autonomous Intelligent Vehicles (AIVs) has been a heated topic in recent years. Despite advancements, training machines to capture regions of interest for comprehensive scene understanding, like human perception and reasoning, remains a significant challenge. This study introduces a novel framework, Human Attention-based Explainable Guidance for Intelligent Vehicle Systems (AEGIS). AEGIS utilizes human attention, converted from eye-tracking, to guide reinforcement learning (RL) models to identify critical regions of interest for decision-making. AEGIS uses a pre-trained human attention model to guide RL models to identify critical regions of interest for decision-making. By collecting 1.2 million frames from 20 participants across six scenarios, AEGIS pre-trains a model to predict human attention patterns.

AEGIS: Human Attention-based Explainable Guidance for Intelligent Vehicle Systems

TL;DR

Abstract

AEGIS: Human Attention-based Explainable Guidance for Intelligent Vehicle Systems

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (18)