Interpretable Decision-Making for End-to-End Autonomous Driving
Mona Mirzaie, Bodo Rosenhahn
TL;DR
The paper addresses the opacity of end-to-end autonomous driving by introducing a diversity-based regularizer that yields sparse, localized activation maps, enabling interpretable decision-making. Integrated into a TCP-inspired framework (DTCP), the method improves interpretability and safety without ensembles or traffic-rule sub-tasks, achieving competitive or state-of-the-art route completion on CARLA benchmarks with a monocular camera. Key contributions include the diversity loss formulation, extensive ablations, and interpretability evaluations (IoU, GTC, SC, and saliency correlations) that link visual explanations to driving performance. The results suggest that promoting feature diversity enhances both transparency and practical effectiveness, supporting scalable deployment of safer end-to-end autonomous driving systems.
Abstract
Trustworthy AI is mandatory for the broad deployment of autonomous vehicles. Although end-to-end approaches derive control commands directly from raw data, interpreting these decisions remains challenging, especially in complex urban scenarios. This is mainly attributed to very deep neural networks with non-linear decision boundaries, making it challenging to grasp the logic behind AI-driven decisions. This paper presents a method to enhance interpretability while optimizing control commands in autonomous driving. To address this, we propose loss functions that promote the interpretability of our model by generating sparse and localized feature maps. The feature activations allow us to explain which image regions contribute to the predicted control command. We conduct comprehensive ablation studies on the feature extraction step and validate our method on the CARLA benchmarks. We also demonstrate that our approach improves interpretability, which correlates with reducing infractions, yielding a safer, high-performance driving model. Notably, our monocular, non-ensemble model surpasses the top-performing approaches from the CARLA Leaderboard by achieving lower infraction scores and the highest route completion rate, all while ensuring interpretability.
