Enhancing Safety for Autonomous Agents in Partly Concealed Urban Traffic Environments Through Representation-Based Shielding
Pierre Haritz, David Wanke, Thomas Liebig
TL;DR
The paper addresses safe navigation through partly concealed urban intersections using reinforcement learning. It couples an ego-centric invariant state representation (IER+) with a post-posed safety shield for Deep Q-Learning, leveraging time-to-occupancy $\mathrm{TTO}$ and time-to-vacancy $\mathrm{TTV}$ features and LTL-based safety constraints. The approach yields notable safety gains and robust generalization to unseen road maps while maintaining competitive travel speed and energy efficiency, demonstrated in a dedicated PyAD-RL simulator. This work provides a scalable, reproducible framework for safer autonomous urban navigation under partial observability and occlusions.
Abstract
Navigating unsignalized intersections in urban environments poses a complex challenge for self-driving vehicles, where issues such as view obstructions, unpredictable pedestrian crossings, and diverse traffic participants demand a great focus on crash prevention. In this paper, we propose a novel state representation for Reinforcement Learning (RL) agents centered around the information perceivable by an autonomous agent, enabling the safe navigation of previously uncharted road maps. Our approach surpasses several baseline models by a sig nificant margin in terms of safety and energy consumption metrics. These improvements are achieved while maintaining a competitive average travel speed. Our findings pave the way for more robust and reliable autonomous navigation strategies, promising safer and more efficient urban traffic environments.
