Incorporating System-level Safety Requirements in Perception Models via Reinforcement Learning
Weisi Fan, Jesse Lane, Qisai Liu, Soumik Sarkar, Tichakorn Wongpiromsarn
TL;DR
This work addresses the gap where standard perception losses neglect downstream safety in autonomous systems. It proposes a reinforcement learning framework that injects system-level safety objectives by translating rulebook specifications into safety scores used as rewards to fine-tune probabilistic perception models. A hybrid reward combines IoU-based perception performance with rule-based safety violations, enabling training in non-differentiable simulations like CARLA and integrating depth with pixel outputs from a PIX2SEQ detector. Experimental results show improved alignment between perception and safety objectives, better focus on prioritized, safety-relevant objects, and robust performance under varied weather conditions, demonstrating the practical potential of safety-aware perception training.
Abstract
Perception components in autonomous systems are often developed and optimized independently of downstream decision-making and control components, relying on established performance metrics like accuracy, precision, and recall. Traditional loss functions, such as cross-entropy loss and negative log-likelihood, focus on reducing misclassification errors but fail to consider their impact on system-level safety, overlooking the varying severities of system-level failures caused by these errors. To address this limitation, we propose a novel training paradigm that augments the perception component with an understanding of system-level safety objectives. Central to our approach is the translation of system-level safety requirements, formally specified using the rulebook formalism, into safety scores. These scores are then incorporated into the reward function of a reinforcement learning framework for fine-tuning perception models with system-level safety objectives. Simulation results demonstrate that models trained with this approach outperform baseline perception models in terms of system-level safety.
