A Safety-Adapted Loss for Pedestrian Detection in Automated Driving
Maria Lyssenko, Piyush Pimplikar, Maarten Bieshaar, Farzad Nozarian, Rudolph Triebel
TL;DR
The paper tackles the problem of safety-critical misses in pedestrian detection for automated driving by introducing a safety-adapted focal loss that uses per-pedestrian criticality derived from motion-domain reachability ($TTC_{RSB}$) and distance to the vehicle. It combines collision risk with distance into a single criticality weight $\kappa_i$ and modulates the focal loss exponent as $\gamma-\kappa_i$, ensuring重点 attention to hazardous pedestrians while preserving overall performance. Evaluations on nuScenes with RetinaNet and FCOS show meaningful recall improvements in safety-critical zones (approximately $2.5$–$3.2$ percentage points) with negligible degradation in general metrics like AP$^{50}$, demonstrating practical gains for AD safety. The work also includes ablations revealing the relative importance of TTC-based versus distance-based criticality and discusses avenues to address false positives and extend the framework to regression tasks.
Abstract
In safety-critical domains like automated driving (AD), errors by the object detector may endanger pedestrians and other vulnerable road users (VRU). As common evaluation metrics are not an adequate safety indicator, recent works employ approaches to identify safety-critical VRU and back-annotate the risk to the object detector. However, those approaches do not consider the safety factor in the deep neural network (DNN) training process. Thus, state-of-the-art DNN penalizes all misdetections equally irrespective of their criticality. Subsequently, to mitigate the occurrence of critical failure cases, i.e., false negatives, a safety-aware training strategy might be required to enhance the detection performance for critical pedestrians. In this paper, we propose a novel safety-aware loss variation that leverages the estimated per-pedestrian criticality scores during training. We exploit the reachability set-based time-to-collision (TTC-RSB) metric from the motion domain along with distance information to account for the worst-case threat quantifying the criticality. Our evaluation results using RetinaNet and FCOS on the nuScenes dataset demonstrate that training the models with our safety-aware loss function mitigates the misdetection of critical pedestrians without sacrificing performance for the general case, i.e., pedestrians outside the safety-critical zone.
