A Flow-based Credibility Metric for Safety-critical Pedestrian Detection
Maria Lyssenko, Christoph Gladisch, Christian Heinzemann, Matthias Woehrle, Rudolph Triebel
TL;DR
The paper tackles safety challenges in automated driving perception by addressing safety agnostic evaluation and introducing c-flow, a flow based credibility metric for pedestrian bounding boxes that leverages optical flow from image sequences. c-flow combines motion cues with bounding box dynamics and uses a sliding window to produce a score that approximates 1 for true positives and 0 for false negatives, with an unsupervised extension using hypothesized bounding boxes. The approach is validated on a large AD dataset, using RetinaNet pretrained on nuImages and optical flow from RAFT, demonstrating strong discrimination between TP and FN and strong correlation to ground truth in an unsupervised setting (rho around 0.83) with approximately 97% FN agreement. The work provides a practical runtime observer for safety critical misdetections and offers dataset insights for safety auditing and potential active learning, with future work on refining hypothesized bounding boxes and integrating into runtime pipelines.
Abstract
Safety is of utmost importance for perception in automated driving (AD). However, a prime safety concern in state-of-the art object detection is that standard evaluation schemes utilize safety-agnostic metrics to argue sufficient detection performance. Hence, it is imperative to leverage supplementary domain knowledge to accentuate safety-critical misdetections during evaluation tasks. To tackle the underspecification, this paper introduces a novel credibility metric, called c-flow, for pedestrian bounding boxes. To this end, c-flow relies on a complementary optical flow signal from image sequences and enhances the analyses of safety-critical misdetections without requiring additional labels. We implement and evaluate c-flow with a state-of-the-art pedestrian detector on a large AD dataset. Our analysis demonstrates that c-flow allows developers to identify safety-critical misdetections.
