Introducing a Class-Aware Metric for Monocular Depth Estimation: An Automotive Perspective
Tim Bader, Leon Eisemann, Adrian Pogorzelski, Namrata Jangid, Attila-Balazs Kis
TL;DR
This work addresses the need for interpretable, safety-oriented evaluation of monocular depth estimation in automotive settings by introducing a novel Class-Aware Multi-Component Depth Estimation metric. The metric combines a class-based component with intra- and inter-class weighting, a local feature component focused on edges and corners, and a global consistency component, aggregating to $L = \gamma(E_{\text{class}} + E_{\text{feature}} + E_{\text{global}})$ with $\gamma = 1$, where $E_{\text{class}}$, $E_{\text{feature}}$, and $E_{\text{global}}$ are defined via MAE-based formulations. Through evaluation on the GOOSE dataset and a wide set of SOTA models, the approach yields per-class insights (e.g., safety-critical traffic signs) and identifies challenging scenarios beyond what classical metrics reveal, enabling more informed model selection for safety-critical deployments. The work demonstrates the practical utility of its metric for safety assessments and proposes concrete avenues for future enhancements, including improved labeling, distribution awareness, and cross-dataset applicability.
Abstract
The increasing accuracy reports of metric monocular depth estimation models lead to a growing interest from the automotive domain. Current model evaluations do not provide deeper insights into the models' performance, also in relation to safety-critical or unseen classes. Within this paper, we present a novel approach for the evaluation of depth estimation models. Our proposed metric leverages three components, a class-wise component, an edge and corner image feature component, and a global consistency retaining component. Classes are further weighted on their distance in the scene and on criticality for automotive applications. In the evaluation, we present the benefits of our metric through comparison to classical metrics, class-wise analytics, and the retrieval of critical situations. The results show that our metric provides deeper insights into model results while fulfilling safety-critical requirements. We release the code and weights on the following repository: https://github.com/leisemann/ca_mmde
