Do Object Detection Localization Errors Affect Human Performance and Trust?
Sven de Witte, Ombretta Strafforello, Jan van Gemert
TL;DR
This paper investigates whether bounding box localization errors affect human performance and trust when humans use automated object detections. Using an observer-performance paradigm on a visual multi-object counting task, it manipulates localization quality via $IoU$-based shifts and $F1$-score perturbations, and compares standard bounding boxes to center-dot visualizations. The authors find that localization errors in position alone do not significantly impair accuracy or trust, while errors in detection count (false positives/negatives) degrade both performance and trust; center dots can substantially improve counting accuracy and robustness to localization inaccuracies. Practically, the work suggests optimizing $F1$-score rather than IoU for human-in-the-loop systems and demonstrates simple visualization choices can meaningfully boost user performance and resilience to localization errors.
Abstract
Bounding boxes are often used to communicate automatic object detection results to humans, aiding humans in a multitude of tasks. We investigate the relationship between bounding box localization errors and human task performance. We use observer performance studies on a visual multi-object counting task to measure both human trust and performance with different levels of bounding box accuracy. The results show that localization errors have no significant impact on human accuracy or trust in the system. Recall and precision errors impact both human performance and trust, suggesting that optimizing algorithms based on the F1 score is more beneficial in human-computer tasks. Lastly, the paper offers an improvement on bounding boxes in multi-object counting tasks with center dots, showing improved performance and better resilience to localization inaccuracy.
