Approximate Supervised Object Distance Estimation on Unmanned Surface Vehicles
Benjamin Kiefer, Yitong Quan, Andreas Zell
TL;DR
The paper addresses the need for cost-effective distance estimation on USVs by using supervised object detection to predict object distances directly from monocular imagery. It adapts YOLO detectors with an auxiliary distance head, explores multiple distance normalization schemes, and trains with a composite loss, using a maritime dataset (1,000 images with bounding boxes and chart-derived distances) plus additional human-labeled data. In extensive experiments, the method achieves competitive object detection performance and shows that distance estimation is feasible in real time, with lower mean distance error than triangulation and monocular depth baselines, particularly when combined with tracking and smoothing. The work provides a practical, end-to-end vision-based distance estimation approach for USVs, with publicly released data and clear guidance on the trade-offs between distance accuracy and detection performance across varying camera setups and distances.
Abstract
Unmanned surface vehicles (USVs) and boats are increasingly important in maritime operations, yet their deployment is limited by sensor cost and system complexity. LiDAR, radar, and depth cameras are either costly or yield sparse or noisy measurements, and they require extensive calibration. Here, we introduce a novel approach for approximate distance estimation on USVs using supervised object detection. We collected a dataset comprising images with manually annotated bounding boxes and corresponding distance measurements. Leveraging this data, we add a specialized branch to an object detection model so that it not only detects objects but also predicts their distances from the USV. This method offers a cost-efficient and intuitive alternative to conventional distance measurement techniques, aligning more closely with human estimation capabilities. We demonstrate its application in a marine assistance system that alerts operators to nearby objects such as boats, buoys, or other waterborne hazards.
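To make the idea of a distance branch trained alongside detection more concrete, the following is a minimal sketch of how target distances might be normalized and how a distance term might be added to a detection loss. It is an illustration under stated assumptions, not the authors' implementation: the cap `MAX_DISTANCE_M`, the linear and logarithmic schemes, the L1 distance term, and the weight `distance_weight` are all hypothetical choices, and the detection loss is treated as a given number.

```python
import math

# Hypothetical cap on the distance range (metres); not taken from the paper.
MAX_DISTANCE_M = 1000.0


def normalize_linear(distance_m, max_distance_m=MAX_DISTANCE_M):
    """Clip a metric distance to [0, cap] and scale it linearly to [0, 1]."""
    clipped = min(max(distance_m, 0.0), max_distance_m)
    return clipped / max_distance_m


def normalize_log(distance_m, max_distance_m=MAX_DISTANCE_M):
    """Scale a metric distance to [0, 1] on a log axis.

    Compared with the linear scheme, this spends more of the target
    range on nearby objects (an assumed design motivation).
    """
    clipped = min(max(distance_m, 0.0), max_distance_m)
    return math.log1p(clipped) / math.log1p(max_distance_m)


def denormalize_log(value, max_distance_m=MAX_DISTANCE_M):
    """Invert normalize_log to recover a distance in metres."""
    return math.expm1(value * math.log1p(max_distance_m))


def composite_loss(detection_loss, predicted, targets, distance_weight=1.0):
    """Detection loss plus a weighted mean absolute error on normalized distances.

    `predicted` and `targets` are equal-length lists of normalized distances
    for the matched objects in one batch.
    """
    if not targets:
        return detection_loss  # no matched objects: distance term is zero
    l1 = sum(abs(p - t) for p, t in zip(predicted, targets)) / len(targets)
    return detection_loss + distance_weight * l1
```

At inference time, a predicted normalized value would be mapped back to metres with the inverse of whichever scheme was used in training, for example `denormalize_log(0.8)`. The weight on the distance term is one place where the trade-off between distance accuracy and detection performance could be tuned.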
