Benchmarking Online Object Trackers for Underwater Robot Position Locking Applications
Ali Safa, Waqas Aman, Ali Al-Zawqari, Saif Al-Kuwari
TL;DR
This work addresses underwater vision-based ROV position locking by benchmarking seven one-shot trackers (Boosting, MIL, MedianFlow, MOSSE, TLD, KCF, CSRT) within a closed perception-control loop using a PI controller. It integrates tracker outputs with real-time control on a BlueROV2 in an indoor pool and provides extensive real-world results across disturbance scenarios, identifying CSRT as the most robust choice for underwater applications. The authors also release an open-source underwater ROV dataset to facilitate future research and benchmarking. Overall, the study demonstrates that CSRT consistently delivers the best trade-off between position accuracy and yaw stability, making it a strong candidate for practical underwater vision-based control systems.
Abstract
Autonomously controlling the position of Remotely Operated underwater Vehicles (ROVs) is of crucial importance for a wide range of underwater engineering applications, such as in the inspection and maintenance of underwater industrial structures. Consequently, studying vision-based underwater robot navigation and control has recently gained increasing attention to counter the numerous challenges faced in underwater conditions, such as lighting variability, turbidity, camera image distortions (due to bubbles), and ROV positional disturbances (due to underwater currents). In this paper, we propose (to the best of our knowledge) a first rigorous unified benchmarking of more than seven Machine Learning (ML)-based one-shot object tracking algorithms for vision-based position locking of ROV platforms. We propose a position-locking system that processes images of an object of interest in front of which the ROV must be kept stable. Then, our proposed system uses the output result of different object tracking algorithms to automatically correct the position of the ROV against external disturbances. We conducted numerous real-world experiments using a BlueROV2 platform within an indoor pool and provided clear demonstrations of the strengths and weaknesses of each tracking approach. Finally, to help alleviate the scarcity of underwater ROV data, we release our acquired data base as open-source with the hope of benefiting future research.
