A Bearing-Angle Approach for Unknown Target Motion Analysis Based on Visual Measurements
Zian Ning, Yin Zhang, Jianan Li, Zhang Chen, Shiyu Zhao
TL;DR
This paper tackles vision-based target motion estimation with a moving monocular camera by introducing a bearing-angle approach that uses bounding-box derived measurements. It augments the target state with its unknown size $\ell$ and recasts bearing and angle measurements into a pseudo-linear form suitable for Kalman filtering, improving observability without needing lateral observer motion. Theoretical analysis shows that the observer’s higher-order motion (e.g., nonzero acceleration) is sufficient for full observability, unlike bearing-only methods, and practical results from Matlab, AirSim, and real-world experiments confirm superior performance in diverse scenarios. The work offers a practical, computation-efficient solution for autonomous pursuit and tracking using standard vision outputs (bounding boxes) while clarifying the limits when $\ell$ varies rapidly.
Abstract
Vision-based estimation of the motion of a moving target is usually formulated as a bearing-only estimation problem where the visual measurement is modeled as a bearing vector. Although the bearing-only approach has been studied for decades, a fundamental limitation of this approach is that it requires extra lateral motion of the observer to enhance the target's observability. Unfortunately, the extra lateral motion conflicts with the desired motion of the observer in many tasks. It is well-known that, once a target has been detected in an image, a bounding box that surrounds the target can be obtained. Surprisingly, this common visual measurement especially its size information has not been well explored up to now. In this paper, we propose a new bearing-angle approach to estimate the motion of a target by modeling its image bounding box as bearing-angle measurements. Both theoretical analysis and experimental results show that this approach can significantly enhance the observability without relying on additional lateral motion of the observer. The benefit of the bearing-angle approach comes with no additional cost because a bounding box is a standard output of object detection algorithms. The approach simply exploits the information that has not been fully exploited in the past. No additional sensing devices or special detection algorithms are required.
