A2VISR: An Active and Adaptive Ground-Aerial Localization System Using Visual Inertial and Single-Range Fusion
Sijia Chen, Wei Dong
TL;DR
<3-5 sentence high-level summary> The paper tackles robust ground-aerial localization for flying robots in cluttered and degraded environments. It introduces A2VISR, a framework that fuses active ground-based vision with single-range, visual-inertial, and optical-flow data, complemented by an adaptive sliding-confidence mechanism and a dimension-reduced sliding-window estimator. The active vision subsystem expands the observation field and improves re-capture reliability, while the adaptive weighting handles sensor degradation in real time. Experimental results show an average RMSE around 0.09 m across diverse scenarios and strong resilience to visual loss and long-range operation, underscoring practical viability for autonomous aerial localization with minimal infrastructure.
Abstract
It's a practical approach using the ground-aerial collaborative system to enhance the localization robustness of flying robots in cluttered environments, especially when visual sensors degrade. Conventional approaches estimate the flying robot's position using fixed cameras observing pre-attached markers, which could be constrained by limited distance and susceptible to capture failure. To address this issue, we improve the ground-aerial localization framework in a more comprehensive manner, which integrates active vision, single-ranging, inertial odometry, and optical flow. First, the designed active vision subsystem mounted on the ground vehicle can be dynamically rotated to detect and track infrared markers on the aerial robot, improving the field of view and the target recognition with a single camera. Meanwhile, the incorporation of single-ranging extends the feasible distance and enhances re-capture capability under visual degradation. During estimation, a dimension-reduced estimator fuses multi-source measurements based on polynomial approximation with an extended sliding window, balancing computational efficiency and redundancy. Considering different sensor fidelities, an adaptive sliding confidence evaluation algorithm is implemented to assess measurement quality and dynamically adjust the weighting parameters based on moving variance. Finally, extensive experiments under conditions such as smoke interference, illumination variation, obstacle occlusion, prolonged visual loss, and extended operating range demonstrate that the proposed approach achieves robust online localization, with an average root mean square error of approximately 0.09 m, while maintaining resilience to capture loss and sensor failures.
