HALO: High-Altitude Language-Conditioned Monocular Aerial Exploration and Navigation

Yuezhan Tao; Dexter Ong; Fernando Cladera; Jason Hughes; Camillo J. Taylor; Pratik Chaudhari; Vijay Kumar

HALO: High-Altitude Language-Conditioned Monocular Aerial Exploration and Navigation

Yuezhan Tao, Dexter Ong, Fernando Cladera, Jason Hughes, Camillo J. Taylor, Pratik Chaudhari, Vijay Kumar

TL;DR

HALO tackles the challenge of high-altitude, monocular mapping and language-driven exploration by integrating a real-time feed-forward 3D reconstruction backbone with dense language-embedded semantics and a hierarchical planner. The approach maintains open-set metric-semantic maps on-board, fusing GPS priors for scale, performing loop closures, and using frontier-based global planning with a local ATSP-driven exploration strategy. Simulation and real-world experiments show HALO achieves faster task completion and improved competitive ratios compared to baselines, including significant gains in semantic-driven navigation at altitudes around 40 m. Overall, HALO enables flexible, task-driven autonomous missions in large outdoor environments using a lightweight, onboard monocular system.

Abstract

We demonstrate real-time high-altitude aerial metric-semantic mapping and exploration using a monocular camera paired with a global positioning system (GPS) and an inertial measurement unit (IMU). Our system, named HALO, addresses two key challenges: (i) real-time dense 3D reconstruction using vision at large distances, and (ii) mapping and exploration of large-scale outdoor environments with accurate scene geometry and semantics. We demonstrate that HALO can plan informative paths that exploit this information to complete missions with multiple tasks specified in natural language. In simulation-based evaluation across large-scale environments of size up to 78,000 sq. m., HALO consistently completes tasks with less exploration time and achieves up to 68% higher competitive ratio in terms of the distance traveled compared to the state-of-the-art semantic exploration baseline. We use real-world experiments on a custom quadrotor platform to demonstrate that (i) all modules can run onboard the robot, and that (ii) in diverse environments HALO can support effective autonomous execution of missions covering up to 24,600 sq. m. area at an altitude of 40 m. Experiment videos and more details can be found on our project page: https://tyuezhan.github.io/halo/.

HALO: High-Altitude Language-Conditioned Monocular Aerial Exploration and Navigation

TL;DR

Abstract

HALO: High-Altitude Language-Conditioned Monocular Aerial Exploration and Navigation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)