ATLAS Navigator: Active Task-driven LAnguage-embedded Gaussian Splatting
Dexter Ong, Yuezhan Tao, Varun Murali, Igor Spasojevic, Vijay Kumar, Pratik Chaudhari
TL;DR
The paper tackles task-directed navigation in unknown, unstructured environments by introducing a hierarchical, language-embedded Gaussian splatting map that jointly yields sparse semantic planning and dense geometric representation for collision-free navigation. It couples bottom-up mapping with language embeddings to create a memory-efficient, submap-based structure and a top-down two-stage planner: a discrete planner selects high-utility vantage points, while a continuous planner generates dynamically feasible trajectories under collision constraints. Task specifications and completions are driven by natural-language prompts and vision-language models, enabling open-set, re-specifiable objectives and termination checks. Real-world indoor and outdoor experiments demonstrate large-scale map construction with over a million Gaussians, competitive performance against privileged baselines, and robust open-vocabulary semantic retrieval, highlighting practical applicability for scalable, language-guided navigation.
Abstract
We address the challenge of task-oriented navigation in unstructured and unknown environments, where robots must incrementally build and reason on rich, metric-semantic maps in real time. Since tasks may require clarification or re-specification, it is necessary for the information in the map to be rich enough to enable generalization across a wide range of tasks. To effectively execute tasks specified in natural language, we propose a hierarchical representation built on language-embedded Gaussian splatting that enables both sparse semantic planning that lends itself to online operation and dense geometric representation for collision-free navigation. We validate the effectiveness of our method through real-world robot experiments conducted in both cluttered indoor and kilometer-scale outdoor environments, with a competitive ratio of about 60% against privileged baselines. Experiment videos and more details can be found on our project page: https://atlasnav.github.io
