ZeST: an LLM-based Zero-Shot Traversability Navigation for Unknown Environments
Shreya Gummadi, Mateus V. Gasparino, Gianluca Capezzuto, Marcelo Becker, Girish Chowdhary
TL;DR
ZeST tackles the challenge of traversability prediction in unknown environments by using multimodal LLMs to infer region-wise traversability without physical data collection. It fuses LLM-derived samples into a probabilistic map via Normal-Inverse-Gamma modeling, applies CVaR-based risk assessment, and employs traversability-aware RRT*-based planning and MPPI control. The approach yields a global traversability map that supports zero-shot navigation with improved safety and success rates, validated in real indoor and outdoor scenarios against baselines. By combining LLM reasoning, Bayesian uncertainty modeling, and risk-aware planning, ZeST offers a scalable, data-efficient solution for robust autonomous navigation in unknown terrains.
Abstract
The advancement of robotics and autonomous navigation systems hinges on the ability to accurately predict terrain traversability. Traditional methods for generating datasets to train these prediction models often involve putting robots into potentially hazardous environments, posing risks to equipment and safety. To solve this problem, we present ZeST, a novel approach leveraging visual reasoning capabilities of Large Language Models (LLMs) to create a traversability map in real-time without exposing robots to danger. Our approach not only performs zero-shot traversability and mitigates the risks associated with real-world data collection but also accelerates the development of advanced navigation systems, offering a cost-effective and scalable solution. To support our findings, we present navigation results, in both controlled indoor and unstructured outdoor environments. As shown in the experiments, our method provides safer navigation when compared to other state-of-the-art methods, constantly reaching the final goal.
