LGR: LLM-Guided Ranking of Frontiers for Object Goal Navigation
Mitsuaki Uno, Kanji Tanaka, Daiki Iwata, Yudai Noda, Shoya Miyazaki, Kouki Terashima
TL;DR
This work tackles mapless Object Goal Navigation by reframing frontier visit ordering as an LLM-driven ranking problem. The LGR framework uses a view-to-prompt strategy and Reciprocal Rank Features to fuse multi-view observations, enabling zero-shot reasoning about frontier priority within a single view and across updates. A modular system combines Mask R-CNN-based perception, occupancy-grid planning, and A* navigation, guided by LLM-derived subgoals and frontier rankings. Evaluations in Habitat-Sim with HM3D demonstrate that LGR consistently improves exploration efficiency (SPL) over random frontier baselines, highlighting the practical impact of integrating commonsense reasoning into robot exploration. Overall, LGR offers a flexible, generalizable approach to frontier selection in unknown environments, reducing unnecessary travel while maintaining robust object-goal search performance.
Abstract
Object Goal Navigation (OGN) is a fundamental task for robots and AI, with key applications such as mobile robot image databases (MRID). In particular, mapless OGN is essential in scenarios involving unknown or dynamic environments. This study aims to enhance recent modular mapless OGN systems by leveraging the commonsense reasoning capabilities of large language models (LLMs). Specifically, we address the challenge of determining the visiting order in frontier-based exploration by framing it as a frontier ranking problem. Our approach is grounded in recent findings that, while LLMs cannot determine the absolute value of a frontier, they excel at evaluating the relative value between multiple frontiers viewed within a single image using the view image as context. We dynamically manage the frontier list by adding and removing elements, using an LLM as a ranking model. The ranking results are represented as reciprocal rank vectors, which are ideal for multi-view, multi-query information fusion. We validate the effectiveness of our method through evaluations in Habitat-Sim.
