AINav: Large Language Model-Based Adaptive Interactive Navigation

Kangjie Zhou; Yao Mu; Haoyang Song; Yi Zeng; Pengying Wu; Han Gao; Chang Liu

AINav: Large Language Model-Based Adaptive Interactive Navigation

Kangjie Zhou, Yao Mu, Haoyang Song, Yi Zeng, Pengying Wu, Han Gao, Chang Liu

TL;DR

AINav tackles the challenge of navigating in unknown, interactive environments where no feasible path exists a priori. It combines an LLM-driven primitive skill tree for robust task planning, a reinforcement-learning-based skill library for robust motion and interaction, and an adaptive replanning mechanism with an advisor and arborist to rapidly adjust plans as new observations arrive. The approach enables proactive interaction with environmental objects to create viable routes, demonstrated through extensive simulations and real-world tests where the quadruped robot uses tools and adjusts plans in response to incremental information. The results show superior success rates and shorter completion times compared with baselines, highlighting the practical potential of integrating foundation-model reasoning with learned skills for long-horizon, tool-using robotic navigation in cluttered settings.

Abstract

Robotic navigation in complex environments remains a critical research challenge. Traditional navigation methods focus on optimal trajectory generation within fixed free workspace, therefore struggling in environments lacking viable paths to the goal, such as disaster zones or cluttered warehouses. To address this problem, we propose AINav, an adaptive interactive navigation approach that proactively interacts with environments to create feasible paths to achieve originally unreachable goals. Specifically, we present a primitive skill tree for task planning with large language models (LLMs), facilitating effective reasoning to determine interaction objects and sequences. To ensure robust subtask execution, we adopt reinforcement learning to pre-train a comprehensive skill library containing versatile locomotion and interaction behaviors for motion planning. Furthermore, we introduce an adaptive replanning approach featuring two LLM-based modules: an advisor serving as a flexible replanning trigger and an arborist for autonomous plan adjustment. Integrated with the tree structure, the replanning mechanism allows for convenient node addition and pruning, enabling rapid plan adaptation in a priori unknown environments. Comprehensive simulations and experiments have demonstrated AINav's effectiveness and adaptivity in diverse scenarios. The supplementary video is available at: https://youtu.be/CjXm5KFx9AI.

AINav: Large Language Model-Based Adaptive Interactive Navigation

TL;DR

Abstract

AINav: Large Language Model-Based Adaptive Interactive Navigation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (12)