STATE-NAV: Stability-Aware Traversability Estimation for Bipedal Navigation on Rough Terrain
Ziwon Yoon, Lawrence Y. Zhu, Jingxi Lu, Lu Gan, Ye Zhao
TL;DR
This work tackles the stability risk inherent in bipedal navigation on rough terrain by learning a stability predictor (TravFormer) that outputs a BSFA-based instability with uncertainty. Traversability is reframed as the maximum stability-safe command velocity, enabling risk-aware planning within a hierarchical TravRRT* global planner and MPC local controller. The approach unifies learning-based instability estimation with stability-constrained planning, demonstrating improved robustness and time efficiency in MuJoCo simulations and real-world Digit experiments across diverse terrains. The results show superior traversal safety and performance without extensive hyperparameter tuning, highlighting the practical potential for stable humanoid navigation in unstructured environments.
Abstract
Bipedal robots have advantages in maneuvering human-centered environments, but face greater failure risk compared to other stable mobile platforms such as wheeled or quadrupedal robots. While learning-based traversability has been widely studied for these platforms, bipedal traversability has instead relied on manually designed rules with limited consideration of locomotion stability on rough terrain. In this work, we present the first learning-based traversability estimation and risk-sensitive navigation framework for bipedal robots operating in diverse, uneven environments. TravFormer, a transformer-based neural network, is trained to predict bipedal instability with uncertainty, enabling risk-aware and adaptive planning. Based on the network, we define traversability as stability-aware command velocity-the fastest command velocity that keeps instability below a user-defined limit. This velocity-based traversability is integrated into a hierarchical planner that combines traversability-informed Rapid Random Tree Star (TravRRT*) for time-efficient planning and Model Predictive Control (MPC) for safe execution. We validate our method in MuJoCo simulation and the real world, demonstrating improved navigation performance, with enhanced robustness and time efficiency across varying terrains compared to existing methods.
