Blind Bipedal Stair Traversal via Sim-to-Real Reinforcement Learning
Jonah Siekmann, Kevin Green, John Warila, Alan Fern, Jonathan Hurst
TL;DR
This work demonstrates that a blind bipedal robot (Cassie) can robustly traverse stair-like terrain using sim-to-real reinforcement learning with proprioceptive feedback alone. By adding stair-like terrain randomization to an otherwise flat-ground RL framework, the authors train memory-enabled policies (LSTM) that handle unknown stairs without exteroceptive sensing. The results show strong simulation performance and substantial real-world viability, with notable insights into swing-foot behavior and ground reaction forces that underlie robust disturbance rejection. The study also reveals energy-efficiency trade-offs and confirms the practical potential of proprioception-driven stair traversal in real environments, suggesting future integration with vision for efficiency gains.
Abstract
Accurate and precise terrain estimation is a difficult problem for robot locomotion in real-world environments. Thus, it is useful to have systems that do not depend on accurate estimation to the point of fragility. In this paper, we explore the limits of such an approach by investigating the problem of traversing stair-like terrain without any external perception or terrain models on a bipedal robot. For such blind bipedal platforms, the problem appears difficult (even for humans) due to the surprise elevation changes. Our main contribution is to show that sim-to-real reinforcement learning (RL) can achieve robust locomotion over stair-like terrain on the bipedal robot Cassie using only proprioceptive feedback. Importantly, this only requires modifying an existing flat-terrain training RL framework to include stair-like terrain randomization, without any changes in reward function. To our knowledge, this is the first controller for a bipedal, human-scale robot capable of reliably traversing a variety of real-world stairs and other stair-like disturbances using only proprioception.
