Obstacle-Aware Quadrupedal Locomotion With Resilient Multi-Modal Reinforcement Learning
I Made Aswin Nahrendra, Byeongho Yu, Minho Oh, Dongkyu Lee, Seunghyun Lee, Hyeonwoo Lee, Hyungtae Lim, Hyun Myung
TL;DR
The paper tackles robust quadrupedal locomotion in cluttered real-world environments by fusing proprioception with exteroception through a resilient multi-modal reinforcement learning framework. It introduces DreamWaQ++ with a hierarchical exteroceptive memory, a PointNet-based exteroceptive encoder, and a multi-modal mixer that conditions a PPO-based policy, augmented by estimation, VAE, contrastive losses, and a versatility objective to promote diverse skills. Across stairs, slopes, and deformable terrains, the approach delivers superior stair-climbing performance, emergent probing behaviors, and rapid OOD adaptation, all while remaining sensor-agnostic and capable of sim-to-real transfer thanks to domain randomization. The work provides interpretable latent factors that modulate gait and demonstrates potential for integration with higher-level planners and active sensing, advancing practical, autonomous legged locomotion in uncertain environments.
Abstract
Quadrupedal robots hold promising potential for applications in navigating cluttered environments with resilience akin to their animal counterparts. However, their floating base configuration makes them vulnerable to real-world uncertainties, yielding substantial challenges in their locomotion control. Deep reinforcement learning has become one of the plausible alternatives for realizing a robust locomotion controller. However, the approaches that rely solely on proprioception sacrifice collision-free locomotion because they require front-feet contact to detect the presence of stairs to adapt the locomotion gait. Meanwhile, incorporating exteroception necessitates a precisely modeled map observed by exteroceptive sensors over a period of time. Therefore, this work proposes a novel method to fuse proprioception and exteroception featuring a resilient multi-modal reinforcement learning. The proposed method yields a controller that showcases agile locomotion performance on a quadrupedal robot over a myriad of real-world courses, including rough terrains, steep slopes, and high-rise stairs, while retaining its robustness against out-of-distribution situations.
