Learning Terrain-Specialized Policies for Adaptive Locomotion in Challenging Environments
Matheus P. Angarola, Francisco Affonso, Marcelo Becker
TL;DR
The paper tackles blind legged locomotion across unstructured terrains by introducing a hierarchical RL framework that employs terrain-specialized policies and curriculum learning. A privileged terrain observer enables a policy selector to route control to the most appropriate expert, while a terrain generator and curriculum expand agility across diverse velocity commands. Experimental results in high-fidelity simulation show significant gains in success rate and velocity-tracking accuracy, especially on low-friction or discontinuous terrains, demonstrating improved robustness over a generalist policy. The work advances practical adaptive locomotion by decomposing the problem into terrain-specific subtasks and providing a systematic training curriculum, with future work focused on removing privileged cues and enabling sim-to-real transfer.
Abstract
Legged robots must exhibit robust and agile locomotion across diverse, unstructured terrains, a challenge exacerbated under blind locomotion settings where terrain information is unavailable. This work introduces a hierarchical reinforcement learning framework that leverages terrain-specialized policies and curriculum learning to enhance agility and tracking performance in complex environments. We validated our method on simulation, where our approach outperforms a generalist policy by up to 16% in success rate and achieves lower tracking errors as the velocity target increases, particularly on low-friction and discontinuous terrains, demonstrating superior adaptability and robustness across mixed-terrain scenarios.
