Dynamic Legged Ball Manipulation on Rugged Terrains with Hierarchical Reinforcement Learning

Dongjie Zhu; Zhuo Yang; Tianhang Wu; Luzhou Ge; Xuesong Li; Qi Liu; Xiang Li

Dynamic Legged Ball Manipulation on Rugged Terrains with Hierarchical Reinforcement Learning

Dongjie Zhu, Zhuo Yang, Tianhang Wu, Luzhou Ge, Xuesong Li, Qi Liu, Xiang Li

TL;DR

This work tackles dynamic ball manipulation by quadruped robots on rugged terrains through a hierarchical reinforcement learning framework. A high-level policy coordinates four low-level skills (two dribbling and two locomotion) via a context-aware estimator to adapt to terrain and ball state. The authors introduce Dynamic Skill-Focused Policy Optimization (DSF-PO) to address learning inefficiencies arising from mixed discrete-continuous actions, demonstrating superior learning and cross-terrain performance in simulation and zero-shot transfer to real hardware. Results show improved terrain traversability and robust cross-terrain dribbling, highlighting significant potential for autonomous legged loco-manipulation in disaster response and robot sports. The approach advances practical cross-terrain manipulation by enabling flexible skill switching and more efficient policy optimization under real-world constraints.

Abstract

Advancing the dynamic loco-manipulation capabilities of quadruped robots in complex terrains is crucial for performing diverse tasks. Specifically, dynamic ball manipulation in rugged environments presents two key challenges. The first is coordinating distinct motion modalities to integrate terrain traversal and ball control seamlessly. The second is overcoming sparse rewards in end-to-end deep reinforcement learning, which impedes efficient policy convergence. To address these challenges, we propose a hierarchical reinforcement learning framework. A high-level policy, informed by proprioceptive data and ball position, adaptively switches between pre-trained low-level skills such as ball dribbling and rough terrain navigation. We further propose Dynamic Skill-Focused Policy Optimization to suppress gradients from inactive skills and enhance critical skill learning. Both simulation and real-world experiments validate that our methods outperform baseline approaches in dynamic ball manipulation across rugged terrains, highlighting its effectiveness in challenging environments. Videos are on our website: dribble-hrl.github.io.

Dynamic Legged Ball Manipulation on Rugged Terrains with Hierarchical Reinforcement Learning

TL;DR

Abstract

Dynamic Legged Ball Manipulation on Rugged Terrains with Hierarchical Reinforcement Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)