Table of Contents
Fetching ...

HWC-Loco: A Hierarchical Whole-Body Control Approach to Robust Humanoid Locomotion

Sixu Lin, Guanren Qiao, Yunxin Tai, Ang Li, Kui Jia, Guiliang Liu

TL;DR

HWC-Loco, a robust whole-body control algorithm tailored for humanoid locomotion tasks, is proposed, which can dynamically resolve the trade-off between goal-tracking and safety recovery, guided by human behavior norms and dynamic constraints.

Abstract

Humanoid robots, capable of assuming human roles in various workplaces, have become essential to embodied intelligence. However, as robots with complex physical structures, learning a control model that can operate robustly across diverse environments remains inherently challenging, particularly under the discrepancies between training and deployment environments. In this study, we propose HWC-Loco, a robust whole-body control algorithm tailored for humanoid locomotion tasks. By reformulating policy learning as a robust optimization problem, HWC-Loco explicitly learns to recover from safety-critical scenarios. While prioritizing safety guarantees, overly conservative behavior can compromise the robot's ability to complete the given tasks. To tackle this challenge, HWC-Loco leverages a hierarchical policy for robust control. This policy can dynamically resolve the trade-off between goal-tracking and safety recovery, guided by human behavior norms and dynamic constraints. To evaluate the performance of HWC-Loco, we conduct extensive comparisons against state-of-the-art humanoid control models, demonstrating HWC-Loco's superior performance across diverse terrains, robot structures, and locomotion tasks under both simulated and real-world environments.

HWC-Loco: A Hierarchical Whole-Body Control Approach to Robust Humanoid Locomotion

TL;DR

HWC-Loco, a robust whole-body control algorithm tailored for humanoid locomotion tasks, is proposed, which can dynamically resolve the trade-off between goal-tracking and safety recovery, guided by human behavior norms and dynamic constraints.

Abstract

Humanoid robots, capable of assuming human roles in various workplaces, have become essential to embodied intelligence. However, as robots with complex physical structures, learning a control model that can operate robustly across diverse environments remains inherently challenging, particularly under the discrepancies between training and deployment environments. In this study, we propose HWC-Loco, a robust whole-body control algorithm tailored for humanoid locomotion tasks. By reformulating policy learning as a robust optimization problem, HWC-Loco explicitly learns to recover from safety-critical scenarios. While prioritizing safety guarantees, overly conservative behavior can compromise the robot's ability to complete the given tasks. To tackle this challenge, HWC-Loco leverages a hierarchical policy for robust control. This policy can dynamically resolve the trade-off between goal-tracking and safety recovery, guided by human behavior norms and dynamic constraints. To evaluate the performance of HWC-Loco, we conduct extensive comparisons against state-of-the-art humanoid control models, demonstrating HWC-Loco's superior performance across diverse terrains, robot structures, and locomotion tasks under both simulated and real-world environments.

Paper Structure

This paper contains 28 sections, 14 equations, 19 figures, 16 tables, 1 algorithm.

Figures (19)

  • Figure 1: An example of recovering from a Hard Kick: The humanoid robot withstands external disturbance by automatically detecting hazardous states and adjusting its motion to regain stability.
  • Figure 2: Overview of HWC-Loco: The framework consists of two stages: (a) Training goal-tracking policy to effectively enable human-like locomotion across diverse terrains (Section \ref{['subsec:low-level-Goal-policy']}) and safety recovery policy to recover from safety-ciritical states (i.e., extreme-case) (Section \ref{['subsec:low-level-Recovery-policy']}). (b) Training the high-level planning policy to select between the two pre-trained low-level policies (Section \ref{['subsec:high-level-policy']}), thereby ensuring locomotion stability and consistency.
  • Figure 3: Proportion of goal-tracking policy.
  • Figure 4: Human-like behavior.
  • Figure 5: Visualization of different terrains. From left to right, the terrains are flats, obstacles, slopes, and stairs
  • ...and 14 more figures

Theorems & Definitions (1)

  • Definition 3.1