Thor: Towards Human-Level Whole-Body Reactions for Intense Contact-Rich Environments

Gangyang Li; Qing Shi; Youhao Hu; Jincheng Hu; Zhongyuan Wang; Xinlong Wang; Shaqi Luo

Thor: Towards Human-Level Whole-Body Reactions for Intense Contact-Rich Environments

Gangyang Li, Qing Shi, Youhao Hu, Jincheng Hu, Zhongyuan Wang, Xinlong Wang, Shaqi Luo

TL;DR

Thor tackles the challenge of enabling humanoids to perform high-intensity force interactions by decoupling the whole-body control into upper, waist, and lower policies, guided by a FAT2 reward anchored in ZMP-based force equilibria. It employs a three-agent PPO-based RL framework with privileged critic information, a two-stage curriculum, and domain randomization to bridge the sim-to-real gap, validated on the Unitree G1 where it substantially outperforms baselines in pulling tasks and door-opening scenarios. The combination of FAT2 and a decoupled architecture addresses both force amplification and high-dimensionality, enabling real-time inference on onboard resources. This work advances humanoid robustness in contact-rich tasks and provides a practical path toward human-level whole-body reactions in unstructured environments.

Abstract

Humanoids hold great potential for service, industrial, and rescue applications, in which robots must sustain whole-body stability while performing intense, contact-rich interactions with the environment. However, enabling humanoids to generate human-like, adaptive responses under such conditions remains a major challenge. To address this, we propose Thor, a humanoid framework for human-level whole-body reactions in contact-rich environments. Based on the robot's force analysis, we design a force-adaptive torso-tilt (FAT2) reward function to encourage humanoids to exhibit human-like responses during force-interaction tasks. To mitigate the high-dimensional challenges of humanoid control, Thor introduces a reinforcement learning architecture that decouples the upper body, waist, and lower body. Each component shares global observations of the whole body and jointly updates its parameters. Finally, we deploy Thor on the Unitree G1, and it substantially outperforms baselines in force-interaction tasks. Specifically, the robot achieves a peak pulling force of 167.7 N (approximately 48% of the G1's body weight) when moving backward and 145.5 N when moving forward, representing improvements of 68.9% and 74.7%, respectively, compared with the best-performing baseline. Moreover, Thor is capable of pulling a loaded rack (130 N) and opening a fire door with one hand (60 N). These results highlight Thor's effectiveness in enhancing humanoid force-interaction capabilities.

Thor: Towards Human-Level Whole-Body Reactions for Intense Contact-Rich Environments

TL;DR

Abstract

Thor: Towards Human-Level Whole-Body Reactions for Intense Contact-Rich Environments

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)