GentleHumanoid: Learning Upper-body Compliance for Contact-rich Human and Object Interaction
Qingzhou Lu, Yao Feng, Baiyu Shi, Michael Piseno, Zhenan Bao, C. Karen Liu
TL;DR
GentleHumanoid tackles the challenge of safe, compliant physical interaction for humanoids by embedding impedance control into a whole-body motion-tracking policy. It introduces a unified spring-based interaction model that handles both resistive and guiding contacts, with adaptive force thresholds to enforce safety, and trains the policy via a PPO-based teacher–student framework using diverse human-motion datasets. Across simulation and real-world tests on the Unitree G1, it achieves lower peak contact forces and smoother, more natural interactions in tasks like hugging, sit-to-stand, and fragile-object handling, outperforming baselines. The work advances practical human–robot collaboration by enabling coordinated upper-body compliance, vision-assisted personalization, and sim-to-real transfer, paving the way for safer assistive and interactive humanoids in real environments.
Abstract
Humanoid robots are expected to operate in human-centered environments where safe and natural physical interaction is essential. However, most recent reinforcement learning (RL) policies emphasize rigid tracking and suppress external forces. Existing impedance-augmented approaches are typically restricted to base or end-effector control and focus on resisting extreme forces rather than enabling compliance. We introduce GentleHumanoid, a framework that integrates impedance control into a whole-body motion tracking policy to achieve upper-body compliance. At its core is a unified spring-based formulation that models both resistive contacts (restoring forces when pressing against surfaces) and guiding contacts (pushes or pulls sampled from human motion data). This formulation ensures kinematically consistent forces across the shoulder, elbow, and wrist, while exposing the policy to diverse interaction scenarios. Safety is further supported through task-adjustable force thresholds. We evaluate our approach in both simulation and on the Unitree G1 humanoid across tasks requiring different levels of compliance, including gentle hugging, sit-to-stand assistance, and safe object manipulation. Compared to baselines, our policy consistently reduces peak contact forces while maintaining task success, resulting in smoother and more natural interactions. These results highlight a step toward humanoid robots that can safely and effectively collaborate with humans and handle objects in real-world environments.
