Hand-Object Interaction Controller (HOIC): Deep Reinforcement Learning for Reconstructing Interactions with Physics
Haoyu Hu, Xinyu Yi, Zhe Cao, Jun-Hai Yong, Feng Xu
TL;DR
HOIC tackles the challenge of reconstructing physically plausible hand-object interactions from limited single-view RGBD data in real time. It combines object compensation control with a surface contact model within a PPO-based imitation learning framework, employing mimic and physics rewards to jointly guide hand motion and object dynamics. The approach achieves comparable tracking accuracy to a vision-based kinematic baseline while substantially improving physical plausibility, reducing penetration, and smoothing interaction motion across three objects in a real-time pipeline. This work advances physics-aware HOI reconstruction and lays groundwork for more realistic human–robot interaction under constrained sensing conditions.
Abstract
Hand manipulating objects is an important interaction motion in our daily activities. We faithfully reconstruct this motion with a single RGBD camera by a novel deep reinforcement learning method to leverage physics. Firstly, we propose object compensation control which establishes direct object control to make the network training more stable. Meanwhile, by leveraging the compensation force and torque, we seamlessly upgrade the simple point contact model to a more physical-plausible surface contact model, further improving the reconstruction accuracy and physical correctness. Experiments indicate that without involving any heuristic physical rules, this work still successfully involves physics in the reconstruction of hand-object interactions which are complex motions hard to imitate with deep reinforcement learning. Our code and data are available at https://github.com/hu-hy17/HOIC.
