ForceMimic: Force-Centric Imitation Learning with Force-Motion Capture System for Contact-Rich Manipulation
Wenhai Liu, Junbo Wang, Yiming Wang, Weiming Wang, Cewu Lu
TL;DR
ForceMimic addresses the underutilization of force cues in imitation learning for contact-rich manipulation by integrating ForceCapture, a robot-free data-collection system, with HybridIL, a diffusion-policy-based learner that predicts wrench and pose and uses a hybrid force-position controller. The approach yields a substantial improvement in zucchini peeling performance and data-collection efficiency versus traditional vision-based imitation and teleoperation. Key contributions include ForceCapture hardware design and a force-centric imitation learning algorithm that switches between IK-based and force-position primitives based on contact forces. The results suggest force-centric IL is feasible and beneficial for robust manipulation, and point to future work on multimodal representations and expanded force-control primitives.
Abstract
In most contact-rich manipulation tasks, humans apply time-varying forces to the target object, compensating for inaccuracies in the vision-guided hand trajectory. However, current robot learning algorithms primarily focus on trajectory-based policy, with limited attention given to learning force-related skills. To address this limitation, we introduce ForceMimic, a force-centric robot learning system, providing a natural, force-aware and robot-free robotic demonstration collection system, along with a hybrid force-motion imitation learning algorithm for robust contact-rich manipulation. Using the proposed ForceCapture system, an operator can peel a zucchini in 5 minutes, while force-feedback teleoperation takes over 13 minutes and struggles with task completion. With the collected data, we propose HybridIL to train a force-centric imitation learning model, equipped with hybrid force-position control primitive to fit the predicted wrench-position parameters during robot execution. Experiments demonstrate that our approach enables the model to learn a more robust policy under the contact-rich task of vegetable peeling, increasing the success rates by 54.5% relatively compared to state-ofthe-art pure-vision-based imitation learning. Hardware, code, data and more results can be found on the project website at https://forcemimic.github.io.
