Twisting Lids Off with Two Hands
Toru Lin, Zhao-Heng Yin, Haozhi Qi, Pieter Abbeel, Jitendra Malik
TL;DR
The paper tackles the challenge of dexterous bimanual manipulation by training a two-handed lid-twisting policy in simulation and transferring it zero-shot to real robots. It introduces a fast brake-based friction model, a minimal sparse perception pipeline, and a keypoint-based contact reward, combined with domain randomization and asymmetric PPO, to achieve robust sim-to-real transfer. Real-world experiments with dual Allegro Hands demonstrate generalization across seen and unseen bottle objects, resilience to perturbations, and partial lid-removal on novel shapes. Overall, the work advances practical two-handed manipulation by enabling generalizable, high-precision contact-rich policies without relying on real-world expert demonstrations.
Abstract
Manipulating objects with two multi-fingered hands has been a long-standing challenge in robotics, due to the contact-rich nature of many manipulation tasks and the complexity inherent in coordinating a high-dimensional bimanual system. In this work, we share novel insights into physical modeling, real-time perception, and reward design that enable policies trained in simulation using deep reinforcement learning (RL) to be effectively and efficiently transferred to the real world. Specifically, we consider the problem of twisting lids of various bottle-like objects with two hands, demonstrating policies with generalization capabilities across a diverse set of unseen objects as well as dynamic and dexterous behaviors. To the best of our knowledge, this is the first sim-to-real RL system that enables such capabilities on bimanual multi-fingered hands.
