Learning Dexterous Manipulation Skills from Imperfect Simulations
Elvis Hsieh, Wen-Han Hsieh, Yen-Jen Wang, Toru Lin, Jitendra Malik, Koushil Sreenath, Haozhi Qi
TL;DR
DexScrew tackles the sim-to-real gap in dexterous manipulation by bootstrapping from a simplified simulation to learn rotational finger gaits, then collecting real-world multisensory demonstrations via skill-based teleoperation, and finally training a tactile-aware behavior-cloning policy. The approach yields robust, generalizable manipulation for nut-bolt fastening and screwdriver tasks, outperforming direct sim-to-real transfer and showing strong performance on unseen geometries and under perturbations. Key findings highlight the necessity of tactile feedback and temporal history for stable, efficient manipulation. This staged pipeline offers a practical, scalable path toward dexterous manipulation with general-purpose robotic hands in unstructured environments.
Abstract
Reinforcement learning and sim-to-real transfer have made significant progress in dexterous manipulation. However, progress remains limited by the difficulty of simulating complex contact dynamics and multisensory signals, especially tactile feedback. In this work, we propose \ours, a sim-to-real framework that addresses these limitations and demonstrates its effectiveness on nut-bolt fastening and screwdriving with multi-fingered hands. The framework has three stages. First, we train reinforcement learning policies in simulation using simplified object models that lead to the emergence of correct finger gaits. We then use the learned policy as a skill primitive within a teleoperation system to collect real-world demonstrations that contain tactile and proprioceptive information. Finally, we train a behavior cloning policy that incorporates tactile sensing and show that it generalizes to nuts and screwdrivers with diverse geometries. Experiments across both tasks show high task progress ratios compared to direct sim-to-real transfer and robust performance even on unseen object shapes and under external perturbations. Videos and code are available on https://dexscrew.github.io.
