Transferring Dexterous Manipulation from GPU Simulation to a Remote Real-World TriFinger
Arthur Allshire, Mayank Mittal, Varun Lodaya, Viktor Makoviychuk, Denys Makoviichuk, Felix Widmaier, Manuel Wüthrich, Stefan Bauer, Ankur Handa, Animesh Garg
TL;DR
This work addresses closed-loop 6-DoF in-hand manipulation with a 3-finger TriFinger by training a single policy in high-speed IsaacGym simulation and transferring it to a remote real robot. It shows that representing the manipulated object with eight 3D keypoints, rather than raw position and quaternion, improves learning and reward shaping for reposing tasks, especially when combined with domain randomization. The resulting policies achieve robust sim-to-real transfer, reaching about 82–83% success on the real apparatus across object morphologies, with a scalable, open-source workflow. Overall, the study demonstrates a practical pathway for end-to-end RL in dexterous manipulation and real-world deployment, enabling reproducibility and extension by other researchers.
Abstract
We present a system for learning a challenging dexterous manipulation task involving moving a cube to an arbitrary 6-DoF pose with only 3-fingers trained with NVIDIA's IsaacGym simulator. We show empirical benefits, both in simulation and sim-to-real transfer, of using keypoints as opposed to position+quaternion representations for the object pose in 6-DoF for policy observations and in reward calculation to train a model-free reinforcement learning agent. By utilizing domain randomization strategies along with the keypoint representation of the pose of the manipulated object, we achieve a high success rate of 83% on a remote TriFinger system maintained by the organizers of the Real Robot Challenge. With the aim of assisting further research in learning in-hand manipulation, we make the codebase of our system, along with trained checkpoints that come with billions of steps of experience available, at https://s2r2-ig.github.io
