DextrAH-RGB: Visuomotor Policies to Grasp Anything with Dexterous Hands

Ritvik Singh; Arthur Allshire; Ankur Handa; Nathan Ratliff; Karl Van Wyk

TL;DR

DextrAH-RGB advances end-to-end RGB-based visuomotor control for dexterous grasping by training a privileged state-based Fabric-Guided Policy (FGP) in simulation and distilling it into a stereo RGB-based policy via online DAgger and photorealistic rendering. The approach leverages geometric fabrics to enforce safe, reactive behavior and uses a cross-attention transformer to extract depth cues from stereo RGB inputs. Real-world experiments with a Kuka iiwa and Allegro Hand demonstrate competitive sim-to-real transfer across unseen objects and lighting conditions, though transfer variability and training complexity remain challenges. Overall, the work establishes a scalable path for RGB-driven dexterous manipulation trained in simulation with robust real-world performance, paving the way for multi-object and more dexterous capabilities.

Abstract

One of the most important, yet challenging, skills for a dexterous robot is grasping a diverse range of objects. Much of the prior work has been limited by speed, generality, or reliance on depth maps and object poses. In this paper, we introduce DextrAH-RGB, a system that can perform dexterous arm-hand grasping end-to-end from RGB image input. We train a privileged fabric-guided policy (FGP) in simulation through reinforcement learning that acts on a geometric fabric controller to dexterously grasp a wide variety of objects. We then distill this privileged FGP into an RGB-based FGP strictly in simulation using photorealistic tiled rendering. To our knowledge, this is the first work to demonstrate robust sim-to-real transfer of an end-to-end RGB-based policy for complex, dynamic, contact-rich tasks such as dexterous grasping. DextrAH-RGB is competitive with depth-based dexterous grasping policies, and generalizes to novel objects with unseen geometry, texture, and lighting conditions in the real world. Videos of our system grasping a diverse range of unseen objects are available at https://dextrah-rgb.github.io/.
