Speeding up 6-DoF Grasp Sampling with Quality-Diversity
Johann Huber, François Hélénon, Mathilde Kappel, Elie Chelly, Mahdi Khoramshahi, Faïz Ben Amar, Stéphane Doncieux
TL;DR
This work tackles the data bottleneck in learning 6-DoF grasping by integrating Quality-Diversity (QD) optimization with robotic priors to rapidly generate diverse, robust grasps. It introduces a genotype-to-grasp projection that leverages synergies and priors, guided by a behavior-space-based QD archive to produce a large, diverse set of high-quality grasps. Across extensive simulations on four grippers and multiple YCB objects, QD-based sampling with priors outperforms standard priors and non-QD baselines, and transfers to real-world hardware with substantial sim-to-real fidelity. The results demonstrate the potential to build large, diverse grasp datasets that support robust and generalizable manipulation policies, with real-world experiments showing high transfer rates for the Panda and solid transfer for the Allegro. Overall, the approach offers a scalable path to data-efficient grasp policy learning by fast, diverse data generation that preserves sim-to-real transferability.
Abstract
Recent advances in AI have led to significant results in robotic learning, including natural language-conditioned planning and efficient optimization of controllers using generative models. However, the interaction data remains the bottleneck for generalization. Getting data for grasping is a critical challenge, as this skill is required to complete many manipulation tasks. Quality-Diversity (QD) algorithms optimize a set of solutions to get diverse, high-performing solutions to a given problem. This paper investigates how QD can be combined with priors to speed up the generation of diverse grasps poses in simulation compared to standard 6-DoF grasp sampling schemes. Experiments conducted on 4 grippers with 2-to-5 fingers on standard objects show that QD outperforms commonly used methods by a large margin. Further experiments show that QD optimization automatically finds some efficient priors that are usually hard coded. The deployment of generated grasps on a 2-finger gripper and an Allegro hand shows that the diversity produced maintains sim-to-real transferability. We believe these results to be a significant step toward the generation of large datasets that can lead to robust and generalizing robotic grasping policies.
