A Planar-Symmetric SO(3) Representation for Learning Grasp Detection
Tianyi Ko, Takuya Ikeda, Hiroya Sato, Koichi Nishiwaki
TL;DR
This work tackles the bi-modal rotation ambiguity introduced by planar-symmetric grippers by introducing a planar-symmetric SO(3) representation based on a 2D Bingham distribution. The authors encode two symmetric gripper poses with a single 9-parameter set, train a grasp detector with a joint loss combining cosine similarity and BNLL, and validate via eigen-decomposition or sampling at inference time. Experiments demonstrate improved rotation continuity, informative uncertainty, and higher grasp success and clarity in both simulation and real-robot setup, especially for yaw-critical, large-flat-object scenarios. The approach serves as a practical add-on to direct rotation regression detectors, offering improved yaw robustness without substantial computational overhead.
Abstract
Planar-symmetric hands, such as parallel grippers, are widely adopted in both research and industrial fields. Their symmetry, however, introduces ambiguity and discontinuity in the SO(3) representation, which hinders both the training and inference of neural-network-based grasp detectors. We propose a novel SO(3) representation that can parametrize a pair of planar-symmetric poses with a single parameter set by leveraging the 2D Bingham distribution. We also detail a grasp detector based on our representation, which provides a more consistent rotation output. An intensive evaluation with multiple grippers and objects in both the simulation and the real world quantitatively shows our approach's contribution.
