Physics-Aware Iterative Learning and Prediction of Saliency Map for Bimanual Grasp Planning
Shiyao Wang, Xiuping Liu, Charlie C. L. Wang, Jian Liu
TL;DR
The paper tackles the challenge of bimanual grasp planning by leveraging abundant single-handed grasp saliency data to predict physically plausible bimanual contact regions without requiring large-scale bimanual annotations. It introduces a physics-aware iterative learning pipeline comprising BSPN, BCPN, a physics-balance loss, and a physics-aware refinement that enforce balance and generalize to unseen objects, followed by grasp synthesis via ContactGrasp with MANO. Key contributions include the saliency corresponding vector framework, iterative updates of the single-handed saliency map, and a refinement module that enforces physical stability, achieving high bimanual grasp success in simulation (e.g., 92.5% across 80 shapes) and superior performance to single-handed baselines. This work reduces data requirements and enhances robustness of bimanual grasping for household objects, with practical impact on dexterous manipulation in unstructured environments.
Abstract
Learning the skill of human bimanual grasping can extend the capabilities of robotic systems when grasping large or heavy objects. However, it requires a much larger search space for grasp points than single-hand grasping and numerous bimanual grasping annotations for network learning, making both data-driven or analytical grasping methods inefficient and insufficient. We propose a framework for bimanual grasp saliency learning that aims to predict the contact points for bimanual grasping based on existing human single-handed grasping data. We learn saliency corresponding vectors through minimal bimanual contact annotations that establishes correspondences between grasp positions of both hands, capable of eliminating the need for training a large-scale bimanual grasp dataset. The existing single-handed grasp saliency value serves as the initial value for bimanual grasp saliency, and we learn a saliency adjusted score that adds the initial value to obtain the final bimanual grasp saliency value, capable of predicting preferred bimanual grasp positions from single-handed grasp saliency. We also introduce a physics-balance loss function and a physics-aware refinement module that enables physical grasp balance, capable of enhancing the generalization of unknown objects. Comprehensive experiments in simulation and comparisons on dexterous grippers have demonstrated that our method can achieve balanced bimanual grasping effectively.
