Collapse and Collision Aware Grasping for Cluttered Shelf Picking
Abhinav Pathak, Rajkumar Muthusamy
TL;DR
This work tackles safe cluttered-shelf retrieval by integrating physics reasoning into grasp planning. It introduces a real-to-sim pipeline that reconstructs an approximate scene from a single RGB-D image and uses PyBullet to simulate collapses and collisions before execution. The authors compare a physics-based planning approach with a heuristic baseline for both single-box extraction and shelf clearance, reporting substantial improvements in efficiency and success rates, especially in unstructured stacks. The results suggest that physics-aware planning enhances reliability and throughput for warehouse manipulation, with future work aiming to broaden sensing and manipulation capabilities.
Abstract
Efficient and safe retrieval of stacked objects in warehouse environments is a significant challenge due to complex spatial dependencies and structural inter-dependencies. Traditional vision-based methods excel at object localization but often lack the physical reasoning required to predict the consequences of extraction, leading to unintended collisions and collapses. This paper proposes a collapse and collision aware grasp planner that integrates dynamic physics simulations for robotic decision-making. Using a single image and depth map, an approximate 3D representation of the scene is reconstructed in a simulation environment, enabling the robot to evaluate different retrieval strategies before execution. Two approaches 1) heuristic-based and 2) physics-based are proposed for both single-box extraction and shelf clearance tasks. Extensive real-world experiments on structured and unstructured box stacks, along with validation using datasets from existing databases, show that our physics-aware method significantly improves efficiency and success rates compared to baseline heuristics.
