Broadcasting Support Relations Recursively from Local Dynamics for Object Retrieval in Clutters
Yitong Li, Ruihai Wu, Haoran Lu, Chuanruo Ning, Yan Shen, Guanqi Zhan, Hao Dong
TL;DR
This work addresses safe retrieval of a target object from clutter by modeling inter-object support as a graph built through recursive broadcasting of accurate local dynamics. The framework combines a Retrieval Direction Predictor and a Local Dynamics Predictor to construct a target-centered Support Graph $\mathcal{G}_s$, with a Clutter Solver and Graph Adjustment ensuring robust refinement under occlusions. A Manipulation Affordance Predictor estimates optimal grasp points and poses, enabling safe, stepwise removal of obstructing objects before accessing the target. Empirical results in both large-scale simulations and real-world setups demonstrate superior retrieval success, lower displacement, and fewer steps compared with baselines, highlighting the practical impact for robotic manipulation in complex cluttered environments.
Abstract
In our daily life, cluttered objects are everywhere, from scattered stationery and books cluttering the table to bowls and plates filling the kitchen sink. Retrieving a target object from clutters is an essential while challenging skill for robots, for the difficulty of safely manipulating an object without disturbing others, which requires the robot to plan a manipulation sequence and first move away a few other objects supported by the target object step by step. However, due to the diversity of object configurations (e.g., categories, geometries, locations and poses) and their combinations in clutters, it is difficult for a robot to accurately infer the support relations between objects faraway with various objects in between. In this paper, we study retrieving objects in complicated clutters via a novel method of recursively broadcasting the accurate local dynamics to build a support relation graph of the whole scene, which largely reduces the complexity of the support relation inference and improves the accuracy. Experiments in both simulation and the real world demonstrate the efficiency and effectiveness of our method.
