Tabletop Object Rearrangement: Structure, Complexity, and Efficient Combinatorial Search-Based Solutions
Kai Gao
TL;DR
This thesis delivers a comprehensive treatment of tabletop object rearrangement (TORO) across multiple workspace and manipulator configurations. It first establishes the NP-hardness of running buffer minimization (MRB) in TORE and develops scalable exact algorithms for both labeled and unlabeled settings, linking MRB to linear vertex ordering and FVS-based bounds. It then extends to internal buffers with TORI via lazy buffer verification (TRLB), introducing preprocessing, heterogeneous object support (HeTORI), and weighted planning strategies that integrate with Monte Carlo Tree Search and MCTS. The work further advances dual-arm coordination (CDR) with a GPU-accelerated task-and-motion pipeline (MODAP), providing robust scheduling, buffer management, and real-time control, and explores mobile tabletop rearrangement with ORLA* (lazy A*), including StabilNet-based stability predictions for general shapes. Collectively, these contributions yield high-quality, scalable planning and execution methods for dense, constrained tabletop rearrangements with single, dual, and mobile manipulators, demonstrated through extensive simulations and hardware experiments. The findings suggest that practical TORO instances exhibit small MRB on average, while the proposed lazy-buffer techniques enable near-optimal planning in challenging, cluttered environments with real-time applicability.
Abstract
This thesis provides an in-depth structural analysis and efficient algorithmic solutions for tabletop object rearrangement with overhand grasps (TORO), a foundational task in advancing intelligent robotic manipulation. Rearranging multiple objects in a confined workspace presents two primary challenges: sequencing actions to minimize pick-and-place operations - an NP-hard problem in TORO - and determining temporary object placements ("buffer poses") within a cluttered environment, which is essential yet highly complex. For TORO with available external free space, this work investigates the minimum buffer space, or "running buffer size," required for temporary relocations, presenting both theoretical insights and exact algorithms. For TORO without external free space, the concept of lazy buffer verification is introduced, with its efficiency evaluated across various manipulator configurations, including single-arm, dual-arm, and mobile manipulators.
