Multi-Stage Monte Carlo Tree Search for Non-Monotone Object Rearrangement Planning in Narrow Confined Environments
Hanwen Ren, Ahmed H. Qureshi
TL;DR
The paper tackles non-monotone object rearrangement in narrow confined environments, a problem that is NP-hard due to joint object ordering and relocation constraints. It introduces a Multi-Stage Monte Carlo Tree Search (MS-MCTS) framework that decouples the problem into ordered subproblems via an object stage topology and a subgoal-focused expansion strategy, augmented by a linear motion planner to minimize swept volume. Key contributions include the object stage topology generation, a subgoal-focused SS-MCTS with blocking-object handling, and post-optimization that merges actions for efficiency, all validated by sim-to-real experiments on a UR5e. The results demonstrate superior planning time and plan quality compared to baselines, showing practical impact for efficient, high-quality object rearrangement in tight spaces.
Abstract
Non-monotone object rearrangement planning in confined spaces such as cabinets and shelves is a widely occurring but challenging problem in robotics. Both the robot motion and the available regions for object relocation are highly constrained because of the limited space. This work proposes a Multi-Stage Monte Carlo Tree Search (MS-MCTS) method to solve non-monotone object rearrangement planning problems in confined spaces. Our approach decouples the complex problem into simpler subproblems using an object stage topology. A subgoal-focused tree expansion algorithm that jointly considers the high-level planning and the low-level robot motion is designed to reduce the search space and better guide the search process. By fitting the task into the MCTS paradigm, our method produces optimistic solutions by balancing exploration and exploitation. The experiments demonstrate that our method outperforms the existing methods in terms of the planning time, the number of steps, and the total move distance. Moreover, we deploy our MS-MCTS to a real-world robot system and verify its performance in different scenarios.
