Learn2Decompose: Learning Problem Decomposition for Efficient Sequential Multi-object Manipulation Planning
Yan Zhang, Teng Xue, Amirreza Razmjoo, Sylvain Calinon
TL;DR
This work tackles the exponential growth of planning time in sequential multi-object manipulation by learning problem decompositions from demonstrations. It introduces three modules—goal decomposition learning (via sequential pattern mining to extract must-pass subgoal sequences), computational distance learning (a GNN that predicts object importance to identify the closest subgoal), and object reduction (parallel replanning over reduced object sets)—all integrated with a classical TAMP solver (PDDLStream). The approach reduces planning horizon and active object counts, improving replanning speed and robustness to disturbances, and generalizes to more realistic kitchens and real-world robots. Empirical evaluations across Tower Construction, Tool Use, Cooking benchmarks, and transfers to PR2 PyBullet and real Franka experiments demonstrate significant gains over baselines and strong transferability with limited retraining.
Abstract
We present an efficient task and motion replanning approach for sequential multi-object manipulation in dynamic environments. Conventional Task And Motion Planning (TAMP) solvers experience an exponential increase in planning time as the planning horizon and number of objects grow, limiting their applicability in real-world scenarios. To address this, we propose learning problem decompositions from demonstrations to accelerate TAMP solvers. Our approach consists of three key components: goal decomposition learning, computational distance learning, and object reduction. Goal decomposition identifies the necessary sequences of states that the system must pass through before reaching the final goal, treating them as subgoal sequences. Computational distance learning predicts the computational complexity between two states, enabling the system to identify the temporally closest subgoal from a disturbed state. Object reduction minimizes the set of active objects considered during replanning, further improving efficiency. We evaluate our approach on three benchmarks, demonstrating its effectiveness in improving replanning efficiency for sequential multi-object manipulation tasks in dynamic environments.
