Factored Task and Motion Planning with Combined Optimization, Sampling and Learning
Joaquim Ortiz-Haro
TL;DR
This thesis tackles Task and Motion Planning (TAMP) by introducing a refined factored representation (Factored-NLP) that jointly handles discrete task decisions and continuous trajectory constraints. It presents two bidirectional solvers: Diverse Planning for Logic Geometric Programming (LGP) and the Factored-NLP Planner, which extract infeasible constraint subsets and feed this information back into discrete planning to prune branches. A second strand develops meta-solvers that adaptively combine sampling and optimization, using a TAMP computation tree to reason about compute operations and problem decomposition, with empirical demonstrations on Blocks, Hanoi, and Push-like domains. A third strand integrates learning to accelerate computations, including deep generative models for constraint sampling and graph neural networks to predict infeasibility, enabling faster warm-starts and feasibility checks. Together, these contributions form a cohesive framework for scalable, learning-augmented TAMP that can generalize across multi-robot, multi-object manipulation tasks and pave the way for real-time, perception-enabled autonomous planning in complex environments.
Abstract
In this thesis, we aim to improve the performance of TAMP algorithms from three complementary perspectives. First, we investigate the integration of discrete task planning with continuous trajectory optimization. Our main contribution is a conflict-based solver that automatically discovers why a task plan might fail when considering the constraints of the physical world. This information is then fed back into the task planner, resulting in an efficient, bidirectional, and intuitive interface between task and motion, capable of solving TAMP problems with multiple objects, robots, and tight physical constraints. In the second part, we first illustrate that, given the wide range of tasks and environments within TAMP, neither sampling nor optimization is superior in all settings. To combine the strengths of both approaches, we have designed meta-solvers for TAMP, adaptive solvers that automatically select which algorithms and computations to use and how to best decompose each problem to find a solution faster. In the third part, we combine deep learning architectures with model-based reasoning to accelerate computations within our TAMP solver. Specifically, we target infeasibility detection and nonlinear optimization, focusing on generalization, accuracy, compute time, and data efficiency. At the core of our contributions is a refined, factored representation of the trajectory optimization problems inside TAMP. This structure not only facilitates more efficient planning, encoding of geometric infeasibility, and meta-reasoning but also provides better generalization in neural architectures.
