Satisficing and Optimal Generalised Planning via Goal Regression (Extended Version)
Dillon Z. Chen, Till Hofmann, Toryn Q. Klassen, Sheila A. McIlraith
TL;DR
This paper tackles generalized planning (GP) by introducing Moose, a bottom-up approach that learns generalised plans from training problems through optimal singleton-goal planning, goal regression, and lifting into first-order rules. Moose yields a generalised plan that can be instantiated on new problems or used to prune search by encoding rules as axioms in PDDL, enabling pruning without new solvers. The authors formalise soundness and completeness under goal-independence notions (TGI/SGI/OGI) and demonstrate significant empirical gains in synthesis cost, instantiation cost, and solution quality across classical and numeric domains, including optimal planning settings. The work also shows that learned Moose rules can speed up existing optimal planners, achieving competitive or superior performance relative to state-of-the-art baselines and offering practical impact for scalable, reusable planning policies.
Abstract
Generalised planning (GP) refers to the task of synthesising programs that solve families of related planning problems. We introduce a novel, yet simple method for GP: given a set of training problems, for each problem, compute an optimal plan for each goal atom in some order, perform goal regression on the resulting plans, and lift the corresponding outputs to obtain a set of first-order $\textit{Condition} \rightarrow \textit{Actions}$ rules. The rules collectively constitute a generalised plan that can be executed as is or alternatively be used to prune the planning search space. We formalise and prove the conditions under which our method is guaranteed to learn valid generalised plans and state space pruning axioms for search. Experiments demonstrate significant improvements over state-of-the-art (generalised) planners with respect to the 3 metrics of synthesis cost, planning coverage, and solution quality on various classical and numeric planning domains.
