Safe Learning of PDDL Domains with Conditional Effects -- Extended Version
Argaman Mordoch, Enrico Scala, Roni Stern, Brendan Juba
TL;DR
This work introduces Conditional-SAM, the first algorithm for learning safe PDDL action models in domains with conditional and universal effects. It provides theoretical guarantees on safety, space/time complexity, and sharp sample-complexity bounds for fixed antecedent size, plus extensions to lifted and universally quantified effects. Empirically, Conditional-SAM yields learned models that allow planners to solve test problems in multiple domains, with near-perfect semantic recall of preconditions and exact match of effects to the real model when applicable. The results demonstrate practical applicability while highlighting challenges in domains with large antecedent sizes or complex universal preconditions, guiding future improvements in scalability and expressivity. Overall, the approach offers a principled, safety-preserving path to automatic action-model induction in expressive planning languages.
Abstract
Powerful domain-independent planners have been developed to solve various types of planning problems. These planners often require a model of the acting agent's actions, given in some planning domain description language. Manually designing such an action model is a notoriously challenging task. An alternative is to automatically learn action models from observation. Such an action model is called safe if every plan created with it is consistent with the real, unknown action model. Algorithms for learning such safe action models exist, yet they cannot handle domains with conditional or universal effects, which are common constructs in many planning problems. We prove that learning non-trivial safe action models with conditional effects may require an exponential number of samples. Then, we identify reasonable assumptions under which such learning is tractable and propose SAM Learning of Conditional Effects (Conditional-SAM), the first algorithm capable of doing so. We analyze Conditional-SAM theoretically and evaluate it experimentally. Our results show that the action models learned by Conditional-SAM can be used to solve perfectly most of the test set problems in most of the experimented domains.
