Aligning LLM+PDDL Symbolic Plans with Human Objective Specifications through Evolutionary Algorithm Guidance
Owen Burns, Dana Hughes, Katia Sycara
TL;DR
This work tackles aligning LLM-generated PDDL plan specifications with human objectives when NL translations produce imperfect symbolic constraints. It introduces a neurosymbolic pipeline that uses a genetic algorithm to evolve plan specifications, guided by an LSTM-based specification adherence model and a symbolic planner (OPTIC) to generate state-trajectories. Empirical results in a naval disaster scenario and a satellite-domain validation show that GA-guided refinement yields more adherent plans than LLM-only translations, improving valid-plan rates from roughly 32–6% to about 48–34%. The approach enables non-experts to specify goals in natural language while systematically correcting mistranslations, with potential for application to dynamic, safety-critical planning tasks.
Abstract
Automated planning using a symbolic planning language, such as PDDL, is a general approach to producing optimal plans to achieve a stated goal. However, creating suitable machine understandable descriptions of the planning domain, problem, and goal requires expertise in the planning language, limiting the utility of these tools for non-expert humans. Recent efforts have explored utilizing a symbolic planner in conjunction with a large language model to generate plans from natural language descriptions given by a non-expert human (LLM+PDDL). Our approach performs initial translation of goal specifications to a set of PDDL goal constraints using an LLM; such translations often result in imprecise symbolic specifications, which are difficult to validate directly. We account for this using an evolutionary approach to generate a population of symbolic goal specifications with slight differences from the initial translation, and utilize a trained LSTM-based validation model to assess whether each induced plan in the population adheres to the natural language specifications. We evaluate our approach on a collection of prototypical specifications in a notional naval disaster recovery task, and demonstrate that our evolutionary approach improve adherence of generated plans to natural language specifications when compared to plans generated using only LLM translations. The code for our method can be found at https://github.com/owenonline/PlanCritic.
