Abstraction Generation for Generalized Planning with Pretrained Large Language Models

Zhenhe Cui; Huaxiang Xia; Hangjun Shen; Kailun Luo; Yong He; Wei Liang

Abstraction Generation for Generalized Planning with Pretrained Large Language Models

Zhenhe Cui, Huaxiang Xia, Hangjun Shen, Kailun Luo, Yong He, Wei Liang

TL;DR

The paper tackles the challenge of generating robust $QNP$ abstractions for Generalized Planning by leveraging pretrained LLMs through a structured prompt protocol that first derives domain features and then outputs a $QNP$ abstraction, including an automated debugging loop to repair abstraction errors. The approach hinges on a refinement-mapping framework that relates high-level $QNP$ abstractions to low-level GP problems, with iterative checks (ASC, HLISC, HLPRC, LLGRC) and a $QNP$ solver ($DSET$) to guide corrections. Empirical results across seven GP domains show that certain LLMs (notably GPT-5.2) can produce useful abstractions when guided by debugging, while performance varies across models and domains; debugging generally improves coverage, though domain-specific limitations and action-refinement constraints remain challenges. Overall, the work demonstrates a practical pathway to harness LLMs for GP abstraction generation, highlighting both potential benefits and avenues for future enhancement toward more expressive features and stronger abstraction guarantees.

Abstract

Qualitative Numerical Planning (QNP) serves as an important abstraction model for generalized planning (GP), which aims to compute general plans that solve multiple instances at once. Recent works show that large language models (LLMs) can function as generalized planners. This work investigates whether LLMs can serve as QNP abstraction generators for GP problems and how to fix abstractions via automated debugging. We propose a prompt protocol: input a GP domain and training tasks to LLMs, prompting them to generate abstract features and further abstract the initial state, action set, and goal into QNP problems. An automated debugging method is designed to detect abstraction errors, guiding LLMs to fix abstractions. Experiments demonstrate that under properly guided by automated debugging, some LLMs can generate useful QNP abstractions.

Abstraction Generation for Generalized Planning with Pretrained Large Language Models

TL;DR

The paper tackles the challenge of generating robust

abstractions for Generalized Planning by leveraging pretrained LLMs through a structured prompt protocol that first derives domain features and then outputs a

abstraction, including an automated debugging loop to repair abstraction errors. The approach hinges on a refinement-mapping framework that relates high-level

abstractions to low-level GP problems, with iterative checks (ASC, HLISC, HLPRC, LLGRC) and a

solver (

) to guide corrections. Empirical results across seven GP domains show that certain LLMs (notably GPT-5.2) can produce useful abstractions when guided by debugging, while performance varies across models and domains; debugging generally improves coverage, though domain-specific limitations and action-refinement constraints remain challenges. Overall, the work demonstrates a practical pathway to harness LLMs for GP abstraction generation, highlighting both potential benefits and avenues for future enhancement toward more expressive features and stronger abstraction guarantees.

Abstract

Paper Structure (18 sections, 2 theorems, 1 figure, 2 tables, 1 algorithm)

This paper contains 18 sections, 2 theorems, 1 figure, 2 tables, 1 algorithm.

Introduction
Preliminaries
Generalized Planning
Qualitative Numerical Planning
Abstraction for Generalized Planning
QNP Abstraction Generation
Prompt Protocol
Automated Interactive Debugging
Abstraction Solvability Check (ASC).
HL Instance Solvability Check (HLISC).
HL Plan Refinability Check (HLPRC).
LL Goal Reachability Check (LLGRC).
Experiments and Results
Experimental Setup.
Results and Analysis.
...and 3 more sections

Key Result

Theorem 1

If a QNP problem $\mathcal{Q}_h$ is a sound abstraction of a GP problem $\mathcal{Q}_l$ and a policy $\pi$ is a solution of $\mathcal{Q}_h$, then $m(\pi)$ is a solution of $\mathcal{Q}_l$.

Figures (1)

Figure 1: The overall framework for abstraction generation for generalized planning with pretrained LLMs.

Theorems & Definitions (10)

Example 1
Example 2
Definition 1
Example 3
Definition 2
Definition 3
Definition 4
Theorem 1
Definition 5
Corollary 1

Abstraction Generation for Generalized Planning with Pretrained Large Language Models

TL;DR

Abstract

Abstraction Generation for Generalized Planning with Pretrained Large Language Models

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (10)