Graph Neural Network Based Action Ranking for Planning
Rajesh Mangannavar, Stefan Lee, Alan Fern, Prasad Tadepalli
TL;DR
Graph Neural Network Based Action Ranking for Planning (GABAR) tackles the challenge of scalable planning by shifting from learning global value functions to directly ranking actions in each state. It introduces an action-centric graph representation and a GNN encoder paired with a GRU-based autoregressive decoder to construct grounded actions, with beam search enabling exploration of multiple candidates. Trained on small, solvable instances, GABAR generalizes to significantly larger problems and outperforms baselines such as GPL, ASNets, and GRAPL in both coverage and plan quality, as well as surpassing large-language-model prompts in planning tasks. The work demonstrates that local action ranking on relational graphs yields robust generalization and practical planning performance, suggesting a promising direction for scalable, relational policy learning in classical planning.
Abstract
We propose a novel approach to learn relational policies for classical planning based on learning to rank actions. We introduce a new graph representation that explicitly captures action information and propose a Graph Neural Network (GNN) architecture augmented with Gated Recurrent Units (GRUs) to learn action rankings. Unlike value-function based approaches that must learn a globally consistent function, our action ranking method only needs to learn locally consistent ranking. Our model is trained on data generated from small problem instances that are easily solved by planners and is applied to significantly larger instances where planning is computationally prohibitive. Experimental results across standard planning benchmarks demonstrate that our action-ranking approach not only achieves better generalization to larger problems than those used in training but also outperforms multiple baselines (value function and action ranking) methods in terms of success rate and plan quality.
