Scaling Up without Fading Out: Goal-Aware Sparse GNN for RL-based Generalized Planning
Sangwoo Jeon, Juchul Shin, Gyeong-Tae Kim, YeonJe Cho, Seongwoo Kim
TL;DR
This work tackles the scalability bottlenecks of RL-based generalized planning in PDDL domains by introducing a sparse, goal-aware GNN representation. It combines sparse local graph connectivity, goal-aware node embeddings, and action embeddings optimized with PPO, complemented by curriculum learning to scale to large grid worlds. Empirical results on drone-inspired grid domains show improved memory efficiency, faster and more stable learning, and strong generalization to unseen, larger instances, with GBFS-GNN inference validating scalability to 25×25 grids. The findings suggest a practical path toward deploying RL-based generalized planners in realistic, large-scale symbolic domains and motivate future integration with BDI-based autonomous drone frameworks.
Abstract
Generalized planning using deep reinforcement learning (RL) combined with graph neural networks (GNNs) has shown promising results in various symbolic planning domains described by PDDL. However, existing approaches typically represent planning states as fully connected graphs, leading to a combinatorial explosion in edge information and substantial sparsity as problem scales grow, especially evident in large grid-based environments. This dense representation results in diluted node-level information, exponentially increases memory requirements, and ultimately makes learning infeasible for larger-scale problems. To address these challenges, we propose a sparse, goal-aware GNN representation that selectively encodes relevant local relationships and explicitly integrates spatial features related to the goal. We validate our approach by designing novel drone mission scenarios based on PDDL within a grid world, effectively simulating realistic mission execution environments. Our experimental results demonstrate that our method scales effectively to larger grid sizes previously infeasible with dense graph representations and substantially improves policy generalization and success rates. Our findings provide a practical foundation for addressing realistic, large-scale generalized planning tasks.
