What Planning Problems Can A Relational Neural Network Solve?
Jiayuan Mao, Tomás Lozano-Pérez, Joshua B. Tenenbaum, Leslie Pack Kaelbling
TL;DR
This work asks when relational neural networks (RelNNs) can implement goal-conditioned planning policies with polynomial-size circuits. It connects policy realization to classical planning width by introducing serialized goal regression search (S-GRS) and the notions of regression width and SOS width, then derives upper bounds on policy-circuit size and depth as functions of these widths and the planning horizon. It presents two RelNN-based compilation schemes (direct backward search and regression-rule-selector-guided compilation) that can yield finite-breadth, finite-depth circuits in domains with low width, and demonstrates depth-enabled generalization in several object-centric domains (e.g., Assembly3, Logistics, Blocks World). The results help explain why RelNNs can generalize to larger instances in many planning tasks, clarify the challenges in harder domains such as Sokoban, and suggest design choices for neural planners as well as extensions to hierarchical or continuous settings.
Abstract
Goal-conditioned policies are generally understood to be "feed-forward" circuits, in the form of neural networks that map from the current state and the goal specification to the next action to take. However, under what circumstances such a policy can be learned and how efficient the policy will be are not well understood. In this paper, we present a circuit complexity analysis for relational neural networks (such as graph neural networks and transformers) representing policies for planning problems, by drawing connections with serialized goal regression search (S-GRS). We show that there are three general classes of planning problems, in terms of the growth of circuit width and depth as a function of the number of objects and planning horizon, providing constructive proofs. We also illustrate the utility of this analysis for designing neural networks for policy learning.
