Efficient Subgraph GNNs by Learning Effective Selection Policies
Beatrice Bevilacqua, Moshe Eliasof, Eli Meirom, Bruno Ribeiro, Haggai Maron
TL;DR
This work addresses the high computational cost of Subgraph GNNs by learning data-driven policies to select a small, task-relevant subset of subgraphs. It introduces Policy-Learn, a two-network system where a DS-GNN-based selection module builds a bag of subgraphs iteratively using differentiable sampling, and a downstream DS-GNN predictor executes the task on the selected bag. Theoretical results show Policy-Learn can identify isomorphism types for WL-indistinguishable graph families with as few as ℓ subgraphs, outperforming random policies and prior methods, and empirical results demonstrate competitive accuracy with significant runtime savings across diverse datasets. The approach enables efficient, scalable subgraph-based graph representations while preserving expressive power for practical applications. Overall, Policy-Learn advances the practical deployment of Subgraph GNNs by learning when and which subgraphs to process.
Abstract
Subgraph GNNs are provably expressive neural architectures that learn graph representations from sets of subgraphs. Unfortunately, their applicability is hampered by the computational complexity associated with performing message passing on many subgraphs. In this paper, we consider the problem of learning to select a small subset of the large set of possible subgraphs in a data-driven fashion. We first motivate the problem by proving that there are families of WL-indistinguishable graphs for which there exist efficient subgraph selection policies: small subsets of subgraphs that can already identify all the graphs within the family. We then propose a new approach, called Policy-Learn, that learns how to select subgraphs in an iterative manner. We prove that, unlike popular random policies and prior work addressing the same problem, our architecture is able to learn the efficient policies mentioned above. Our experimental results demonstrate that Policy-Learn outperforms existing baselines across a wide range of datasets.
