Tackling GNARLy Problems: Graph Neural Algorithmic Reasoning Reimagined through Reinforcement Learning
Alex Schutz, Victor-Alexandru Darvariu, Efimia Panagiotaki, Bruno Lacerda, Nick Hawes
TL;DR
GNARL addresses the core limitations of Neural Algorithmic Reasoning by reframing algorithm execution as a Markov Decision Process and leveraging reinforcement learning. It introduces an encode-process-act architecture with a proto-action strategy and action masking to produce valid, potentially multiple solutions without post-processing. The framework is evaluated across CLRS-30 problems (polynomial) and NP-hard tasks like MVC, TSP, and RGC, achieving high graph accuracy and competitive performance without relying on expert algorithms in some settings. This work demonstrates a general, graph-based approach to learning algorithms that can operate even when expert algorithms are unavailable, marking progress toward a unified combinatorial optimization framework.
Abstract
Neural Algorithmic Reasoning (NAR) is a paradigm that trains neural networks to execute classic algorithms by supervised learning. Despite its successes, important limitations remain: inability to construct valid solutions without post-processing and to reason about multiple correct ones, poor performance on combinatorial NP-hard problems, and inapplicability to problems for which strong algorithms are not yet known. To address these limitations, we reframe the problem of learning algorithm trajectories as a Markov Decision Process, which imposes structure on the solution construction procedure and unlocks the powerful tools of imitation and reinforcement learning (RL). We propose the GNARL framework, encompassing the methodology to translate problem formulations from NAR to RL and a learning architecture suitable for a wide range of graph-based problems. We achieve very high graph accuracy results on several CLRS-30 problems, performance matching or exceeding much narrower NAR approaches for NP-hard problems and, remarkably, applicability even when lacking an expert algorithm.
