Finding path and cycle counting formulae in graphs with Deep Reinforcement Learning
Jason Piquenot, Maxime Bérar, Pierre Héroux, Jean-Yves Ramel, Romain Raveaux, Sébastien Adam
TL;DR
This work tackles the problem of discovering efficient matrix-based formulae for counting graph substructures (paths and cycles) by learning within a Context-Free Grammar constrained space. It introduces Grammar Reinforcement Learning (GRL), a deep RL approach that uses Monte Carlo Tree Search over a CFG, implemented via Gramformer, a transformer model that emulates a PushDown Automaton. GRL recovers known Voropaev-style formulae and, crucially, discovers novel, more efficient expressions for path counts up to length six, achieving speedups up to 6.25x. It also adapts the framework to edge-level and directed-graph counting and outlines directions to extend beyond length six by using more expressive k-WL CFGs, with potential impact on scalable graph analytics and interpretability of substructure counting.
Abstract
This paper presents Grammar Reinforcement Learning (GRL), a reinforcement learning algorithm that uses Monte Carlo Tree Search (MCTS) and a transformer architecture that models a Pushdown Automaton (PDA) within a context-free grammar (CFG) framework. Taking as use case the problem of efficiently counting paths and cycles in graphs, a key challenge in network analysis, computer science, biology, and social sciences, GRL discovers new matrix-based formulas for path/cycle counting that improve computational efficiency by factors of two to six w.r.t state-of-the-art approaches. Our contributions include: (i) a framework for generating gramformers that operate within a CFG, (ii) the development of GRL for optimizing formulas within grammatical structures, and (iii) the discovery of novel formulas for graph substructure counting, leading to significant computational improvements.
