Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search
Abbas Mehrabian, Ankit Anand, Hyunjik Kim, Nicolas Sonnerat, Matej Balog, Gheorghe Comanici, Tudor Berariu, Andrew Lee, Anian Ruoss, Anna Bulanova, Daniel Toyama, Sam Blackwell, Bernardino Romera Paredes, Petar Veličković, Laurent Orseau, Joonkyung Lee, Anurag Murty Naredla, Doina Precup, Adam Zsolt Wagner
TL;DR
This work tackles the extremal graph problem of maximizing edges in $n$-node graphs without 3- or 4-cycles, formalized as $f(n)=ex(n,{C3,C4})$ and optimized via a telescoping score $s(G)=e(G)-t(G)-q(G)$. It introduces an edge-flipping RL environment and a novel Pairformer architecture to guide AlphaZero, and compares it with incremental tabu search under a curriculum that seeds larger problems from smaller, high-quality graphs. Across sizes $64$ to $190$, incremental tabu search and Incremental AlphaZero push new lower bounds, often matching or surpassing the state of the art, while Wagner’s cross-entropy underperforms. The study finds curriculum-driven incremental search crucial for large state spaces and discusses the implications for learning-to-search in hard combinatorial problems, with potential extensions to other graph-design and optimization challenges.
Abstract
This work studies a central extremal graph theory problem inspired by a 1975 conjecture of Erdős, which aims to find graphs with a given size (number of nodes) that maximize the number of edges without having 3- or 4-cycles. We formulate this problem as a sequential decision-making problem and compare AlphaZero, a neural network-guided tree search, with tabu search, a heuristic local search method. Using either method, by introducing a curriculum -- jump-starting the search for larger graphs using good graphs found at smaller sizes -- we improve the state-of-the-art lower bounds for several sizes. We also propose a flexible graph-generation environment and a permutation-invariant network architecture for learning to search in the space of graphs.
