Algorithm Discovery With LLMs: Evolutionary Search Meets Reinforcement Learning

Anja Surina; Amin Mansouri; Lars Quaedvlieg; Amal Seddas; Maryna Viazovska; Emmanuel Abbe; Caglar Gulcehre

Algorithm Discovery With LLMs: Evolutionary Search Meets Reinforcement Learning

Anja Surina, Amin Mansouri, Lars Quaedvlieg, Amal Seddas, Maryna Viazovska, Emmanuel Abbe, Caglar Gulcehre

TL;DR

EvoTune addresses the challenge of discovering high-quality algorithms by bridging evolutionary search with reinforcement-learning fine-tuning of the LLM. By using evolutionary exploration to generate data and RL to update the LLM policy, EvoTune accelerates progress beyond static-generation baselines, while forward KL regularization maintains output diversity critical for exploration. Across bin packing, traveling salesman, flatpack, and broader Hash Code and LLM-SR benchmarks, EvoTune yields higher top performance and more unique solutions, often outperforming human heuristics and non-LLM baselines. The work demonstrates the viability and potential of RL-enhanced evolutionary strategies for automated algorithm design, with implications for scalable, data-efficient discovery in combinatorial optimization and beyond.

Abstract

Discovering efficient algorithms for solving complex problems has been an outstanding challenge in mathematics and computer science, requiring substantial human expertise over the years. Recent advancements in evolutionary search with large language models (LLMs) have shown promise in accelerating the discovery of algorithms across various domains, particularly in mathematics and optimization. However, existing approaches treat the LLM as a static generator, missing the opportunity to update the model with the signal obtained from evolutionary exploration. In this work, we propose to augment LLM-based evolutionary search by continuously refining the search operator - the LLM - through reinforcement learning (RL) fine-tuning. Our method leverages evolutionary search as an exploration strategy to discover improved algorithms, while RL optimizes the LLM policy based on these discoveries. Our experiments on combinatorial optimization tasks demonstrate that integrating RL with evolutionary search accelerates the discovery of superior algorithms, showcasing the potential of RL-enhanced evolutionary strategies for algorithm design.

Algorithm Discovery With LLMs: Evolutionary Search Meets Reinforcement Learning

TL;DR

Abstract

Algorithm Discovery With LLMs: Evolutionary Search Meets Reinforcement Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)