Automated Algorithmic Discovery for Scientific Computing through LLM-Guided Evolutionary Search: A Case Study in Gravitational-Wave Detection
He Wang, Liang Zeng
TL;DR
The paper presents Evo-MCTS, a domain-agnostic framework that combines reflective code synthesis from large language models with tree-structured evolutionary search to automatically discover interpretable scientific algorithms. Applied to gravitational-wave detection, Evo-MCTS achieves substantial performance gains over both domain-specific baselines and prior LLM-based optimization strategies, while producing transparent algorithmic pathways that can be analyzed post-hoc. The work demonstrates robust generalization, reproducibility across independent runs, and meaningful improvements through domain-knowledge integration, multi-scale evolutionary operations, and reflective reasoning. The framework's design offers a principled pathway to automated algorithm discovery in physics, chemistry, biology, and engineering, where interpretability and physical validity are essential alongside performance.
Abstract
Automated algorithm discovery in scientific computing faces fundamental challenges: vast design spaces with expensive evaluations, domain-specific physical constraints requiring expert knowledge, and the necessity for interpretable solutions that scientists can validate and understand. We present the Evo-MCTS (Evolutionary Monte Carlo Tree Search) framework, integrating large language models (LLMs) with tree-structured evolutionary search for interpretable algorithm discovery. Evo-MCTS combines reflective code synthesis leveraging LLM domain knowledge, multi-scale evolutionary operations on structured code representations, and interpretable algorithmic pathways emerging from tree-guided exploration. When applied to gravitational wave detection-a challenging domain with continuous parameter spaces and strict physical constraints-Evo-MCTS achieves 20.2% improvement over domain-specific methods and 59.1% over LLM-based optimization frameworks. This improvement arises from its ability to consistently converge toward interpretable algorithmic structures that integrate multiple functional components. Our domain-agnostic architecture establishes a generalizable methodology for automated algorithm discovery in scientific computing, where algorithmic transparency and physical validity are as essential as performance optimization.
