SELA: Tree-Search Enhanced LLM Agents for Automated Machine Learning
Yizhou Chi, Yizhang Lin, Sirui Hong, Duyi Pan, Yaying Fei, Guanghao Mei, Bangbang Liu, Tianqi Pang, Jacky Kwok, Ceyao Zhang, Bang Liu, Chenglin Wu
TL;DR
SELA addresses suboptimal diversity and iteration in LLM-driven AutoML by introducing a tree-search framework that uses Monte Carlo Tree Search to explore a tree-structured pipeline space. It combines an insight proposer, an MCTS-based search module, and an experiment executor to iteratively propose, test, and refine ML configurations across five stages (EDA to evaluation). Empirical results on 20 AML datasets show SELA achieving higher average NS and best-rank metrics, beating traditional AutoML and existing LLM-based baselines, with ablations confirming the value of MCTS, rollouts, and LLM adaptability. The approach highlights the potential of structured search and iterative refinement for dynamic, human-like problem solving in AutoML and potentially other sequential decision tasks.
Abstract
Automated Machine Learning (AutoML) approaches encompass traditional methods that optimize fixed pipelines for model selection and ensembling, as well as newer LLM-based frameworks that autonomously build pipelines. While LLM-based agents have shown promise in automating machine learning tasks, they often generate low-diversity and suboptimal code, even after multiple iterations. To overcome these limitations, we introduce Tree-Search Enhanced LLM Agents (SELA), an innovative agent-based system that leverages Monte Carlo Tree Search (MCTS) to optimize the AutoML process. By representing pipeline configurations as trees, our framework enables agents to conduct experiments intelligently and iteratively refine their strategies, facilitating a more effective exploration of the machine learning solution space. This novel approach allows SELA to discover optimal pathways based on experimental feedback, improving the overall quality of the solutions. In an extensive evaluation across 20 machine learning datasets, we compare the performance of traditional and agent-based AutoML methods, demonstrating that SELA achieves a win rate of 65% to 80% against each baseline across all datasets. These results underscore the significant potential of agent-based strategies in AutoML, offering a fresh perspective on tackling complex machine learning challenges.
