Large Language Model-based Data Science Agent: A Survey
Ke Chen, Peiran Wang, Yaoning Yu, Xianyang Zhan, Haohan Wang
TL;DR
This survey addresses the challenge of deploying reliable LLM-based agents for data science by presenting a dual-perspective framework that links general agent design principles with concrete data-science workflows. It analyzes agent roles, execution structures, external knowledge sources, and reflection mechanisms, illustrating how single, dual, and multi-agent configurations support end-to-end DS tasks—from data preprocessing to visualization. The work identifies core design dimensions (roles, execution, knowledge, reflection) and surveys current benchmarks, highlighting limitations such as short-horizon tasks and data cleanliness considerations. It proposes three future directions—data-centric diagnostics, uncertainty-aware planning, and pipeline-level reflection—to improve reliability, interpretability, and robustness in DS agents. Overall, the paper offers a structured lens for building end-to-end, AI-assisted data science systems that integrate planning, verification, and data-quality checks into routine workflows.
Abstract
The rapid advancement of Large Language Models (LLMs) has driven novel applications across diverse domains, with LLM-based agents emerging as a crucial area of exploration. This survey presents a comprehensive analysis of LLM-based agents designed for data science tasks, summarizing insights from recent studies. From the agent perspective, we discuss the key design principles, covering agent roles, execution, knowledge, and reflection methods. From the data science perspective, we identify key processes for LLM-based agents, including data preprocessing, model development, evaluation, visualization, etc. Our work offers two key contributions: (1) a comprehensive review of recent developments in applying LLMbased agents to data science tasks; (2) a dual-perspective framework that connects general agent design principles with the practical workflows in data science.
