AutoSurvey: Large Language Models Can Automatically Write Surveys
Yidong Wang, Qi Guo, Wenjin Yao, Hongbo Zhang, Xin Zhang, Zhen Wu, Meishan Zhang, Xinyu Dai, Min Zhang, Qingsong Wen, Wei Ye, Shikun Zhang, Yue Zhang
TL;DR
<3-5 sentence high-level summary> The paper addresses the challenge of rapidly summarizing a growing body of AI research where traditional surveys lag behind. It introduces AutoSurvey, a pipeline with initial retrieval, parallel outline/subsection drafting, and iterative refinement guided by retrieval-augmented generation and multi-LLM evaluation. It demonstrates through extensive experiments that AutoSurvey achieves near-human content and citation quality at a fraction of the time compared with humans and naive RAG baselines. The work provides a structured evaluation framework and detailed ablations, offering a scalable approach to automated scholarly synthesis.
Abstract
This paper introduces AutoSurvey, a speedy and well-organized methodology for automating the creation of comprehensive literature surveys in rapidly evolving fields like artificial intelligence. Traditional survey paper creation faces challenges due to the vast volume and complexity of information, prompting the need for efficient survey methods. While large language models (LLMs) offer promise in automating this process, challenges such as context window limitations, parametric knowledge constraints, and the lack of evaluation benchmarks remain. AutoSurvey addresses these challenges through a systematic approach that involves initial retrieval and outline generation, subsection drafting by specialized LLMs, integration and refinement, and rigorous evaluation and iteration. Our contributions include a comprehensive solution to the survey problem, a reliable evaluation method, and experimental validation demonstrating AutoSurvey's effectiveness.We open our resources at \url{https://github.com/AutoSurveys/AutoSurvey}.
