SurveyX: Academic Survey Automation via Large Language Models

Xun Liang; Jiawei Yang; Yezhaohui Wang; Chen Tang; Zifan Zheng; Shichao Song; Zehao Lin; Yebin Yang; Simin Niu; Hanyu Wang; Bo Tang; Feiyu Xiong; Keming Mao; Zhiyu li

SurveyX: Academic Survey Automation via Large Language Models

Xun Liang, Jiawei Yang, Yezhaohui Wang, Chen Tang, Zifan Zheng, Shichao Song, Zehao Lin, Yebin Yang, Simin Niu, Hanyu Wang, Bo Tang, Feiyu Xiong, Keming Mao, Zhiyu li

TL;DR

The paper addresses the challenge of generating comprehensive, up-to-date academic surveys amidst rapidly expanding literature. It introduces SurveyX, a two-phase system comprising Preparation (reference acquisition with online retrieval and AttributeTree preprocessing) and Generation (outline/content generation plus post-refinement) to produce high-quality surveys with enriched figures and tables. The approach is evaluated with expanded automatic and human metrics, showing SurveyX outperforms prior automated methods and approaches human expert performance across multiple dimensions. This work suggests a scalable framework for automated, credible scholarly surveys with practical implications for researchers and evaluators.

Abstract

Large Language Models (LLMs) have demonstrated exceptional comprehension capabilities and a vast knowledge base, suggesting that LLMs can serve as efficient tools for automated survey generation. However, recent research related to automated survey generation remains constrained by some critical limitations like finite context window, lack of in-depth content discussion, and absence of systematic evaluation frameworks. Inspired by human writing processes, we propose SurveyX, an efficient and organized system for automated survey generation that decomposes the survey composing process into two phases: the Preparation and Generation phases. By innovatively introducing online reference retrieval, a pre-processing method called AttributeTree, and a re-polishing process, SurveyX significantly enhances the efficacy of survey composition. Experimental evaluation results show that SurveyX outperforms existing automated survey generation systems in content quality (0.259 improvement) and citation quality (1.76 enhancement), approaching human expert performance across multiple evaluation dimensions. Examples of surveys generated by SurveyX are available on www.surveyx.cn

SurveyX: Academic Survey Automation via Large Language Models

TL;DR

Abstract

SurveyX: Academic Survey Automation via Large Language Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (16)