Automated Review Generation Method Based on Large Language Models
Shican Wu, Xiao Ma, Dehui Luo, Lulu Li, Xiangcheng Shi, Xin Chang, Xiaoyun Lin, Ran Luo, Chunlei Pei, Changying Du, Zhi-Jian Zhao, Jinlong Gong
TL;DR
The study tackles the problem of exploding scientific literature by introducing an end-to-end automated review generation method based on large language models (LLMs). It combines literature retrieval, topic formulation, knowledge extraction, and review composition within a modular pipeline, augmented by a dual-baseline evaluation framework and multi-layer hallucination mitigation. A propane dehydrogenation (PDH) catalyst case study demonstrates cross-disciplinary applicability, processing 343 articles (and 1041 in extended analysis) across 35 topics, with near-manual quality and robust citation tracing; hallucination probability is reduced to below 0.5% with 95% confidence. An open-source Windows GUI enables one-click review generation, and data mining insights into catalyst design are provided, highlighting broad potential for automated literature analysis across domains and for accelerating scientific knowledge dissemination.
Abstract
Literature research, vital for scientific work, faces the challenge of surging information volumes exceeding researchers' processing capabilities. We present an automated review generation method based on large language models (LLMs) to overcome efficiency bottlenecks and reduce cognitive load. Our statistically validated evaluation framework demonstrates that the generated reviews match or exceed manual quality, offering broad applicability across research fields without requiring users' domain knowledge. Applied to propane dehydrogenation (PDH) catalysts, our method swiftly analyzed 343 articles, averaging seconds per article per LLM account, producing comprehensive reviews spanning 35 topics, with extended analysis of 1041 articles providing insights into catalysts' properties. Through multi-layered quality control, we effectively mitigated LLMs' hallucinations, with expert verification confirming accuracy and citation integrity while demonstrating hallucination risks reduced to below 0.5\% with 95\% confidence. Released Windows application enables one-click review generation, enhancing research productivity and literature recommendation efficiency while setting the stage for broader scientific explorations.
