CS-PaperSum: A Large-Scale Dataset of AI-Generated Summaries for Scientific Papers
Javin Liu, Aryan Vats, Zihao He
TL;DR
CS-PaperSum tackles the challenge of scalable scientific literature analysis in computer science by providing structured AI-generated summaries for nearly 92k papers across 31 conferences. The approach uses GPT-3.5 to extract key contributions, methodologies, evaluation metrics, and future directions, enabling standardized cross-paper comparisons. The authors validate the quality with SciBERT-based embeddings and KeyBERT keyword overlap, showing faithful semantic preservation and topic retention. A case study demonstrates how the dataset supports trend detection across major AI conferences, highlighting shifts toward self-supervised learning, retrieval-augmented generation, and multimodal AI, with broader implications for automated literature reviews and AI-driven discovery.
Abstract
The rapid expansion of scientific literature in computer science presents challenges in tracking research trends and extracting key insights. Existing datasets provide metadata but lack structured summaries that capture core contributions and methodologies. We introduce CS-PaperSum, a large-scale dataset of 91,919 papers from 31 top-tier computer science conferences, enriched with AI-generated structured summaries using ChatGPT. To assess summary quality, we conduct embedding alignment analysis and keyword overlap analysis, demonstrating strong preservation of key concepts. We further present a case study on AI research trends, highlighting shifts in methodologies and interdisciplinary crossovers, including the rise of self-supervised learning, retrieval-augmented generation, and multimodal AI. Our dataset enables automated literature analysis, research trend forecasting, and AI-driven scientific discovery, providing a valuable resource for researchers, policymakers, and scientific information retrieval systems.
