In-depth Research Impact Summarization through Fine-Grained Temporal Citation Analysis
Hiba Arnaout, Noy Sternlicht, Tom Hope, Iryna Gurevych
TL;DR
The paper tackles the inadequacy of raw citation counts for assessing scientific impact by proposing time-aware impact summaries derived from impact-revealing citation contexts. It introduces a two-stage, LLM-based framework: first extracting fine-grained intents behind citations, then generating structured, time-indexed impact narratives. A new dataset extends PST-Bench to 4k contexts, and an automated, reference-free evaluation framework assesses faithfulness, coverage, and informativeness, with human judges showing moderate-to-strong correlation. Expert feedback confirms practical value, and the authors release code and data to support future research on nuanced, time-evolving scholarly impact.
Abstract
Understanding the impact of scientific publications is crucial for identifying breakthroughs and guiding future research. Traditional metrics based on citation counts often miss the nuanced ways a paper contributes to its field. In this work, we propose a new task: generating nuanced, expressive, and time-aware impact summaries that capture both praise (confirmation citations) and critique (correction citations) through the evolution of fine-grained citation intents. We introduce an evaluation framework tailored to this task, showing moderate to strong human correlation on subjective metrics such as insightfulness. Expert feedback from professors reveals a strong interest in these summaries and suggests future improvements.
