Papers-to-Posts: Supporting Detailed Long-Document Summarization with an Interactive LLM-Powered Source Outline
Marissa Radensky, Daniel S. Weld, Joseph Chee Chang, Pao Siangliulue, Jonathan Bragg
TL;DR
This work introduces interactive reverse source outlines as a mixed-initiative mechanism for grounded long-document summarization and implements it in Papers-to-Posts, a system for authoring research-paper blog posts. The approach maintains the source narrative through an LLM-generated reverse outline and allows iterative content selection and drafting in a plan-draft-revise cycle. Two user studies (lab with 20 participants and deployment with 26 participants across 37 blog posts) show that Papers-to-Posts increases editing power and, under time pressure, improves satisfaction with content coverage, while enabling easier incorporation of source content. The results highlight the value of grounding and controllable content selection for detailed summaries, while also revealing that full flexibility from baseline tools remains useful in some contexts; limitations include parsing accuracy and the absence of figures, guiding future enhancements toward hierarchical outlines and broader domain evaluation.
Abstract
Compressing long and technical documents (e.g., >10 pages) into shorter-form articles (e.g., <2 pages) is critical for communicating information to different audiences, for example, blog posts of scientific research paper or legal briefs of dense court proceedings. While large language models (LLMs) are powerful tools for condensing large amounts of text, current interfaces to these models lack support for understanding and controlling what content is included in a detailed summarizing article. Such capability is especially important for detail- and technical-oriented domains, in which tactical selection and coherent synthesis of key details is critical for effective communication to the target audience. For this, we present interactive reverse source outlines, a novel mechanism for controllable long-form summarization featuring outline bullet points with automatic point selections that the user can iteratively adjust to obtain an article with the desired content coverage. We implement this mechanism in Papers-to-Posts, a new LLM-powered system for authoring research-paper blog posts. Through a within-subjects lab study (n=20) and a between-subjects deployment study (n=37 blog posts, 26 participants), we compare Papers-to-Posts to a strong baseline tool that provides an LLM-generated draft and access to free-form prompting. Under time constraints, Papers-to-Posts significantly increases writer satisfaction with blog post quality, particularly with respect to content coverage. Furthermore, quantitative results showed an increase in editing power (change in text for an amount of time or writing actions) while using Papers-to-Posts, and qualitative results showed that participants found incorporating key research-paper insights in their blog posts easier while using Papers-to-Posts.
