Table of Contents
Fetching ...

Chain of Summaries: Summarization Through Iterative Questioning

William Brach, Lukas Galke Poech

TL;DR

CoS addresses the challenge of making web content accessible to LLMs by generating information-dense, plain-text summaries through an iterative, question-guided refinement inspired by Hegel's dialectic. The method uses an initial thesis, then antithesis via synthetic questions, and a synthesis that best supports downstream QA, iterating to maximize generalization while minimizing token usage. Empirical results on TriviaQA, TruthfulQA, and SQuAD show CoS outperforms zero-shot baselines and specialized summarizers, often surpassing full-source QA performance while using far fewer tokens. The approach is model-agnostic and suitable for server-side deployment as a content cache with human oversight and synthetic QA generation when labeled data are scarce.

Abstract

Large Language Models (LLMs) are increasingly using external web content. However, much of this content is not easily digestible by LLMs due to LLM-unfriendly formats and limitations of context length. To address this issue, we propose a method for generating general-purpose, information-dense summaries that act as plain-text repositories of web content. Inspired by Hegel's dialectical method, our approach, denoted as Chain of Summaries (CoS), iteratively refines an initial summary (thesis) by identifying its limitations through questioning (antithesis), leading to a general-purpose summary (synthesis) that can satisfy current and anticipate future information needs. Experiments on the TriviaQA, TruthfulQA, and SQUAD datasets demonstrate that CoS outperforms zero-shot LLM baselines by up to 66% and specialized summarization methods such as BRIO and PEGASUS by up to 27%. CoS-generated summaries yield higher Q&A performance compared to the source content, while requiring substantially fewer tokens and being agnostic to the specific downstream LLM. CoS thus resembles an appealing option for website maintainers to make their content more accessible for LLMs, while retaining possibilities for human oversight.

Chain of Summaries: Summarization Through Iterative Questioning

TL;DR

CoS addresses the challenge of making web content accessible to LLMs by generating information-dense, plain-text summaries through an iterative, question-guided refinement inspired by Hegel's dialectic. The method uses an initial thesis, then antithesis via synthetic questions, and a synthesis that best supports downstream QA, iterating to maximize generalization while minimizing token usage. Empirical results on TriviaQA, TruthfulQA, and SQuAD show CoS outperforms zero-shot baselines and specialized summarizers, often surpassing full-source QA performance while using far fewer tokens. The approach is model-agnostic and suitable for server-side deployment as a content cache with human oversight and synthetic QA generation when labeled data are scarce.

Abstract

Large Language Models (LLMs) are increasingly using external web content. However, much of this content is not easily digestible by LLMs due to LLM-unfriendly formats and limitations of context length. To address this issue, we propose a method for generating general-purpose, information-dense summaries that act as plain-text repositories of web content. Inspired by Hegel's dialectical method, our approach, denoted as Chain of Summaries (CoS), iteratively refines an initial summary (thesis) by identifying its limitations through questioning (antithesis), leading to a general-purpose summary (synthesis) that can satisfy current and anticipate future information needs. Experiments on the TriviaQA, TruthfulQA, and SQUAD datasets demonstrate that CoS outperforms zero-shot LLM baselines by up to 66% and specialized summarization methods such as BRIO and PEGASUS by up to 27%. CoS-generated summaries yield higher Q&A performance compared to the source content, while requiring substantially fewer tokens and being agnostic to the specific downstream LLM. CoS thus resembles an appealing option for website maintainers to make their content more accessible for LLMs, while retaining possibilities for human oversight.

Paper Structure

This paper contains 30 sections, 4 figures, 6 tables.

Figures (4)

  • Figure 1: Chain of Summaries (CoS) method implements Hegel's dialectical method for document summarization, iteratively refining an initial summary (thesis) through challenging questions (antithesis) to produce an information-dense optimal summary (synthesis) capable of answering diverse queries.
  • Figure 2: Summary length (tokens, log scale) vs. Correct-F1 score on the TriviaQA dataset, comparing performance across model backbones and summarization methods. This visualization demonstrates CoS's ability to create information-dense summaries that optimize the efficiency-performance trade-off across model architectures.
  • Figure 3: Distribution of summary lengths, measured in gpt-4o-mini tokens.
  • Figure 4: Distribution of F1 scores in the training dataset used for the reward model.