Exploring Iterative Controllable Summarization with Large Language Models
Sangwon Ryu, Heejin Do, Daehee Kim, Hwanjo Yu, Dongwoo Kim, Yunsu Kim, Gary Geunbae Lee, Jungseul Ok
TL;DR
This work identifies a gap in precisely controllable summarization by LLMs and introduces a refined attribute measurement suite for extractiveness, length, topic, and speaker, along with iterative evaluation metrics. It proposes Guide-to-Explain (GTE), a two-phase framework that uses step-by-step attribute identification and self-explanation guidance to steer LLMs toward attribute-aligned summaries in few iterations. Empirical results on MACSum datasets show that GTE dramatically reduces failures and iterations while preserving or improving overall summary quality, even for numerically constrained attributes. The findings highlight both the feasibility and limits of iterative controllable summarization with LLMs and point to future work on balancing multiple correlated attributes and exploring more robust planning strategies.
Abstract
Large language models (LLMs) have demonstrated remarkable performance in abstractive summarization tasks. However, their ability to precisely control summary attributes (e.g., length or topic) remains underexplored, limiting their adaptability to specific user preferences. In this paper, we systematically explore the controllability of LLMs. To this end, we revisit summary attribute measurements and introduce iterative evaluation metrics, failure rate and average iteration count to precisely evaluate controllability of LLMs, rather than merely assessing errors. Our findings show that LLMs struggle more with numerical attributes than with linguistic attributes. To address this challenge, we propose a guide-to-explain framework (GTE) for controllable summarization. Our GTE framework enables the model to identify misaligned attributes in the initial draft and guides it in self-explaining errors in the previous output. By allowing the model to reflect on its misalignment, GTE generates well-adjusted summaries that satisfy the desired attributes with robust effectiveness, requiring surprisingly fewer iterations than other iterative approaches.
