Instructive Dialogue Summarization with Query Aggregations

Bin Wang; Zhengyuan Liu; Nancy F. Chen

Instructive Dialogue Summarization with Query Aggregations

Bin Wang, Zhengyuan Liu, Nancy F. Chen

TL;DR

The paper tackles the problem of tailoring dialogue summaries to user interests, which traditional methods overlook. It introduces InstructDS, an instruction-tuning framework for dialogues, and a three-step pipeline to synthesize query-dialogue-summary triples (QDS) by generating candidate queries, filtering for quality and diversity, and producing query-based summaries; this is trained across three dialogue datasets to form a unified model. Empirically, InstructDS achieves state-of-the-art results on SAMSum and strong performance on DialogSum and TODSum, while transferring effectively to the DREAM reading-comprehension task, with human evaluations indicating competitive fluency, informativeness, and conciseness, and strong faithfulness. The approach demonstrates how synthesized QDS data and length-aware instruction tuning can yield flexible, faithful, and concise summaries that adapt to user queries and potentially scale to long dialogues and privacy-aware settings in the future.

Abstract

Conventional dialogue summarization methods directly generate summaries and do not consider user's specific interests. This poses challenges in cases where the users are more focused on particular topics or aspects. With the advancement of instruction-finetuned language models, we introduce instruction-tuning to dialogues to expand the capability set of dialogue summarization models. To overcome the scarcity of instructive dialogue summarization data, we propose a three-step approach to synthesize high-quality query-based summarization triples. This process involves summary-anchored query generation, query filtering, and query-based summary generation. By training a unified model called InstructDS (Instructive Dialogue Summarization) on three summarization datasets with multi-purpose instructive triples, we expand the capability of dialogue summarization models. We evaluate our method on four datasets, including dialogue summarization and dialogue reading comprehension. Experimental results show that our approach outperforms the state-of-the-art models and even models with larger sizes. Additionally, our model exhibits higher generalizability and faithfulness, as confirmed by human subjective evaluations.

Instructive Dialogue Summarization with Query Aggregations

TL;DR

Abstract

Paper Structure (22 sections, 8 figures, 17 tables)

This paper contains 22 sections, 8 figures, 17 tables.

Introduction
Related Work
Dialogue Summarization
Instruction Tuning
Instructive Dialogue Summarization
Problem Definition
Synthesize QDS Triples
Model Training
Experiments
Datasets, Metrics, and Baselines
Main Results
Ablation Study
Subjective Quality Evaluation
Discussions
Relationship with Query-focused Summarization
...and 7 more sections

Figures (8)

Figure 1: Instructive dialogue summarization models, such as InstructDS, demonstrate multiple capabilities.
Figure 2: Overall framework of our Instructive Dialogue Summarization (InstructDS) model.
Figure 3: Ablation study on different variants of InstructDS. Averaged ROUGE-1/2/L score is reported for dialogue summarization datasets, including SAMSum, DialogSum and TODSum. Accuracy is computed for DREAM dataset.
Figure 4: Ablation study on the number of included QDS triples. Performance on SAMSum (left, averaged ROUGE) and DREAM (right, accuracy) datasets are reported.
Figure 5: Ablation study on the percentage of length augmented instances. Performance on standard SAMSum and length-revealed SAMSum are reported.
...and 3 more figures

Instructive Dialogue Summarization with Query Aggregations

TL;DR

Abstract

Instructive Dialogue Summarization with Query Aggregations

Authors

TL;DR

Abstract

Table of Contents

Figures (8)