Incremental Summarization for Customer Support via Progressive Note-Taking and Agent Feedback
Yisha Wu, Cen Mia Zhao, Yuanpei Cao, Xiaoqing Su, Yashar Mehdad, Mindy Ji, Claire Na Cheng
TL;DR
This work addresses the cost of maintaining context in multi-channel customer-support conversations by proposing an incremental summarization system that generates concise notes online via progressive note-taking and a fine-tuned Mixtral-8x7B model. A DeBERTa-based bullet relevance classifier filters non-essential updates, while an Agent Edits Learning Framework integrates real-time agent feedback online and offline retraining to continually improve the system. In production, the approach yields a 3% reduction in average case handling time, with up to 9% reductions on complex cases, and maintains high agent satisfaction across English, French, and Spanish deployments. Evaluation combines offline LLM-judge assessments and a Diff-in-Diff online study, demonstrating improved summary quality and substantial operational efficiency at scale, with insights into multilingual performance and deployment considerations.
Abstract
We introduce an incremental summarization system for customer support agents that intelligently determines when to generate concise bullet notes during conversations, reducing agents' context-switching effort and redundant review. Our approach combines a fine-tuned Mixtral-8x7B model for continuous note generation with a DeBERTa-based classifier to filter trivial content. Agent edits refine the online notes generation and regularly inform offline model retraining, closing the agent edits feedback loop. Deployed in production, our system achieved a 3% reduction in case handling time compared to bulk summarization (with reductions of up to 9% in highly complex cases), alongside high agent satisfaction ratings from surveys. These results demonstrate that incremental summarization with continuous feedback effectively enhances summary quality and agent productivity at scale.
