SEGMENT+: Long Text Processing with Short-Context Language Models
Wei Shi, Shuang Li, Kerun Yu, Jinglei Chen, Zujie Liang, Xinhui Wu, Yuxi Qian, Feng Wei, Bo Zheng, Jiaqing Liang, Jiangjie Chen, Yanghua Xiao
TL;DR
Segment+ addresses the challenge of long-input processing for language models with restricted context windows by introducing a two-stage framework that gathers and merges information using structured notes and a filtering module. It defines notes with Evidence and Reasoning, enabling controllable information flow and interpretable reasoning, and applies batch merging to fit within the limited context windows. Across long-document QA and Babilong style tasks, Segment+ yields robust performance gains across model sizes, outperforming retrieval-augmented and agent-based baselines and demonstrating strong noise resistance and efficiency. The work highlights the importance of explicit information control, ablation-backed design choices, and segment size analysis, with implications for scalable long-text understanding and broader applications in memory management and multimedia contexts.
Abstract
There is a growing interest in expanding the input capacity of language models (LMs) across various domains. However, simply increasing the context window does not guarantee robust performance across diverse long-input processing tasks, such as understanding extensive documents and extracting detailed information from lengthy and noisy data. In response, we introduce SEGMENT+, a general framework that enables LMs to handle extended inputs within limited context windows efficiently. SEGMENT+ utilizes structured notes and a filtering module to manage information flow, resulting in a system that is both controllable and interpretable. Our extensive experiments across various model sizes, focusing on long-document question-answering and Needle-in-a-Haystack tasks, demonstrate the effectiveness of SEGMENT+ in improving performance.
