Shifting Long-Context LLMs Research from Input to Output
Yuhao Wu, Yushi Bai, Zhiqing Hu, Shangqing Tu, Ming Shan Hee, Juanzi Li, Roy Ka-Wei Lee
TL;DR
The paper argues for shifting NLP research from primarily improving long-input processing to advancing long-output generation in LLMs, defining long-output LLMs as models optimized for producing coherent, extended text. It surveys current progress across data, benchmarks, and models, highlighting real-world demand in domains like healthcare and law, and identifies key challenges in data alignment, benchmark coverage, and training/inference efficiency. The authors discuss real-world applications such as creative writing and long-chain-of-thought tasks, and outline opportunities including real-world data collection, hybrid data strategies, improved evaluation, and architectural innovations to reduce latency and memory demands. They also present alternative viewpoints, acknowledging that long-output generation may not always be necessary and that improving input handling could be a precursor to long-form generation. Overall, the paper calls for targeted foundational research to develop robust, scalable long-output LLMs with practical impact across industries and tasks requiring extensive and coherent text generation.
Abstract
Recent advancements in long-context Large Language Models (LLMs) have primarily concentrated on processing extended input contexts, resulting in significant strides in long-context comprehension. However, the equally critical aspect of generating long-form outputs has received comparatively less attention. This paper advocates for a paradigm shift in NLP research toward addressing the challenges of long-output generation. Tasks such as novel writing, long-term planning, and complex reasoning require models to understand extensive contexts and produce coherent, contextually rich, and logically consistent extended text. These demands highlight a critical gap in current LLM capabilities. We underscore the importance of this under-explored domain and call for focused efforts to develop foundational LLMs tailored for generating high-quality, long-form outputs, which hold immense potential for real-world applications.
