Teaching Large Language Models Number-Focused Headline Generation With Key Element Rationales
Zhen Qian, Xiuzhen Zhang, Xiaofei Xu, Feng Xia
TL;DR
This paper tackles number-focused headline generation, where both textual quality and numerical accuracy are critical. It introduces TEN, a chain-of-thought framework that decomposes headlines into Topic, Entities, and Numerical reasoning, and uses a teacher-student distillation pipeline to automatically produce TEN rationales and headlines. A Direct Preference Optimization stage refines rationale quality to improve topic alignment and numerical correctness. Experiments on NumHG and XSum show that TEN substantially improves numerical accuracy while maintaining strong text quality, demonstrating the practical viability of rationale-augmented, numerically grounded headline generation. The work advances how LLMs can jointly reason about content and numbers in abstractive summarization, with implications for more reliable news generation and evaluation.
Abstract
Number-focused headline generation is a summarization task requiring both high textual quality and precise numerical accuracy, which poses a unique challenge for Large Language Models (LLMs). Existing studies in the literature focus only on either textual quality or numerical reasoning and thus are inadequate to address this challenge. In this paper, we propose a novel chain-of-thought framework for using rationales comprising key elements of the Topic, Entities, and Numerical reasoning (TEN) in news articles to enhance the capability for LLMs to generate topic-aligned high-quality texts with precise numerical accuracy. Specifically, a teacher LLM is employed to generate TEN rationales as supervision data, which are then used to teach and fine-tune a student LLM. Our approach teaches the student LLM automatic generation of rationales with enhanced capability for numerical reasoning and topic-aligned numerical headline generation. Experiments show that our approach achieves superior performance in both textual quality and numerical accuracy.
