Behavioral Analysis of Information Salience in Large Language Models
Jan Trienes, Jörg Schlötterer, Junyi Jessy Li, Christin Seifert
TL;DR
This work tackles the unclear notion of information salience learned by large language models during summarization. It introduces an explainable framework that uses length-controlled summarization as a behavioral probe and questions-under-discussion (QUDs) to derive an answerability-based salience proxy, enabling interpretable content-selection analysis across 13 models and four datasets. The study finds that LLMs harbor a nuanced, hierarchical salience that is consistent across model families and sizes, yet inaccessible to direct introspection and only weakly aligned with human salience. These results have practical implications for prompting, evaluation, and training strategies, suggesting that model-driven salience signals may require explicit alignment or training signals to match human expectations. The framework opens avenues for diagnosing content selection in synthesis tasks and exploring salience emergence during model training across domains.
Abstract
Large Language Models (LLMs) excel at text summarization, a task that requires models to select content based on its importance. However, the exact notion of salience that LLMs have internalized remains unclear. To bridge this gap, we introduce an explainable framework to systematically derive and investigate information salience in LLMs through their summarization behavior. Using length-controlled summarization as a behavioral probe into the content selection process, and tracing the answerability of Questions Under Discussion throughout, we derive a proxy for how models prioritize information. Our experiments on 13 models across four datasets reveal that LLMs have a nuanced, hierarchical notion of salience, generally consistent across model families and sizes. While models show highly consistent behavior and hence salience patterns, this notion of salience cannot be accessed through introspection, and only weakly correlates with human perceptions of information salience.
