LLM-Assisted News Discovery in High-Volume Information Streams: A Case Study
Nick Hagar, Ethan Silver, Clare Spencer, Nicholas Diakopoulos
TL;DR
The paper investigates whether large language models (LLMs) can act as first-pass editors for high-volume information streams by encoding core journalistic news values into prompts. It develops a two-phase approach—prompt engineering to capture news values and an open-source pipeline to deploy and test the system on the beat of generative AI in newsrooms—validated through ground-truth annotations and a one-week field deployment. Results show strong item-level extraction ($F1=0.94$) and reasonable coarse newsworthiness alignment (up to $\ extpm 1$ accuracy at ~92%), but reveal limitations in beat-specific judgments and the persistence of editorial nuance that requires human review. The study advocates a hybrid human–AI workflow, offering practical recommendations for newsroom integration to augment editorial capacity while preserving core journalistic standards.
Abstract
Journalists face mounting challenges in monitoring ever-expanding digital information streams to identify newsworthy content. While traditional automation tools gather information at scale, they struggle with the editorial judgment needed to assess newsworthiness. This paper investigates whether large language models (LLMs) can serve as effective first-pass filters for journalistic monitoring. We develop a prompt-based approach encoding journalistic news values - timeliness, impact, controversy, and generalizability - into LLM instructions to extract and evaluate potential story leads. We validate our approach across multiple models against expert-annotated ground truth, then deploy a real-world monitoring pipeline that processes trade press articles daily. Our evaluation reveals strong performance in extracting relevant leads from source material ($F1=0.94$) and in coarse newsworthiness assessment ($\pm$1 accuracy up to 92%), but it consistently struggles with nuanced editorial judgments requiring beat expertise. The system proves most valuable as a hybrid tool combining automated monitoring with human review, successfully surfacing novel, high-value leads while filtering obvious noise. We conclude with practical recommendations for integrating LLM-powered monitoring into newsroom workflows that preserves editorial judgment while extending journalistic capacity.
