Developing Story: Case Studies of Generative AI's Use in Journalism
Natalie Grace Brigham, Chongjiu Gao, Tadayoshi Kohno, Franziska Roesner, Niloofar Mireshghallah
TL;DR
The paper investigates journalist use of large language models by analyzing WildChat conversations to identify prompting behaviors and the degree of human intervention before publishing machine-generated articles. It classifies tasks and stimuli, verifies outputs by matching to published articles, and uses ROUGE-L to quantify overlap, finding a median $0.62$ overlap and a prompt-to-publication span of about $1$ day. It also detects broader dissemination of LLM-generated content with GPTZero and reveals privacy risks from external and private stimuli, including sensitive interviews. The work argues for responsible AI guidelines, improved AI literacy for practitioners, and cross-disciplinary collaboration to steer journalism's co-evolution with AI technology.
Abstract
Journalists are among the many users of large language models (LLMs). To better understand the journalist-AI interactions, we conduct a study of LLM usage by two news agencies through browsing the WildChat dataset, identifying candidate interactions, and verifying them by matching to online published articles. Our analysis uncovers instances where journalists provide sensitive material such as confidential correspondence with sources or articles from other agencies to the LLM as stimuli and prompt it to generate articles, and publish these machine-generated articles with limited intervention (median output-publication ROUGE-L of 0.62). Based on our findings, we call for further research into what constitutes responsible use of AI, and the establishment of clear guidelines and best practices on using LLMs in a journalistic context.
