Exposing Pink Slime Journalism: Linguistic Signatures and Robust Detection Against LLM-Generated Threats
Sadat Shahriar, Navid Ayoobi, Arjun Mukherjee, Mostafa Musharrat, Sai Vishnu Vamsi
TL;DR
Pink Slime journalism employs template-driven local news that mimics legitimacy, challenging detection. The authors perform fine-grained linguistic analysis to identify simple, less lexical-rich patterns and then develop detectors using handcrafted features and transformer fine-tuning. They show LLM-based rewriting can substantially degrade detection (up to 40% F1 loss) and propose a continual-learning framework to adapt to such drift, achieving meaningful robustness gains (up to ~27%) with limited forgetting. The work provides actionable linguistic cues and a scalable defense against evolving AI-generated misinformation in local news ecosystems.
Abstract
The local news landscape, a vital source of reliable information for 28 million Americans, faces a growing threat from Pink Slime Journalism, a low-quality, auto-generated articles that mimic legitimate local reporting. Detecting these deceptive articles requires a fine-grained analysis of their linguistic, stylistic, and lexical characteristics. In this work, we conduct a comprehensive study to uncover the distinguishing patterns of Pink Slime content and propose detection strategies based on these insights. Beyond traditional generation methods, we highlight a new adversarial vector: modifications through large language models (LLMs). Our findings reveal that even consumer-accessible LLMs can significantly undermine existing detection systems, reducing their performance by up to 40% in F1-score. To counter this threat, we introduce a robust learning framework specifically designed to resist LLM-based adversarial attacks and adapt to the evolving landscape of automated pink slime journalism, and showed and improvement by up to 27%.
