Table of Contents
Fetching ...

Neutralizing the Narrative: AI-Powered Debiasing of Online News Articles

Chen Wei Kuo, Kevin Chu, Nouar AlDahoul, Hazem Ibrahim, Talal Rahwan, Yasir Zaki

TL;DR

The paper introduces an AI-driven framework to detect and mitigate bias in crime-related news using six large language models, with GPT-4o Mini achieving the strongest alignment to human judgments. It builds a large, longitudinal corpus of 30,000 articles across five politically diverse outlets (2013–2023) and evaluates bias at the paragraph level, validating detection with human annotators. A three-prompt debiasing scheme, guided by GPT-4o Mini, demonstrates significant bias reductions while preserving content, with temporal and geographic analyses linking bias to socio-political events. The work offers a scalable approach to bias mitigation in journalism and lays groundwork for real-world tools to support fairer news production and consumption, while noting limitations and avenues for future research.

Abstract

Bias in news reporting significantly impacts public perception, particularly regarding crime, politics, and societal issues. Traditional bias detection methods, predominantly reliant on human moderation, suffer from subjective interpretations and scalability constraints. Here, we introduce an AI-driven framework leveraging advanced large language models (LLMs), specifically GPT-4o, GPT-4o Mini, Gemini Pro, Gemini Flash, Llama 8B, and Llama 3B, to systematically identify and mitigate biases in news articles. To this end, we collect an extensive dataset consisting of over 30,000 crime-related articles from five politically diverse news sources spanning a decade (2013-2023). Our approach employs a two-stage methodology: (1) bias detection, where each LLM scores and justifies biased content at the paragraph level, validated through human evaluation for ground truth establishment, and (2) iterative debiasing using GPT-4o Mini, verified by both automated reassessment and human reviewers. Empirical results indicate GPT-4o Mini's superior accuracy in bias detection and effectiveness in debiasing. Furthermore, our analysis reveals temporal and geographical variations in media bias correlating with socio-political dynamics and real-world events. This study contributes to scalable computational methodologies for bias mitigation, promoting fairness and accountability in news reporting.

Neutralizing the Narrative: AI-Powered Debiasing of Online News Articles

TL;DR

The paper introduces an AI-driven framework to detect and mitigate bias in crime-related news using six large language models, with GPT-4o Mini achieving the strongest alignment to human judgments. It builds a large, longitudinal corpus of 30,000 articles across five politically diverse outlets (2013–2023) and evaluates bias at the paragraph level, validating detection with human annotators. A three-prompt debiasing scheme, guided by GPT-4o Mini, demonstrates significant bias reductions while preserving content, with temporal and geographic analyses linking bias to socio-political events. The work offers a scalable approach to bias mitigation in journalism and lays groundwork for real-world tools to support fairer news production and consumption, while noting limitations and avenues for future research.

Abstract

Bias in news reporting significantly impacts public perception, particularly regarding crime, politics, and societal issues. Traditional bias detection methods, predominantly reliant on human moderation, suffer from subjective interpretations and scalability constraints. Here, we introduce an AI-driven framework leveraging advanced large language models (LLMs), specifically GPT-4o, GPT-4o Mini, Gemini Pro, Gemini Flash, Llama 8B, and Llama 3B, to systematically identify and mitigate biases in news articles. To this end, we collect an extensive dataset consisting of over 30,000 crime-related articles from five politically diverse news sources spanning a decade (2013-2023). Our approach employs a two-stage methodology: (1) bias detection, where each LLM scores and justifies biased content at the paragraph level, validated through human evaluation for ground truth establishment, and (2) iterative debiasing using GPT-4o Mini, verified by both automated reassessment and human reviewers. Empirical results indicate GPT-4o Mini's superior accuracy in bias detection and effectiveness in debiasing. Furthermore, our analysis reveals temporal and geographical variations in media bias correlating with socio-political dynamics and real-world events. This study contributes to scalable computational methodologies for bias mitigation, promoting fairness and accountability in news reporting.

Paper Structure

This paper contains 16 sections, 3 figures, 6 tables.

Figures (3)

  • Figure 1: (A - E) The average bias score of articles from each publisher over time. (F) The average bias score for each publisher overall.
  • Figure 2: Biased coverage of crime in the United States. (A) A heatmap illustrating the proportion of articles covering a crime in a given state with biased language. (B) For the states of Missouri, Louisiana, Minnesota, New York, Georgia, and Ohio, the number of articles with biased language in each year, with information detailing relevant social issues corresponding to spikes in biased coverage.
  • Figure 3: Debiasing effectiveness (A, B) An illustrative example of a paragraph which includes biased language (A), and after the paragraph is modified to remove biased language (B). (C) The average bias score of articles before and after debiasing as judged by human annotators (black circles) and by GPT-4o-mini (orange circles). (D) The similarity of biased and debiased paragraphs as determined by humans (left y-axis, black circles), and their cosine similarity scores (right y-axis, orange circles).