Table of Contents
Fetching ...

MisinfoEval: Generative AI in the Era of "Alternative Facts"

Saadia Gabriel, Liang Lyu, James Siderius, Marzyeh Ghassemi, Jacob Andreas, Asu Ozdaglar

TL;DR

A framework for generating and comprehensively evaluating large language model (LLM) based misinformation interventions and finds that users favor more personalized interventions when making decisions about news reliability and users shown personalized interventions have significantly higher accuracy at identifying misinformation.

Abstract

The spread of misinformation on social media platforms threatens democratic processes, contributes to massive economic losses, and endangers public health. Many efforts to address misinformation focus on a knowledge deficit model and propose interventions for improving users' critical thinking through access to facts. Such efforts are often hampered by challenges with scalability, and by platform users' personal biases. The emergence of generative AI presents promising opportunities for countering misinformation at scale across ideological barriers. In this paper, we introduce a framework (MisinfoEval) for generating and comprehensively evaluating large language model (LLM) based misinformation interventions. We present (1) an experiment with a simulated social media environment to measure effectiveness of misinformation interventions, and (2) a second experiment with personalized explanations tailored to the demographics and beliefs of users with the goal of countering misinformation by appealing to their pre-existing values. Our findings confirm that LLM-based interventions are highly effective at correcting user behavior (improving overall user accuracy at reliability labeling by up to 41.72%). Furthermore, we find that users favor more personalized interventions when making decisions about news reliability and users shown personalized interventions have significantly higher accuracy at identifying misinformation.

MisinfoEval: Generative AI in the Era of "Alternative Facts"

TL;DR

A framework for generating and comprehensively evaluating large language model (LLM) based misinformation interventions and finds that users favor more personalized interventions when making decisions about news reliability and users shown personalized interventions have significantly higher accuracy at identifying misinformation.

Abstract

The spread of misinformation on social media platforms threatens democratic processes, contributes to massive economic losses, and endangers public health. Many efforts to address misinformation focus on a knowledge deficit model and propose interventions for improving users' critical thinking through access to facts. Such efforts are often hampered by challenges with scalability, and by platform users' personal biases. The emergence of generative AI presents promising opportunities for countering misinformation at scale across ideological barriers. In this paper, we introduce a framework (MisinfoEval) for generating and comprehensively evaluating large language model (LLM) based misinformation interventions. We present (1) an experiment with a simulated social media environment to measure effectiveness of misinformation interventions, and (2) a second experiment with personalized explanations tailored to the demographics and beliefs of users with the goal of countering misinformation by appealing to their pre-existing values. Our findings confirm that LLM-based interventions are highly effective at correcting user behavior (improving overall user accuracy at reliability labeling by up to 41.72%). Furthermore, we find that users favor more personalized interventions when making decisions about news reliability and users shown personalized interventions have significantly higher accuracy at identifying misinformation.

Paper Structure

This paper contains 35 sections, 4 figures, 4 tables.

Figures (4)

  • Figure 1: Examples of a post in the simulated newsfeed (left), and a pop-up intervention with a veracity label (right).
  • Figure 2: Effects of personalization on self-reported helpfulness of explanations (left) and user accuracy (right).
  • Figure 3: Analysis of all GPT-4 explanations. We provide a breakdown of explanation quality: (1) reasoning accuracy, (2) whether explanations use commonsense reasoning that should be innate to humans, (3) whether explanations require learned background knowledge from specific news events and (4) whether explanations require domain knowledge (e.g. scientific facts or knowledge of legal processes).
  • Figure 4: Full Amazon Mechanical Turk Instructions.