Assessing the Impact of Conspiracy Theories Using Large Language Models

Bohan Jiang; Dawei Li; Zhen Tan; Xinyi Zhou; Ashwin Rao; Kristina Lerman; H. Russell Bernard; Huan Liu

Assessing the Impact of Conspiracy Theories Using Large Language Models

Bohan Jiang, Dawei Li, Zhen Tan, Xinyi Zhou, Ashwin Rao, Kristina Lerman, H. Russell Bernard, Huan Liu

TL;DR

This work investigates using large language models to automate the impact assessment of conspiracy theories. It builds a CT impact dataset based on a YouGov survey and designs human-like prompting strategies that emulate fast/slow thinking, comparative/scoring judgments, and single/multi-agent reasoning. Through extensive experiments across eight LLMs, the study shows that multi-step reasoning and multi-agent debating yield greater accuracy, while prompting biases like position, wording, and verbosity can substantially affect results. It also discusses mitigation strategies and limitations, offering guidance for scalable, human-aligned CT impact evaluation in practice.

Abstract

Measuring the relative impact of CTs is important for prioritizing responses and allocating resources effectively, especially during crises. However, assessing the actual impact of CTs on the public poses unique challenges. It requires not only the collection of CT-specific knowledge but also diverse information from social, psychological, and cultural dimensions. Recent advancements in large language models (LLMs) suggest their potential utility in this context, not only due to their extensive knowledge from large training corpora but also because they can be harnessed for complex reasoning. In this work, we develop datasets of popular CTs with human-annotated impacts. Borrowing insights from human impact assessment processes, we then design tailored strategies to leverage LLMs for performing human-like CT impact assessments. Through rigorous experiments, we textit{discover that an impact assessment mode using multi-step reasoning to analyze more CT-related evidence critically produces accurate results; and most LLMs demonstrate strong bias, such as assigning higher impacts to CTs presented earlier in the prompt, while generating less accurate impact assessments for emotionally charged and verbose CTs.

Assessing the Impact of Conspiracy Theories Using Large Language Models

TL;DR

Abstract

Assessing the Impact of Conspiracy Theories Using Large Language Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (3)