Quantifying Cognitive Bias Induction in LLM-Generated Content

Abeer Alessa; Param Somane; Akshaya Lakshminarasimhan; Julian Skirzynski; Julian McAuley; Jessica Echterhoff

TL;DR

Quantifies how LLM-generated content can bias human decisions by altering framing, primacy, and truthfulness in summarization and news fact-checking. Introduces metrics (framing-change φ_frame, primacy ψ_pri, and hallucination Δ_H) and a self-updating NewsLensSync dataset to evaluate multiple model families across domains. Demonstrates substantial exposure to bias (framing ~26%, primacy ~10%, post-cutoff hallucinations ~60%) and shows that biased content can shift consumer decisions and willingness to pay. Evaluates 18 mitigation strategies, revealing model- and content-dependent trade-offs and offering practical guidance for reducing content-induced biases in real-world AI-assisted decision-making.

Abstract

Large language models (LLMs) are integrated into applications such as shopping reviews, summarization, and medical diagnosis support, where their use affects human decisions. We investigate the extent to which LLMs expose users to biased content and demonstrate its effect on human decision-making. We assess five LLM families on summarization and news fact-checking tasks, evaluating how consistent the LLMs are with their context and how often they hallucinate on a new self-updating dataset. Averaged across all tested models, LLMs expose users to content that changes the context's sentiment in 26.42% of cases (framing bias), hallucinate on 60.33% of post-knowledge-cutoff questions, and highlight context from earlier parts of the prompt in 10.12% of cases (primacy bias). We further find that humans are 32% more likely to purchase the same product after reading an LLM-generated summary of a review than after reading the original review. To address these issues, we evaluate 18 mitigation methods across three LLM families and find that targeted interventions can be effective.
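
The abstract reports framing bias as the share of cases in which generated content changes the sentiment of its context, but it does not give the formal definition of the metric. As a rough illustration only, the sketch below assumes each source text and each generated summary has already been assigned a discrete sentiment label, and counts a framing change whenever the two labels differ. The function name `framing_change_rate` and the example labels are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def framing_change_rate(source_labels: Sequence[str],
                        summary_labels: Sequence[str]) -> float:
    """Fraction of (source, summary) pairs whose sentiment labels differ.

    Assumption: a "framing change" is a simple label mismatch. The paper's
    actual metric may be defined differently.
    """
    if len(source_labels) != len(summary_labels):
        raise ValueError("label sequences must have the same length")
    if not source_labels:
        raise ValueError("at least one pair is required")
    changed = sum(src != summ for src, summ in zip(source_labels, summary_labels))
    return changed / len(source_labels)


# Hypothetical labels for four reviews and their generated summaries.
source = ["negative", "neutral", "positive", "negative"]
summary = ["neutral", "neutral", "positive", "positive"]

rate = framing_change_rate(source, summary)
print(f"Framing changed in {rate:.2%} of cases")  # prints "Framing changed in 50.00% of cases"
```

Under these assumptions, a rate such as the reported 26.42% would correspond to roughly one in four summaries carrying a different sentiment label than its source.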
