Table of Contents
Fetching ...

Overstating Attitudes, Ignoring Networks: LLM Biases in Simulating Misinformation Susceptibility

Eun Cheol Choi, Lindsay E. Young, Emilio Ferrara

TL;DR

The paper investigates whether large language models (LLMs) can faithfully simulate human misinformation susceptibility by conditioning synthetic respondents on real survey profiles across three domains. It employs a multi-domain dataset and a rigorous analytic pipeline, including distributional comparisons via $JSD$ and $EMD$, correlation analyses with ground truth using Spearman’s $ ho$, and Elastic Net modeling to compare predictor effects. The findings indicate that while LLMs capture broad distributional tendencies and correlate moderately with human responses, they exaggerate the link between belief and sharing, inflate $R^2$-like predictive power, and overemphasize attitudinal/behavioral predictors at the expense of network features. Explorations of model reasoning traces and training data with CoT prompts and OLMoTrace suggest these distortions arise from biases embedded in training data and reasoning processes. Consequently, LLM-based survey simulations are better suited for diagnosing systematic divergences from human judgment than substituting for human data, particularly when social networks and relational structure are central to the research question.

Abstract

Large language models (LLMs) are increasingly used as proxies for human judgment in computational social science, yet their ability to reproduce patterns of susceptibility to misinformation remains unclear. We test whether LLM-simulated survey respondents, prompted with participant profiles drawn from social survey data measuring network, demographic, attitudinal and behavioral features, can reproduce human patterns of misinformation belief and sharing. Using three online surveys as baselines, we evaluate whether LLM outputs match observed response distributions and recover feature-outcome associations present in the original survey data. LLM-generated responses capture broad distributional tendencies and show modest correlation with human responses, but consistently overstate the association between belief and sharing. Linear models fit to simulated responses exhibit substantially higher explained variance and place disproportionate weight on attitudinal and behavioral features, while largely ignoring personal network characteristics, relative to models fit to human responses. Analyses of model-generated reasoning and LLM training data suggest that these distortions reflect systematic biases in how misinformation-related concepts are represented. Our findings suggest that LLM-based survey simulations are better suited for diagnosing systematic divergences from human judgment than for substituting it.

Overstating Attitudes, Ignoring Networks: LLM Biases in Simulating Misinformation Susceptibility

TL;DR

The paper investigates whether large language models (LLMs) can faithfully simulate human misinformation susceptibility by conditioning synthetic respondents on real survey profiles across three domains. It employs a multi-domain dataset and a rigorous analytic pipeline, including distributional comparisons via and , correlation analyses with ground truth using Spearman’s , and Elastic Net modeling to compare predictor effects. The findings indicate that while LLMs capture broad distributional tendencies and correlate moderately with human responses, they exaggerate the link between belief and sharing, inflate -like predictive power, and overemphasize attitudinal/behavioral predictors at the expense of network features. Explorations of model reasoning traces and training data with CoT prompts and OLMoTrace suggest these distortions arise from biases embedded in training data and reasoning processes. Consequently, LLM-based survey simulations are better suited for diagnosing systematic divergences from human judgment than substituting for human data, particularly when social networks and relational structure are central to the research question.

Abstract

Large language models (LLMs) are increasingly used as proxies for human judgment in computational social science, yet their ability to reproduce patterns of susceptibility to misinformation remains unclear. We test whether LLM-simulated survey respondents, prompted with participant profiles drawn from social survey data measuring network, demographic, attitudinal and behavioral features, can reproduce human patterns of misinformation belief and sharing. Using three online surveys as baselines, we evaluate whether LLM outputs match observed response distributions and recover feature-outcome associations present in the original survey data. LLM-generated responses capture broad distributional tendencies and show modest correlation with human responses, but consistently overstate the association between belief and sharing. Linear models fit to simulated responses exhibit substantially higher explained variance and place disproportionate weight on attitudinal and behavioral features, while largely ignoring personal network characteristics, relative to models fit to human responses. Analyses of model-generated reasoning and LLM training data suggest that these distortions reflect systematic biases in how misinformation-related concepts are represented. Our findings suggest that LLM-based survey simulations are better suited for diagnosing systematic divergences from human judgment than for substituting it.
Paper Structure (33 sections, 25 figures, 9 tables)

This paper contains 33 sections, 25 figures, 9 tables.

Figures (25)

  • Figure 1: Study pipeline. Human survey participants provided demographic, attitudinal, behavioral, and personal network information through online surveys. Structured profiles containing the same information were provided to large language models (LLMs). Both human participants and LLMs evaluated the same set of false claims across domains and contexts, reporting belief and sharing intentions. The resulting outcome distributions and estimated predictor–outcome relationships were compared to assess the extent to which LLM-generated responses align with human response patterns and to identify systematic differences between simulated and empirical results.
  • Figure 2: Example prompt format used to simulate public health survey participants’ misinformation belief or sharing intention. Belief and sharing questions were queried separately using identical system prompts, participant profiles, and claims.
  • Figure 3: Relative explained variance after removing each feature block. Bars show the proportion of predictive power retained after block removal, $R^2_{\mathrm{block\ removed}}/R^2_{\mathrm{full\ model}}$ (in %), for simulated (Blue) and ground-truth (Orange) outcomes. All $R^2$ values are 5-fold cross-validated. $\sim$Net.: network block removed; $\sim$At./Be.: attitudinal/behavioral block removed; $\sim$Demo.: demographics block removed.
  • Figure 4: Standardized coefficients for simulation-by-feature interaction terms (Simulated $\times$ Variable) across domains and outcomes. Shown are the seven variables with the largest absolute interaction coefficients. Each coefficient captures the difference in the estimated feature–outcome association between simulated and ground-truth data (i.e., the change in slope for simulated relative to ground truth). The vertical line at zero indicates no difference between simulated and ground-truth estimates.
  • Figure 5: Frequency of the seven most frequently referred variables across reasoning chains by domain.
  • ...and 20 more figures