A Persuasive Approach to Combating Misinformation
Safwan Hossain, Andjela Mladenovic, Yiling Chen, Gauthier Gidel
TL;DR
The paper tackles misinformation spread on social platforms by leveraging information design through Bayesian persuasion under imperfect predictions. It formulates a noisy persuasion model where a platform signals predicted post-state $\hat{\theta}$ via a confusion matrix $Q^\Theta$, optimizing the signaling scheme through a linear program and analyzing how classifier quality affects platform utility. The authors establish that, under symmetric $Q^\Theta$, the optimal utility is non-decreasing with respect to the convex hull of the confusion-row set and is Lipschitz continuous in $Q^\Theta$. They also introduce a stateful performative model describing how signaling changes user behavior over time, proving convergence to a stable posterior and characterizing conditions for stability. Experiments on synthetic data demonstrate meaningful reductions in misinformation sharing across single-round and performative settings, highlighting the practical potential of information design as a non-censorship intervention.
Abstract
Bayesian Persuasion is proposed as a tool for social media platforms to combat the spread of misinformation. Since platforms can use machine learning to predict the popularity and misinformation features of to-be-shared posts, and users are largely motivated to share popular content, platforms can strategically signal this informational advantage to change user beliefs and persuade them not to share misinformation. We characterize the optimal signaling scheme with imperfect predictions as a linear program and give sufficient and necessary conditions on the classifier to ensure optimal platform utility is non-decreasing and continuous. Next, this interaction is considered under a performative model, wherein platform intervention affects the user's future behaviour. The convergence and stability of optimal signaling under this performative process are fully characterized. Lastly, we experimentally validate that our approach significantly reduces misinformation in both the single round and performative setting and discuss the broader scope of using information design to combat misinformation.
