Quantifying the Vulnerabilities of the Online Public Square to Adversarial Manipulation Tactics
Bao Tran Truong, Xiaodan Lou, Alessandro Flammini, Filippo Menczer
TL;DR
Online social platforms function as a public square but are vulnerable to coordinated inauthentic manipulation. The authors develop SimSoM, an agent-based diffusion model on an empirical follower network to quantify how bad actors spreading low-quality content affect the overall information quality, parameterizing tactics as infiltration $\gamma$, deception $\phi$, flooding $\theta$, and targeting. The results show that infiltration is the most damaging tactic, reducing average quality by more than half at moderate infiltration, and that hubs (high-degree nodes) amplify the impact; flooding and deception can further degrade quality, while targeting influentials can yield counterintuitive effects by localizing harm. The study also reconstructs exposure vs. reshare cascades, showing low-quality content often achieves larger exposure under high appeal and infiltration, underscoring the need for platform defenses such as stronger bot detection, diffusion caps, and user-accuracy nudges to bolster resilience. Overall, the work provides a quantitative framework to evaluate manipulation tactics and design mitigation strategies for safeguarding the online public square.
Abstract
Social media, seen by some as the modern public square, is vulnerable to manipulation. By controlling inauthentic accounts impersonating humans, malicious actors can amplify disinformation within target communities. The consequences of such operations are difficult to evaluate due to the challenges posed by collecting data and carrying out ethical experiments that would influence online communities. Here we use a social media model that simulates information diffusion in an empirical network to quantify the impacts of several adversarial manipulation tactics on the quality of content. We find that the presence of influential accounts, a hallmark of social media, exacerbates the vulnerabilities of online communities to manipulation. Among the explored tactics that bad actors can employ, infiltrating a community is the most likely to make low-quality content go viral. Such harm can be further compounded by inauthentic agents flooding the network with low-quality, yet appealing content, but is mitigated when bad actors focus on specific targets, such as influential or vulnerable individuals. These insights suggest countermeasures that platforms could employ to increase the resilience of social media users to manipulation.
