Disrupting Networks: Amplifying Social Dissensus via Opinion Perturbation and Large Language Models
Erica Coppolillo, Giuseppe Manco
TL;DR
The paper investigates how targeted content injection can intentionally disrupt online opinion dynamics by extending the Friedkin-Johnsen framework to include negative influence and node-level susceptibility. It establishes theoretical guarantees showing basic FJ cannot increase disruption at equilibrium, while the enriched model enables configurations where disruption is maximized, and further enhanced by perturbing innate opinions. To operationalize these insights, the authors design an RL-guided framework that fine-tunes a Large Language Model to generate disruption-oriented text, and validate the approach on synthetic and real-world networks, with the generated content approaching the theoretical disruption upper bounds. The work raises important ethical considerations for content moderation and regulatory policies and outlines future directions including dynamic networks, multi-agent interactions, and defenses against adversarial AI-driven influence.
Abstract
We study how targeted content injection can strategically disrupt social networks. Using the Friedkin-Johnsen (FJ) model, we utilize a measure of social dissensus and show that (i) simple FJ variants cannot significantly perturb the network, (ii) extending the model enables valid graph structures where disruption at equilibrium exceeds the initial state, and (iii) altering an individual's inherent opinion can maximize disruption. Building on these insights, we design a reinforcement learning framework to fine-tune a Large Language Model (LLM) for generating disruption-oriented text. Experiments on synthetic and real-world data confirm that tuned LLMs can approach theoretical disruption limits. Our findings raise important considerations for content moderation, adversarial information campaigns, and generative model regulation.
