Synthetic Social Media Influence Experimentation via an Agentic Reinforcement Learning Large Language Model Bot
Bailu Jin, Weisi Guo
TL;DR
The paper tackles the challenge of ethically studying public opinion dynamics on social platforms by building a simulated environment that couples agentic reinforcement learning with large language models. It trains a target agent to maximize followers using Q-learning, while LLMs convert RL decisions into topic-relevant posts. Key contributions include a topic-aware ABM framework, empirical evidence that constraining actions and enabling self-observation yield more stable opinion-leader formation, and analysis across multiple observability and model-settings scenarios. The work has practical significance for understanding influence mechanisms, informing policy and marketing strategies, and providing a controllable testbed to explore misinformation risks and detection in online ecosystems.
Abstract
Understanding the dynamics of public opinion evolution on online social platforms is crucial for understanding influence mechanisms and the provenance of information. Traditional influence analysis is typically divided into qualitative assessments of personal attributes (e.g., psychology of influence) and quantitative evaluations of influence power mechanisms (e.g., social network analysis). One challenge faced by researchers is the ethics of real-world experimentation and the lack of social influence data. In this study, we provide a novel simulated environment that combines agentic intelligence with Large Language Models (LLMs) to test topic-specific influence mechanisms ethically. Our framework contains agents that generate posts, form opinions on specific topics, and socially follow/unfollow each other based on the outcome of discussions. This simulation allows researchers to observe the evolution of how opinions form and how influence leaders emerge. Using our own framework, we design an opinion leader that utilizes Reinforcement Learning (RL) to adapt its linguistic interaction with the community to maximize its influence and followers over time. Our current findings reveal that constraining the action space and incorporating self-observation are key factors for achieving stable and consistent opinion leader generation for topic-specific influence. This demonstrates the simulation framework's capacity to create agents that can adapt to complex and unpredictable social dynamics. The work is important in an age of increasing online influence on social attitudes and emerging technologies.
