Simulating hashtag dynamics with networked groups of generative agents
Abha Jha, J. Hunter Priniski, Carolyn Steinle, Fred Morstatter
TL;DR
This paper investigates how networked narratives influence belief formation and coordination by simulating hashtag dynamics with networks of generative LLM agents. It introduces the Hashtag Matching game and benchmarks four diverse LLMs (DeepSeek-R1, LLaMA-3, Qwen2, Gemma2) against controlled human data (Fukushima) and real-world Twitter discourse (Philippines 2022 election) within a Watts-Strogatz network. The study uses entropy, narrative alignment via embedding, and unigram perplexity to measure group coherence and linguistic plausibility, uncovering that humans show stronger convergence than LLMs and that prompt design and social-context integration critically shape outcomes. The findings highlight domain dependence in LLM coordination, with Fukushima eliciting more narrative-conscious responses while the election domain yields greater lexical variability, underscoring the need for structured prompting and better alignment for realistic modeling of group narrative coordination and consensus formation.
Abstract
Networked environments shape how information embedded in narratives influences individual and group beliefs and behavior. This raises key questions about how group communication around narrative media impacts belief formation and how such mechanisms contribute to the emergence of consensus or polarization. Language data from generative agents offer insight into how naturalistic forms of narrative interactions (such as hashtag generation) evolve in response to social rewards within networked communication settings. To investigate this, we developed an agent-based modeling and simulation framework composed of networks of interacting Large Language Model (LLM) agents. We benchmarked the simulations of four state-of-the-art LLMs against human group behaviors observed in a prior network experiment (Study 1) and against naturally occurring hashtags from Twitter (Study 2). Quantitative metrics of network coherence (e.g., entropy of a group's responses) reveal that while LLMs can approximate human-like coherence in sanitized domains (Study 1's experimental data), effective integration of background knowledge and social context in more complex or politically sensitive narratives likely requires careful and structured prompting.
