Leveraging LLM-based agents for social science research: insights from citation network simulations

Jiarui Ji; Runlin Lei; Xuchen Pan; Zhewei Wei; Hao Sun; Yankai Lin; Xu Chen; Yongzheng Yang; Yaliang Li; Bolin Ding; Ji-Rong Wen

TL;DR

The paper introduces CiteAgent, a framework that uses LLM-based agents to simulate the social and behavioral processes behind citation networks. The simulated networks reproduce key structural phenomena of real ones: power-law in-degree distributions, citational distortion, and shrinking diameter. On top of the simulation, the paper establishes two LLM-based research paradigms, LLM-SE (LLM-based Survey Experiment) and LLM-LE (LLM-based Laboratory Experiment), for hypothesis-driven analysis of citation decisions and network evolution; these are validated against real networks and extended through idealized social experiments. The work shows how LLM-driven simulations can test, refine, and challenge theories in science-of-science research, and it proposes new metrics such as the Referencing Preference Score to separate structural effects from intentional biases. Overall, CiteAgent offers a scalable, reproducible platform for counterfactual and empirical study of citation dynamics, with potential implications for real-world academic environments.

Abstract

Pre-trained on extensive web data, Large Language Models (LLMs) show potential to capture the logic and patterns inherent in human behavior, which makes them candidates for human-behavior simulation. However, the boundaries of LLM capabilities in social simulation remain unclear. To further explore the social attributes of LLMs, we introduce the CiteAgent framework, which generates citation networks by simulating human behavior with LLM-based agents. CiteAgent captures predominant phenomena of real-world citation networks, including power-law distribution, citational distortion, and shrinking diameter. Building on this realistic simulation, we establish two LLM-based research paradigms in social science: LLM-SE (LLM-based Survey Experiment) and LLM-LE (LLM-based Laboratory Experiment). These paradigms facilitate rigorous analyses of citation network phenomena, allowing us to validate and challenge existing theories. Additionally, we extend the research scope of traditional science-of-science studies through idealized social experiments, whose results provide valuable insights for real-world academic environments. Our work demonstrates the potential of LLMs for advancing science-of-science research within the social sciences.
