The Emergence of Social Science of Large Language Models
Xiao Jia, Zhanzhan Zhao
TL;DR
This work addresses the fragmentation in the social science of large language models by constructing a reproducible, data-driven taxonomy. It combines a PRISMA-guided literature review of $270$ studies with an unsupervised machine-learning pipeline (SBERT embeddings, $K$-means clustering, UMAP, and LDA) to reveal a robust three-domain structure: LLM as Social Minds, LLM Societies, and LLM--Human Interactions. Study 2 then benchmarks this machine-derived map against an expert taxonomy, finding strong alignment (Macro-F1 ≈ $0.954$, $NMI ≈ 0.811$, $ARI ≈ 0.867$) and demonstrating that the unsupervised partitions reflect text-derived distinctions. The resulting framework provides a scalable, cross-domain lens for cumulative progress, enabling targeted theory development, methodological standardization, and living updates as the field evolves. Practically, the taxonomy guides interpretation across micro-, meso-, and human–AI interaction levels, offering a structured path toward responsible governance and integration of LLMs into social systems.
Abstract
The social science of large language models (LLMs) examines how these systems evoke mind attributions, interact with one another, and transform human activity and institutions. We conducted a systematic review of 270 studies, combining text embeddings, unsupervised clustering and topic modeling to build a computational taxonomy. Three domains emerge organically across the reviewed literature. LLM as Social Minds examines whether and when models display behaviors that elicit attributions of cognition, morality and bias, while addressing challenges such as test leakage and surface cues. LLM Societies examines multi-agent settings where interaction protocols, architectures and mechanism design shape coordination, norms, institutions and collective epistemic processes. LLM-Human Interactions examines how LLMs reshape tasks, learning, trust, work and governance, and how risks arise at the human-AI interface. This taxonomy provides a reproducible map of a fragmented field, clarifies evidentiary standards across levels of analysis, and highlights opportunities for cumulative progress in the social science of artificial intelligence.
