Applying Large Language Models to Characterize Public Narratives
Elinor Poole-Dayan, Daniel T Kessler, Hannah Chiou, Margaret Hughes, Emily S Lin, Marshall Ganz, Deb Roy
TL;DR
The paper introduces a codebook-guided LLM framework to automate Public Narrative (PN) annotation, validates near-human performance against PN experts on eight narratives across 14 codes, and scales the approach to 22 narratives and two political speeches. It formalizes the PN structure (Story of Self/Us/Now and Challenge–Choice–Outcome) and empirically analyzes code frequencies, correlations, and extrapolations to real-world political rhetoric. Key findings show that LLMs perform well on explicit, frequent codes but struggle with subjective elements, highlighting the importance of a structured rubric and prompting strategy. The work demonstrates the potential of LLM-assisted PN annotation for scalable civic storytelling analysis with implications for leadership research and computational social science.
Abstract
Public Narratives (PNs) are key tools for leadership development and civic mobilization, yet their systematic analysis remains challenging due to their subjective interpretation and the high cost of expert annotation. In this work, we propose a novel computational framework that leverages large language models (LLMs) to automate the qualitative annotation of public narratives. Using a codebook we co-developed with subject-matter experts, we evaluate LLM performance against that of expert annotators. Our work reveals that LLMs can achieve near-human-expert performance, achieving an average F1 score of 0.80 across 8 narratives and 14 codes. We then extend our analysis to empirically explore how PN framework elements manifest across a larger dataset of 22 stories. Lastly, we extrapolate our analysis to a set of political speeches, establishing a novel lens in which to analyze political rhetoric in civic spaces. This study demonstrates the potential of LLM-assisted annotation for scalable narrative analysis and highlights key limitations and directions for future research in computational civic storytelling.
