Extracting Affect Aggregates from Longitudinal Social Media Data with Temporal Adapters for Large Language Models
Georg Ahnert, Max Pellert, David Garcia, Markus Strohmaier
TL;DR
The paper introduces Temporal Adapters to create temporally aligned LLMs for longitudinal analysis of social media, enabling extraction of affect aggregates and public attitudes from weekly data. By fine-tuning on weekly timelines from a British Twitter panel and prompting with established survey instruments, the method yields longitudinal macroscopes of emotions and attitudes that correlate with YouGov survey data (Britain's Mood, PANAS-X, NHS, Boris Johnson, government attitudes). The approach is flexible, data-efficient, and robust across seeds and prompts, contrasting with traditional classifiers that require labeled data or dictionaries. It demonstrates internal validity via synthetically mixed data and extends to attitudinal aggregates, providing a scalable tool for timely, population-representative insights in crises and beyond.
Abstract
This paper proposes temporally aligned Large Language Models (LLMs) as a tool for longitudinal analysis of social media data. We fine-tune Temporal Adapters for Llama 3 8B on full timelines from a panel of British Twitter users, and extract longitudinal aggregates of emotions and attitudes with established questionnaires. We focus our analysis on the beginning of the COVID-19 pandemic that had a strong impact on public opinion and collective emotions. We validate our estimates against representative British survey data and find strong positive, significant correlations for several collective emotions. The obtained estimates are robust across multiple training seeds and prompt formulations, and in line with collective emotions extracted using a traditional classification model trained on labeled data. We demonstrate the flexibility of our method on questions of public opinion for which no pre-trained classifier is available. Our work extends the analysis of affect in LLMs to a longitudinal setting through Temporal Adapters. It enables flexible, new approaches towards the longitudinal analysis of social media data.
