Exploring temporal dynamics in digital trace data: mining user-sequences for communication research
Yangliu Fan, Jakob Ohme, Lion Wedel
TL;DR
This paper addresses the challenge of leveraging hyper longitudinal digital trace data to study temporal dynamics in communication. It proposes a user sequences framework that preserves fine grained timestamps and applies six analytical approaches to a large data donation dataset spanning four platforms. The case study demonstrates how sequence analysis, event history analysis, hidden Markov models, network analysis, process mining, and language based models reveal patterns such as platform switching, activity motifs, and latent states, while also highlighting methodological challenges like data volume, alignment, and generalizability. The findings suggest that digital trace data offer unprecedented granularity for theory building and cross platform research, with language based embeddings showing particular promise given sufficient high quality data.
Abstract
Communication is commonly considered a process that is dynamically situated in a temporal context. However, there remains a disconnection between such theoretical dynamicality and the non-dynamical character of communication scholars' preferred methodologies. In this paper, we argue for a new research framework that uses computational approaches to leverage the fine-grained timestamps recorded in digital trace data. In particular, we propose to maintain the hyper-longitudinal information in the trace data and analyze time-evolving 'user-sequences,' which provide rich information about user activity with high temporal resolution. To illustrate our proposed framework, we present a case study that applied six approaches (e.g., sequence analysis, process mining, and language-based models) to real-world user-sequences containing 1,262,775 timestamped traces from 309 unique users, gathered via data donations. Overall, our study suggests a conceptual reorientation towards a better understanding of the temporal dimension in communication processes, resting on the exploding supply of digital trace data and the technical advances in analytical approaches.
