Temporal patterns of preferences through Wikipedia editing in different languages
David André Villamil Carrillo, Yérali Gandica
TL;DR
This study analyses over a decade of editorial activity across eleven Wikipedia language editions, representing a diverse set of linguistic and cultural communities, and applies hierarchical clustering with dimensionality reduction via PCA and autoencoders to both static and temporal dimensions of collective behaviour.
Abstract
Temporal editing patterns on Wikipedia provide a unique computational lens to explore cultural dynamics across linguistic communities. This study analyses over a decade of editorial activity (2001-2010) across eleven Wikipedia language editions, representing a diverse set of linguistic and cultural communities. We apply hierarchical clustering with dimensionality reduction via PCA and autoencoders to both static (categorical) and temporal dimensions of collective behaviour. Results reveal that linguistic communities exhibit distinct circadian editing rhythms shaped by cultural and societal factors. Crucially, static and temporal clustering yield substantially different community groupings, demonstrating that time is an essential -- and often neglected -- dimension in cross-cultural computational analyses. These findings contribute to our understanding of how cultural identity manifests in large-scale digital trace data, and offer methodological implications for future studies using online platforms as proxies for collective cultural behaviour.
