Sub-exponential Growth of New Words and Names Online: A Piecewise Power-Law Model
Hayafumi Watanabe
TL;DR
This paper introduces a piecewise power-law model to describe complex growth curves of online word diffusion, showing sub-exponential growth is a prevalent pattern across large-scale blog data. By nondimensionalizing and applying a Box–Cox-like transformation, the authors reveal that a single shape parameter α_i controls diffusion form, while a piecewise extension captures multi-stage dynamics and jumps. They connect α_i to topic inwardness through a simple infection-like diffusion model, yielding α_i = 1 − γ_i/Q, thereby offering a sociophysical interpretation of diffusion shapes and linking micro-behavior to macro patterns. The results provide a practical framework for describing, comparing, and interpreting diverse diffusion curves, with implications for understanding how topic appeal and network structure shape information spread online.
Abstract
The diffusion of ideas and language in society has conventionally been described by S-shaped models, such as the logistic curve. However, the role of sub-exponential growth -- a slower-than-exponential pattern known in epidemiology -- has been largely overlooked in broader social phenomena. Here, we present a piecewise power-law model to characterize complex growth curves with a few parameters. We systematically analyzed a large-scale dataset of approximately one billion Japanese blog articles linked to Wikipedia vocabulary, and observed consistent patterns in web search trend data (English, Spanish, and Japanese). Our analysis of 2,963 items, selected for reliable estimation (e.g., sufficient duration/peak, monotonic growth), reveals that 1,625 (55%) diffusion patterns without abrupt level shifts were adequately described by one or two segments. For single-segment curves, we found that (i) the mode of the shape parameter $α$ was near 0.5, indicating prevalent sub-exponential growth; (ii) the peak diffusion scale is primarily determined by the growth rate $R$, with minor contributions from $α$ or the duration $T$; and (iii) $α$ showed a tendency to vary with the nature of the topic, being smaller for niche/local topics and larger for widely shared ones. Furthermore, a micro-behavioral model of outward (stranger) vs. inward (community) contact suggests that $α$ can be interpreted as an index of the preference for outward-oriented communication. These findings suggest that sub-exponential growth is a common pattern of social diffusion, and our model provides a practical framework for consistently describing, comparing, and interpreting complex and diverse growth curves.
