Fractional stochastic model of citation dynamics with memory and volatility
Keisuke Okamura
TL;DR
This work proposes a fractional stochastic model for citation dynamics in which latent attention $X(t)$ evolves via a memoryful SDE driven by fractional Brownian motion: $dX(t)=X(t)[\alpha(t) dt+\beta dB_{H}(t)]$, with solution $X(t)=X_{0}\exp[A(t)+\beta B_{H}(t)]$ and $C(T)\approx\int_0^T X(t)dt$. A key theoretical result is $\mathrm{SD}[R(t)]=\beta t^{H}$ for $R(t)=\ln(X(t)/X_{0})$, producing the empirical $t^{H}$ law observed in citation fluctuations and linking memory ($H$) and volatility ($\beta$) to the shape of the citation distribution. The model predicts a log-normal distribution for antipersistent regimes ($H<\tfrac{1}{2}$) and a heavy-tailed, power-law-like distribution in persistent regimes ($H> frac{1}{2}$), providing a unified explanation for the log-normal/high-citation tails seen in empirical data. Application to arXiv data yields $H\approx0.13$ (antipersistent) and moderate $\beta$, with fractal dimension $D\approx2-H\approx1.87$, supporting a fractal, memory-rich structure in attention dynamics and suggesting broad applicability to other attention-driven networks.
Abstract
Understanding the statistical laws governing citation dynamics remains a fundamental challenge in network theory and the science of science. Citation networks typically exhibit in-degree distributions well approximated by log-normal distributions yet also display power-law behaviour in the high-citation regime -- an apparent contradiction lacking a unified explanation. Here we identify a previously unrecognised phenomenon: the variance of the logarithm of citation counts per unit time follows a power law with respect to time ($t$) since publication, scaling as $t^{H}$, with $H$ constant. This discovery introduces a new challenge while simultaneously offering a crucial clue to resolving this discrepancy. We develop a stochastic model in which latent attention to publications evolves through a memory-driven process with cumulative advantage, modelled as fractional Brownian motion with Hurst parameter $H$ and volatility. We show that antipersistent fluctuations in attention ($H < 1/2$) yield log-normal citation distributions, whereas persistent attention dynamics ($H > 1/2$) favour heavy-tailed power laws, thus resolving the log-normal--power-law contradiction. Numerical simulations confirm both the $t^{H}$ law and the transition between regimes. Empirical analysis of arXiv e-prints indicates that the latent attention process is intrinsically antipersistent ($H \approx 0.13$). By linking memory effects and stochastic fluctuations in attention to broader network dynamics, our findings provide a unifying framework for understanding the evolution of collective attention in science and other attention-driven processes.
