Scientific productivity as a random walk
Sam Zhang, Nicholas LaBerge, Samuel F. Way, Daniel B. Larremore, Aaron Clauset
TL;DR
The paper addresses why the canonical early-career rise and late-career decline in scientific productivity appears in averages despite wide heterogeneity in individual trajectories. It shows that modeling productivity as a discrete-time random walk with career-stage–dependent variance—where early careers have higher variance than later ones—reproduces the canonical trajectory and captures much of the observed variability. A simplified two-stage model demonstrates the mechanism ($\alpha_1>\alpha_2$) by which the aggregate pattern emerges, while a full model with inferred four career stages fits empirical distributions of $q_t$ and $\delta_t$ and closely matches aggregate and several individual-level features, albeit with some residual gaps likely due to non-Markovian factors. These findings highlight the role of contingent factors and variance dynamics in shaping scientific productivity and suggest policy avenues to steer incentives and opportunities while acknowledging substantial randomness in research trajectories.
Abstract
The expectation that scientific productivity follows regular patterns over a career underpins many scholarly evaluations, including hiring, promotion and tenure, awards, and grant funding. However, recent studies of individual productivity patterns reveal a puzzle: on the one hand, the average number of papers published per year robustly follows the "canonical trajectory" of a rapid rise to an early peak followed by a gradual decline, but on the other hand, only about 20% of individual productivity trajectories follow this pattern. We resolve this puzzle by modeling scientific productivity as a parameterized random walk, showing that the canonical pattern can be explained as a decrease in the variance in changes to productivity in the early-to-mid career. By empirically characterizing the variable structure of 2,085 productivity trajectories of computer science faculty at 205 PhD-granting institutions, spanning 29,119 publications over 1980--2016, we (i) discover remarkably simple patterns in both early-career and year-to-year changes to productivity, and (ii) show that a random walk model of productivity both reproduces the canonical trajectory in the average productivity and captures much of the diversity of individual-level trajectories. These results highlight the fundamental role of a panoply of contingent factors in shaping individual scientific productivity, opening up new avenues for characterizing how systemic incentives and opportunities can be directed for aggregate effect.
