Forecasting Open-Weight AI Model Growth on HuggingFace
Kushal Raj Bhandari, Pin-Yu Chen, Jianxi Gao
TL;DR
The paper tackles forecasting the growth of open-weight AI models on HuggingFace by translating a scientific citation model into model adoption dynamics. It introduces three parameters—immediacy, longevity, and relative fitness—and formalizes the dynamics with a cumulative count $c_i^t$ that follows a saturating trend, e.g., $c_i^t = m ( e^{ lambda_i Phi( (ln t - mu_i)/ sigma_i ) } - 1 )$, where $m$ is a normalization and $Phi$ is the standard normal CDF. Empirical results indicate most models are well captured by the framework; some models show abrupt surges, and organization-level factors shape adoption trajectories, with Meta, BAAI, and StabilityAI showing early peaks. The framework offers a quantitative tool for forecasting long-term diffusion and informing governance and strategic decisions, with potential for extension to include additional data.
Abstract
As the open-weight AI landscape continues to proliferate-with model development, significant investment, and user interest-it becomes increasingly important to predict which models will ultimately drive innovation and shape AI ecosystems. Building on parallels with citation dynamics in scientific literature, we propose a framework to quantify how an open-weight model's influence evolves. Specifically, we adapt the model introduced by Wang et al. for scientific citations, using three key parameters-immediacy, longevity, and relative fitness-to track the cumulative number of fine-tuned models of an open-weight model. Our findings reveal that this citation-style approach can effectively capture the diverse trajectories of open-weight model adoption, with most models fitting well and outliers indicating unique patterns or abrupt jumps in usage.
