The Rapid Growth of AI Foundation Model Usage in Science
Ana Trišović, Alex Fogelson, Janakan Sivaloganathan, Neil Thompson
TL;DR
This study delivers the first large-scale, paper-level analysis of foundation-model adoption in science, revealing near-exponential growth since 2015 and field-specific disparities. It builds the FutureTech AI in Science Database and uses a multi-faceted pipeline to identify, classify, and disambiguate model references in papers, with Bayesian corrections for false positives. The findings show vision models still predominate in adoption, open-weight models dominate, and scientists increasingly customize models, with larger models associated with higher-impact journals and more citations. The work highlights openness, access, and scale as key levers for AI-enabled science and policy.
Abstract
We present the first large-scale analysis of AI foundation model usage in science - not just citations or keywords. We find that adoption has grown rapidly, at nearly-exponential rates, with the highest uptake in Linguistics, Computer Science, and Engineering. Vision models are the most used foundation models in science, although language models' share is growing. Open-weight models dominate. As AI builders increase the parameter counts of their models, scientists have followed suit but at a much slower rate: in 2013, the median foundation model built was 7.7x larger than the median one adopted in science, by 2024 this had jumped to 26x. We also present suggestive evidence that scientists' use of these smaller models may be limiting them from getting the full benefits of AI-enabled science, as papers that use larger models appear in higher-impact journals and accrue more citations.
