Oil & Water? Diffusion of AI Within and Across Scientific Fields
Eamon Duede, William Dolan, André Bauer, Ian Foster, Karim Lakhani
TL;DR
This paper tackles whether AI engagement is truly ubiquitous across science or confined to a subset of fields. It leverages a large-scale analysis of the Semantic Scholar Academic Graph (1985–2022), employing a keyword-based classifier to label AI-engaged papers, a Gini-based metric to quantify diffusion across publication venues, and SPECTER2 embeddings to study semantic trajectories relative to Computer Science AI papers. The results show AI-engaged publications rose ~13x across 20 fields and reached ~9% in 2022, with diffusion across venues accelerating within disciplines. However, embedding-based analyses reveal semantic tension: AI-engaged work becomes more CS-like but also diverges from non-AI-engaged work within fields, indicating oil-and-water dynamics where ubiquity grows but cross-field mixing remains imperfect. The study provides a foundational, quantitative view of AI's broad yet heterogeneous integration into science, with implications for policy, funding, and future research on AI's impact on disciplinary cohesion and scholarly practice.
Abstract
This study empirically investigates claims of the increasing ubiquity of artificial intelligence (AI) within roughly 80 million research publications across 20 diverse scientific fields, by examining the change in scholarly engagement with AI from 1985 through 2022. We observe exponential growth, with AI-engaged publications increasing approximately thirteenfold (13x) across all fields, suggesting a dramatic shift from niche to mainstream. Moreover, we provide the first empirical examination of the distribution of AI-engaged publications across publication venues within individual fields, with results that reveal a broadening of AI engagement within disciplines. While this broadening engagement suggests a move toward greater disciplinary integration in every field, increased ubiquity is associated with a semantic tension between AI-engaged research and more traditional disciplinary research. Through an analysis of tens of millions of document embeddings, we observe a complex interplay between AI-engaged and non-AI-engaged research within and across fields, suggesting that increasing ubiquity is something of an oil-and-water phenomenon -- AI-engaged work is spreading out over fields, but not mixing well with non-AI-engaged work.
