Generative AI as a Linguistic Equalizer in Global Science
Dragan Filimonovic, Christian Rutzer, Jeffrey Macher, Rolf Weder
TL;DR
The study addresses English dominance in global science by testing whether Generative AI (GenAI) acts as a linguistic equalizer. It analyzes 5.65 million English-language publications from non-English-speaking authors (2021–2024), flags GenAI-assisted work via field-specific lexical markers, and measures linguistic proximity to a U.S. English benchmark using SciBERT embeddings in an event-study Difference-in-Differences framework. The authors find post-2022 convergence of GenAI-assisted non-U.S. writings toward U.S. scientific English, strongest for linguistically distant countries and domestically authored teams, and across multiple fields. These findings imply GenAI can broaden global participation in science, with implications for editorial practice and research equity, while underscoring the need for transparent disclosure and equitable access to language tools.
Abstract
For decades, the dominance of English has created a substantial barrier in global science, disadvantaging non-native speakers. The recent rise of generative AI (GenAI) offers a potential technological response to this long-standing inequity. We provide the first large-scale evidence testing whether GenAI acts as a linguistic equalizer in global science. Drawing on 5.65 million scientific articles published from 2021 to 2024, we compare GenAI-assisted and non-assisted publications from authors in non-English-speaking countries. Using text embeddings derived from a pretrained large language model (SciBERT), we measure each publication's linguistic similarity to a benchmark of scientific writing from U.S.-based authors and track stylistic convergence over time. We find significant and growing convergence for GenAI-assisted publications after the release of ChatGPT in late 2022. The effect is strongest for domestic coauthor teams from countries linguistically distant from English. These findings provide large-scale evidence that GenAI is beginning to reshape global science communication by reducing language barriers in research.
