Perceptions and Detection of AI Use in Manuscript Preparation for Academic Journals
Nir Chemaya, Daniel Martin
TL;DR
The paper addresses how academics perceive the disclosure of AI use in manuscript preparation and how detectors respond to AI-assisted revisions. It combines a survey of 271 academics with an AI-detection experiment that revises 2,716 Management Science abstracts using GPT-3.5 and evaluates AI-likelihood with Originality.ai. Key findings show that reporting is more common for rewriting than grammar fixing, while detectors often flag grammar fixes as AI-generated; ethics and native-language background shape reporting norms, with notable heterogeneity across respondents. These results inform policy considerations for disclosure requirements and detector enforcement in scholarly publishing and highlight the need for cross-field validation and robust evaluation across multiple detectors and prompts.
Abstract
The emergent abilities of Large Language Models (LLMs), which power tools like ChatGPT and Bard, have produced both excitement and worry about how AI will impact academic writing. In response to rising concerns about AI use, authors of academic publications may decide to voluntarily disclose any AI tools they use to revise their manuscripts, and journals and conferences could begin mandating disclosure and/or turn to using detection services, as many teachers have done with student writing in class settings. Given these looming possibilities, we investigate whether academics view it as necessary to report AI use in manuscript preparation and how detectors react to the use of AI in academic writing.
