Detection of Fake Generated Scientific Abstracts
Panagiotis C. Theocharopoulos, Panagiotis Anagnostou, Anastasia Tsoukala, Spiros V. Georgakopoulos, Sotiris K. Tasoulis, Vassilis P. Plagianakos
TL;DR
This work addresses the challenge of distinguishing AI-generated scientific abstracts from human-written ones by constructing a benchmark dataset from the CORD-19 corpus using GPT-3 to generate abstracts from titles. It systematically evaluates multiple text representations—TF-IDF, NER-based features, Word2Vec embeddings, and contextualized embeddings from BERT—across classical ML and deep-learning classifiers. The strongest result comes from an LSTM with Word2Vec, achieving an accuracy of $98.7\%$ and an AUC of $0.987$, with Word2Vec embeddings outperforming BERT in this domain. The study also analyzes misclassifications, noting that title complexity and vocabulary differences influence detection, and suggests expanding the dataset with updated models to enhance generalization across domains and languages, with ethical considerations in mind.
Abstract
The widespread adoption of Large Language Models and publicly available ChatGPT has marked a significant turning point in the integration of Artificial Intelligence into people's everyday lives. The academic community has taken notice of these technological advancements and has expressed concerns regarding the difficulty of discriminating between what is real and what is artificially generated. Thus, researchers have been working on developing effective systems to identify machine-generated text. In this study, we utilize the GPT-3 model to generate scientific paper abstracts through Artificial Intelligence and explore various text representation methods when combined with Machine Learning models with the aim of identifying machine-written text. We analyze the models' performance and address several research questions that rise during the analysis of the results. By conducting this research, we shed light on the capabilities and limitations of Artificial Intelligence generated text.
