Table of Contents
Fetching ...

Large Language Models Penetration in Scholarly Writing and Peer Review

Li Zhou, Ruijie Zhang, Xunlian Dai, Daniel Hershcovich, Haizhou Li

TL;DR

A framework with two components is proposed: a curated dataset of human- and LLM-generated content across scholarly writing and peer review for multi-perspective evaluation, and a tool for assessing LLM penetration using rule-based metrics and model-based detectors for multi-dimensional evaluation.

Abstract

While the widespread use of Large Language Models (LLMs) brings convenience, it also raises concerns about the credibility of academic research and scholarly processes. To better understand these dynamics, we evaluate the penetration of LLMs across academic workflows from multiple perspectives and dimensions, providing compelling evidence of their growing influence. We propose a framework with two components: \texttt{ScholarLens}, a curated dataset of human- and LLM-generated content across scholarly writing and peer review for multi-perspective evaluation, and \texttt{LLMetrica}, a tool for assessing LLM penetration using rule-based metrics and model-based detectors for multi-dimensional evaluation. Our experiments demonstrate the effectiveness of \texttt{LLMetrica}, revealing the increasing role of LLMs in scholarly processes. These findings emphasize the need for transparency, accountability, and ethical practices in LLM usage to maintain academic credibility.

Large Language Models Penetration in Scholarly Writing and Peer Review

TL;DR

A framework with two components is proposed: a curated dataset of human- and LLM-generated content across scholarly writing and peer review for multi-perspective evaluation, and a tool for assessing LLM penetration using rule-based metrics and model-based detectors for multi-dimensional evaluation.

Abstract

While the widespread use of Large Language Models (LLMs) brings convenience, it also raises concerns about the credibility of academic research and scholarly processes. To better understand these dynamics, we evaluate the penetration of LLMs across academic workflows from multiple perspectives and dimensions, providing compelling evidence of their growing influence. We propose a framework with two components: \texttt{ScholarLens}, a curated dataset of human- and LLM-generated content across scholarly writing and peer review for multi-perspective evaluation, and \texttt{LLMetrica}, a tool for assessing LLM penetration using rule-based metrics and model-based detectors for multi-dimensional evaluation. Our experiments demonstrate the effectiveness of \texttt{LLMetrica}, revealing the increasing role of LLMs in scholarly processes. These findings emphasize the need for transparency, accountability, and ethical practices in LLM usage to maintain academic credibility.

Paper Structure

This paper contains 41 sections, 12 equations, 18 figures, 13 tables.

Figures (18)

  • Figure 1: Pipeline Overview of Our Work: (1) ScholarLens Curation (§\ref{['sec: ScholarLens']}): A designed dataset used to evaluate the effectiveness of metrics and train detection models; (2) LLMetrica framework (§\ref{['sec: LLMetria']}): The proposed method for distinguishing human-written from LLM-generated texts; (3) Experiments (§\ref{['sec:Experiments']}): Evaluating the effectiveness of LLMetrica and applying it to real-world data to assess LLM penetration rates in scholarly writing and peer process. Symbolically, $\mathrm{P}=\left\{ \mathrm{T},\mathrm{A},\mathrm{C},\mathrm{R},\mathrm{MR} \right\}$ represents a research paper, where $\mathrm{T}$, $\mathrm{A}$ and $\mathrm{C}$ denote its title, abstract and main content, $\mathrm{R}=\left\{ r_i \right\}$ represents the individual reviews, and $\mathrm{MR}$ denotes the meta-review.
  • Figure 2: Comparison of Features for ALL Data Types
  • Figure 3: Feature preference of LLM-generated text: ↑ indicates an increase across all LLMs, ↓ indicates a decrease, → indicates inconsistency. Bold denotes consistent trends across all data types.
  • Figure 5: Meta-Review
  • Figure 6: Review
  • ...and 13 more figures