Quantifying the Risks of Tool-assisted Rephrasing to Linguistic Diversity
Mengying Wang, Andreas Spitz
TL;DR
This paper measures the semantic and vocabulary change enacted by the use of rephrasing tools on a multi-domain corpus of human-generated text to quantify the risk of language change when adopted by a large user base.
Abstract
Writing assistants and large language models see widespread use in the creation of text content. While their effectiveness for individual users has been evaluated in the literature, little is known about their proclivity to change language or reduce its richness when adopted by a large user base. In this paper, we take a first step towards quantifying this risk by measuring the semantic and vocabulary change enacted by the use of rephrasing tools on a multi-domain corpus of human-generated text.
