Unraveling Interwoven Roles of Large Language Models in Authorship Privacy: Obfuscation, Mimicking, and Verification

Tuc Nguyen; Yifan Hu; Thai Le

TL;DR

The paper presents the first unified framework to study the interdependent roles of authorship obfuscation, mimicking, and verification in the context of large language models. It formalizes isolation, pairwise, and triplet-wise interdependencies, and evaluates them across multiple LLMs, datasets, and metadata conditions. Key findings show obfuscation generally disrupts author signals, while mimicking can partially recover stylistic traits over time; demographic metadata enhances verification and impersonation capabilities, increasing privacy risk for well-known individuals. The results underscore the dual-use nature of LLMs, emphasizing the need for robust detection, privacy-aware tooling, and transparent handling of metadata in authorship tasks.

Abstract

Recent advancements in large language models (LLMs) have been fueled by large-scale training corpora drawn from diverse sources such as websites, news articles, and books. These datasets often contain explicit user information, such as names and addresses, that LLMs may unintentionally reproduce in their generated outputs. Beyond such explicit content, LLMs can also leak identity-revealing cues through implicit signals such as distinctive writing styles, raising significant concerns about authorship privacy. There are three major automated tasks in authorship privacy: authorship obfuscation (AO), authorship mimicking (AM), and authorship verification (AV). Prior research has studied AO, AM, and AV independently. However, their interplay remains underexplored, leaving a major research gap, especially in the era of LLMs, which are profoundly shaping how we curate and share user-generated content, and in which the distinction between machine-generated and human-authored text is increasingly blurred. This work presents the first unified framework for analyzing the dynamic relationships among LLM-enabled AO, AM, and AV in the context of authorship privacy. We quantify how they interact with each other to transform human-authored text, examining effects both at a single point in time and iteratively over time. We also examine the role of demographic metadata, such as gender and academic background, in modulating their performance, inter-task dynamics, and privacy risks. All source code will be publicly available.
