The Importance of Suppressing Domain Style in Authorship Analysis
Sebastian Bischoff, Niklas Deckers, Marcel Schliebs, Ben Thies, Matthias Hagen, Efstathios Stamatatos, Benno Stein, Martin Potthast
TL;DR
The paper investigates how domain cues contaminate authorship analysis when using standard stylometric features like character trigrams. It introduces a domain-adversarial, domain-invariant style encoder and compares it to heuristic domain-suppression methods on a huge fanfiction-based corpus designed to test cross-domain attribution. The experiments show that trigram models suffer large performance drops under domain swapping (up to ~55 percentage points), while domain-adversarial training or heuristic masking keep losses to around 3–4 percentage points. The findings emphasize disentangling author style from domain style as essential for robust cross-domain authorship analysis, and suggest domain-adversarial approaches as scalable solutions, albeit with substantial data requirements.
Abstract
The prerequisite of many approaches to authorship analysis is a representation of writing style. But despite decades of research, it still remains unclear to what extent commonly used and widely accepted representations like character trigram frequencies actually represent an author's writing style, in contrast to more domain-specific style components or even topic. We address this shortcoming for the first time in a novel experimental setup of fixed authors but swapped domains between training and testing. With this setup, we reveal that approaches using character trigram features are highly susceptible to favor domain information when applied without attention to domains, suffering drops of up to 55.4 percentage points in classification accuracy under domain swapping. We further propose a new remedy based on domain-adversarial learning and compare it to ones from the literature based on heuristic rules. Both can work well, reducing accuracy losses under domain swapping to 3.6% and 3.9%, respectively.
