Embarrassingly Simple Text Watermarks
Ryoma Sato, Yuki Takezawa, Han Bao, Kenta Niwa, Makoto Yamada
TL;DR
Easymark introduces a family of embarrassingly simple text watermarks that leverage Unicode codepoints with identical appearances to watermark text without altering meaning or degrading quality. It provides three variants—Whitemark for digital text, Variantmark for CJK text, and Printmark for printed text—along with a formal Impossibility Theorem that perfect watermarking is unattainable, underscoring the value of simple, robust approaches. Empirical results show Easymark outperforms prior watermarks in detection reliability while maintaining BLEU and perplexity, and enabling user-side deployment without reliance on LLM providers. The method offers a practical baseline for watermarking in real-world flows and a foundation for further theoretical and applied work in machine-origin tracing of text. It emphasizes that watermarks should be used as a starter or complementary tool, given the inherent vulnerability to erasure.
Abstract
We propose Easymark, a family of embarrassingly simple yet effective watermarks. Text watermarking is becoming increasingly important with the advent of Large Language Models (LLM). LLMs can generate texts that cannot be distinguished from human-written texts. This is a serious problem for the credibility of the text. Easymark is a simple yet effective solution to this problem. Easymark can inject a watermark without changing the meaning of the text at all while a validator can detect if a text was generated from a system that adopted Easymark or not with high credibility. Easymark is extremely easy to implement so that it only requires a few lines of code. Easymark does not require access to LLMs, so it can be implemented on the user-side when the LLM providers do not offer watermarked LLMs. In spite of its simplicity, it achieves higher detection accuracy and BLEU scores than the state-of-the-art text watermarking methods. We also prove the impossibility theorem of perfect watermarking, which is valuable in its own right. This theorem shows that no matter how sophisticated a watermark is, a malicious user could remove it from the text, which motivate us to use a simple watermark such as Easymark. We carry out experiments with LLM-generated texts and confirm that Easymark can be detected reliably without any degradation of BLEU and perplexity, and outperform state-of-the-art watermarks in terms of both quality and reliability.
