Training-free LLM-generated Text Detection by Mining Token Probability Sequences
Yihuai Xu, Yongwei Wang, Yifei Bi, Huangsen Cao, Zhouhan Lin, Yu Zhao, Fei Wu
TL;DR
This work tackles the challenge of cross-domain LLM-generated text detection by proposing Lastde, a training-free detector that reframes token probability sequences (TPS) as time-series data. It introduces multiscale diversity entropy (MDE) to capture local TPS dynamics and combines it with global log-likelihood to form a robust score, with Lastde++ offering a fast-sampling variant for real-time use. Across six datasets and multiple source/proxy-model configurations, Lastde and Lastde++ achieve state-of-the-art performance among training-free detectors, demonstrating strong robustness to paraphrasing attacks and cross-lingual scenarios while maintaining competitive efficiency. The approach has practical implications for scalable, model-agnostic detection of machine-generated text in diverse real-world settings, including cross-language and cross-model contexts.
Abstract
Large language models (LLMs) have demonstrated remarkable capabilities in generating high-quality texts across diverse domains. However, the potential misuse of LLMs has raised significant concerns, underscoring the urgent need for reliable detection of LLM-generated texts. Conventional training-based detectors often struggle with generalization, particularly in cross-domain and cross-model scenarios. In contrast, training-free methods, which focus on inherent discrepancies through carefully designed statistical features, offer improved generalization and interpretability. Despite this, existing training-free detection methods typically rely on global text sequence statistics, neglecting the modeling of local discriminative features, thereby limiting their detection efficacy. In this work, we introduce a novel training-free detector, termed \textbf{Lastde} that synergizes local and global statistics for enhanced detection. For the first time, we introduce time series analysis to LLM-generated text detection, capturing the temporal dynamics of token probability sequences. By integrating these local statistics with global ones, our detector reveals significant disparities between human and LLM-generated texts. We also propose an efficient alternative, \textbf{Lastde++} to enable real-time detection. Extensive experiments on six datasets involving cross-domain, cross-model, and cross-lingual detection scenarios, under both white-box and black-box settings, demonstrated that our method consistently achieves state-of-the-art performance. Furthermore, our approach exhibits greater robustness against paraphrasing attacks compared to existing baseline methods.
