AGENT-X: Adaptive Guideline-based Expert Network for Threshold-free AI-generated teXt detection
Jiatao Li, Mao Ye, Cheng Peng, Xunjian Yin, Xiaojun Wan
TL;DR
AGENT-X introduces a threshold-free, zero-shot AI-generated text detector built from a collaborative multi-agent system that leverages three linguistically grounded guideline dimensions: semantic, stylistic, and structural. A router selects relevant guidelines, base agents produce calibrated decisions with explicit reasoning, and a meta agent aggregates results into an interpretable final verdict. The framework emphasizes confidence steering to avoid external threshold tuning, achieving superior accuracy and generalization across diverse datasets and language models. This approach enhances interpretability and robustness in real-world detection tasks, with strong implications for combating misinformation and ensuring source authenticity.
Abstract
Existing AI-generated text detection methods heavily depend on large annotated datasets and external threshold tuning, restricting interpretability, adaptability, and zero-shot effectiveness. To address these limitations, we propose AGENT-X, a zero-shot multi-agent framework informed by classical rhetoric and systemic functional linguistics. Specifically, we organize detection guidelines into semantic, stylistic, and structural dimensions, each independently evaluated by specialized linguistic agents that provide explicit reasoning and robust calibrated confidence via semantic steering. A meta agent integrates these assessments through confidence-aware aggregation, enabling threshold-free, interpretable classification. Additionally, an adaptive Mixture-of-Agent router dynamically selects guidelines based on inferred textual characteristics. Experiments on diverse datasets demonstrate that AGENT-X substantially surpasses state-of-the-art supervised and zero-shot approaches in accuracy, interpretability, and generalization.
