Domain Gating Ensemble Networks for AI-Generated Text Detection
Arihant Tripathi, Liam Dugan, Charis Gao, Maggie Huan, Emma Jin, Peter Zhang, David Zhang, Julia Zhao, Chris Callison-Burch
TL;DR
DoGEN introduces Domain Gating Ensemble Networks, which detect AI-generated text across diverse domains by using a domain router to gate a set of domain-specific detectors. It ensembles the top-$k$ experts, weighted by the router's domain probabilities, enabling robust performance in both in-domain and out-of-domain settings while keeping inference efficient. On the MAGE and RAID benchmarks, DoGEN achieves state-of-the-art in-domain results and strong out-of-domain generalization, outperforming single models and many ensembles with substantially fewer activated parameters. The approach is modular and extensible, allowing new experts to be added for structurally novel inputs, and the authors provide code and trained models for community use.
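The gating step summarized above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' released code. The names `softmax`, `gated_ensemble_score`, `router`, and `experts` are hypothetical. The sketch assumes the router returns one logit per domain, that each expert returns a probability that the text is machine-generated, and that the top-$k$ weights are renormalized to sum to one; the summary does not say whether DoGEN renormalizes.

```python
import math
from typing import Callable, List, Sequence

# Hypothetical interfaces, assumed for illustration only.
Router = Callable[[str], List[float]]   # text -> one logit per domain
Detector = Callable[[str], float]       # text -> P(machine-generated)


def softmax(logits: Sequence[float]) -> List[float]:
    """Turn router logits into domain probabilities."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def gated_ensemble_score(
    text: str, router: Router, experts: Sequence[Detector], k: int = 2
) -> float:
    """Weighted average of the top-k experts' scores.

    Weights are the router's domain probabilities, renormalized over
    the selected experts (an assumption of this sketch).
    """
    probs = softmax(router(text))
    # Indices of the k most probable domains.
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    # Only the selected experts are called on the text.
    return sum(probs[i] / norm * experts[i](text) for i in top)


# Toy stand-ins: a fixed router and constant-score experts.
toy_router: Router = lambda text: [1.0, 0.0, -100.0]
toy_experts: List[Detector] = [
    lambda text: 0.2,  # e.g. a "news" expert
    lambda text: 0.8,  # e.g. a "reviews" expert
    lambda text: 0.5,  # e.g. a "code" expert
]

score_top1 = gated_ensemble_score("some text", toy_router, toy_experts, k=1)
score_top2 = gated_ensemble_score("some text", toy_router, toy_experts, k=2)
```

Because only the $k$ selected experts are called, the number of models run per input stays fixed as more experts are added, which is one way to read the "fewer activated parameters" claim.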
Abstract
As state-of-the-art language models continue to improve, the need for robust detection of machine-generated text becomes increasingly critical. However, current state-of-the-art machine text detectors struggle to adapt to unseen domains and generative models. In this paper, we present DoGEN (Domain Gating Ensemble Networks), a technique that allows detectors to adapt to unseen domains by ensembling a set of domain expert detector models using weights from a domain classifier. We test DoGEN on a wide variety of domains from leading benchmarks and find that it achieves state-of-the-art performance on in-domain detection while outperforming models twice its size on out-of-domain detection. We release our code and trained models to assist in future research in domain-adaptive AI detection.
