Bot Wars Evolved: Orchestrating Competing LLMs in a Counterstrike Against Phone Scams
Nardine Basta, Conor Atkins, Dali Kaafar
TL;DR
Bot Wars addresses AI-augmented phone scams by simulating adversarial dialogues between scammer and victim agents; it proposes a formal model where strategy emerges from prompt-driven chain-of-thought reasoning rather than explicit optimization. The two-layer prompt architecture enables authentic victim personas, backed by a dataset of 3,200 dialogues and a triadic evaluation framework. Across models, GPT-4 delivers superior naturalness and persona realism, while Deepseek shows strength in sustaining long engagements, informing scalable anti-scam deployments. The work advances adversarial-dialogue design and offers practical guidance for disrupting scam economics through proactive, AI-driven scam baiting.
Abstract
We present "Bot Wars," a framework using Large Language Models (LLMs) scam-baiters to counter phone scams through simulated adversarial dialogues. Our key contribution is a formal foundation for strategy emergence through chain-of-thought reasoning without explicit optimization. Through a novel two-layer prompt architecture, our framework enables LLMs to craft demographically authentic victim personas while maintaining strategic coherence. We evaluate our approach using a dataset of 3,200 scam dialogues validated against 179 hours of human scam-baiting interactions, demonstrating its effectiveness in capturing complex adversarial dynamics. Our systematic evaluation through cognitive, quantitative, and content-specific metrics shows that GPT-4 excels in dialogue naturalness and persona authenticity, while Deepseek demonstrates superior engagement sustainability.
