AI-Salesman: Towards Reliable Large Language Model Driven Telemarketing
Qingyu Zhang, Chunlei Xin, Xuanang Chen, Yaojie Lu, Hongyu Lin, Xianpei Han, Le Sun, Qing Ye, Qianlong Xie, Xingxing Wang
TL;DR
The paper tackles goal-driven persuasive telemarketing with large language models by introducing AI-Salesman, which combines a Bayesian-supervised reinforcement learning objective (via GRPO) with a Dynamic Outline-Guided Agent (DOGA) for turn-level strategy. It additionally releases TeleSalesCorpus, the first real-world-grounded telemarketing dialogue dataset, and proposes a comprehensive offline evaluation framework that uses an LLM (GPT-4) as a judge across six sales capabilities and seven qualitative metrics. Empirical results show that direct reinforcement learning without imitation (GRPO w/o SFT) achieves stronger overall performance than SFT-based baselines, while DOGA substantially improves performance on complex, strategic turns. The work demonstrates practical impact by outperforming baselines in automatic metrics and human evaluations, confirming the value of reasoning-aware training and dynamic, library-driven guidance for reliable, customized persuasive dialogue in real-world telemarketing.
Abstract
Goal-driven persuasive dialogue, exemplified by applications like telemarketing, requires sophisticated multi-turn planning and strict factual faithfulness, which remains a significant challenge for even state-of-the-art Large Language Models (LLMs). A lack of task-specific data often limits previous works, and direct LLM application suffers from strategic brittleness and factual hallucination. In this paper, we first construct and release TeleSalesCorpus, the first real-world-grounded dialogue dataset for this domain. We then propose AI-Salesman, a novel framework featuring a dual-stage architecture. For the training stage, we design a Bayesian-supervised reinforcement learning algorithm that learns robust sales strategies from noisy dialogues. For the inference stage, we introduce the Dynamic Outline-Guided Agent (DOGA), which leverages a pre-built script library to provide dynamic, turn-by-turn strategic guidance. Moreover, we design a comprehensive evaluation framework that combines fine-grained metrics for key sales skills with the LLM-as-a-Judge paradigm. Experimental results demonstrate that our proposed AI-Salesman significantly outperforms baseline models in both automatic metrics and comprehensive human evaluations, showcasing its effectiveness in complex persuasive scenarios.
