SOLOIST: Building Task Bots at Scale with Transfer Learning and Machine Teaching
Baolin Peng, Chunyuan Li, Jinchao Li, Shahin Shayandeh, Lars Liden, Jianfeng Gao
TL;DR
The paper introduces SOLOIST, a Transformer-based autoregressive framework that unifies the entire task-oriented dialog pipeline into a single model and leverages task-grounded pre-training on heterogeneous corpora. It then enables rapid domain adaptation to new tasks via few-shot fine-tuning and machine teaching, significantly reducing labeling costs. Empirical results on CamRest676 and MultiWOZ show state-of-the-art end-to-end performance and strong few-shot capabilities, with substantial gains from machine teaching. The work demonstrates a scalable path for building many task bots at scale by transferring grounding and dialog-management skills learned during pre-training to new domains.
Abstract
We present a new method SOLOIST that uses transfer learning and machine teaching to build task bots at scale. We parameterize classical modular task-oriented dialog systems using a Transformer-based auto-regressive language model, which subsumes different dialog modules into a single neural model. We pre-train, on heterogeneous dialog corpora, a task-grounded response generation model, which can generate dialog responses grounded in user goals and real-world knowledge for task completion. The pre-trained model can be efficiently adapted to accomplish new tasks with a handful of task-specific dialogs via machine teaching, where training samples are generated by human teachers interacting with the system. Experiments show that (i) SOLOIST creates new state-of-the-art on well-studied task-oriented dialog benchmarks, including CamRest676 and MultiWOZ; (ii) in the few-shot fine-tuning settings, SOLOIST significantly outperforms existing methods, and (iii) the use of machine teaching substantially reduces the labeling cost of fine-tuning. The pre-trained models and codes are available at https://aka.ms/soloist.
