CINS: Comprehensive Instruction for Few-shot Learning in Task-oriented Dialog Systems
Fei Mi, Yitong Li, Yasheng Wang, Xin Jiang, Qun Liu
TL;DR
The paper tackles the high labeling cost in task-oriented dialog (ToD) by introducing Comprehensive Instruction (CIns), which augments short prompts with task-specific Definition and Constraint to better leverage pre-trained language models. Implemented on a unified T5 Seq2Seq framework, CIns is instantiated for intent classification, dialog state tracking, and natural language generation, and evaluated under realistic few-shot conditions with small validation sets. Across three ToD tasks, CIns consistently outperforms standard prompting and vanilla fine-tuning, with substantial gains in very low-resource settings and competitive performance near full supervision in generation tasks. This work demonstrates that explicit, task-aware instructions can dramatically improve sample-efficient ToD systems and guides future research on instruction-based strategies for PLMs.
Abstract
As labeling cost for different modules in task-oriented dialog (ToD) systems is high, a major challenge in practice is to learn different tasks with the least amount of labeled data. Recently, prompting methods over pre-trained language models (PLMs) have shown promising results for few-shot learning in ToD. To better utilize the power of PLMs, this paper proposes Comprehensive Instruction (CINS) that exploits PLMs with extra task-specific instructions. We design a schema (definition, constraint, prompt) of instructions and their customized realizations for three important downstream tasks in ToD, i.e. intent classification, dialog state tracking, and natural language generation. A sequence-to-sequence model (T5) is adopted to solve these three tasks in a unified framework. Extensive experiments are conducted on these ToD tasks in realistic few-shot learning scenarios with small validation data. Empirical results demonstrate that the proposed CINS approach consistently improves techniques that finetune PLMs with raw input or short prompts.
