UNO-DST: Leveraging Unlabelled Data in Zero-Shot Dialogue State Tracking
Chuang Li, Yan Zhang, Min-Yen Kan, Haizhou Li
TL;DR
UNO-DST tackles zero-shot dialogue state tracking by leveraging unlabeled target-domain data through a dual-task framework: a main task that predicts slot values and an auxiliary task that generates slot types. These tasks are trained jointly on source domains and then augmented with cycle-consistent self-training on the target domain to select high-quality samples, effectively turning zero-shot DST into a few-shot setting. The approach yields consistent improvements on MultiWOZ and SGD datasets, with additional gains realized when extending to large language models such as ChatGPT via conversational and in-context learning. The method is model-agnostic and highlights the potential for automatic label creation, unseen/new slot-type generation, and improved data efficiency in DST systems.
Abstract
Previous zero-shot dialogue state tracking (DST) methods only apply transfer learning, ignoring unlabelled data in the target domain. We transform zero-shot DST into few-shot DST by utilising such unlabelled data via joint and self-training methods. Our method incorporates auxiliary tasks that generate slot types as inverse prompts for main tasks, creating slot values during joint training. Cycle consistency between these two tasks enables the generation and selection of quality samples in unknown target domains for subsequent fine-tuning. This approach also facilitates automatic label creation, thereby optimizing the training and fine-tuning of DST models. We demonstrate this method's effectiveness on general language models in zero-shot scenarios, improving average joint goal accuracy by 8% across all domains in MultiWOZ.
