Template-assisted Contrastive Learning of Task-oriented Dialogue Sentence Embeddings
Minsik Oh, Jiwei Li, Guoyin Wang
TL;DR
This work addresses the challenge of learning effective sentence embeddings for task-oriented dialogue by leveraging token-level template information. It introduces TaDSE, which combines template-based data augmentation, utterance-template pairwise contrastive learning, and a semantic compression inference to align utterances with their semantic templates. Across five benchmark dialogue datasets, TaDSE yields significant improvements over state-of-the-art unsupervised methods, with analyses linking semantic compression to uniformity and alignment in the embedding space. The approach highlights the value of incorporating template and slot-level knowledge into dialogue representations and provides new tools for diagnosing and understanding embedding structure.
Abstract
Learning high quality sentence embeddings from dialogues has drawn increasing attentions as it is essential to solve a variety of dialogue-oriented tasks with low annotation cost. Annotating and gathering utterance relationships in conversations are difficult, while token-level annotations, \eg, entities, slots and templates, are much easier to obtain. Other sentence embedding methods are usually sentence-level self-supervised frameworks and cannot utilize token-level extra knowledge. We introduce Template-aware Dialogue Sentence Embedding (TaDSE), a novel augmentation method that utilizes template information to learn utterance embeddings via self-supervised contrastive learning framework. We further enhance the effect with a synthetically augmented dataset that diversifies utterance-template association, in which slot-filling is a preliminary step. We evaluate TaDSE performance on five downstream benchmark dialogue datasets. The experiment results show that TaDSE achieves significant improvements over previous SOTA methods for dialogue. We further introduce a novel analytic instrument of semantic compression test, for which we discover a correlation with uniformity and alignment. Our code will be released upon acceptance.
