Neural Belief Tracker: Data-Driven Dialogue State Tracking
Nikola Mrkšić, Diarmuid Ó Séaghdha, Tsung-Hsien Wen, Blaise Thomson, Steve Young
TL;DR
This work tackles the scalability bottleneck in dialogue state tracking by introducing the Neural Belief Tracker (NBT), a data-driven approach that reasons over pre-trained word vectors to jointly model user utterances and dialogue context without hand-crafted semantic lexicons. It presents two representation-learning variants (NBT-DNN and NBT-CNN) and a semantic decoding mechanism that directly evaluates candidate slot-value pairs within a given dialogue context. In experiments on DSTC2 and WOZ 2.0, NBT matches lexicon-based methods and outperforms them when such lexicons are unavailable, and its performance improves further with semantically specialized word vectors (Paragram-SL999). The results suggest that NBT can scale to larger, more complex dialogue domains and highlight the importance of vector-space semantics for belief tracking over noisy and varied language.
Abstract
One of the core components of modern spoken dialogue systems is the belief tracker, which estimates the user's goal at every step of the dialogue. However, most current approaches have difficulty scaling to larger, more complex dialogue domains. This is due to their dependency on either: a) Spoken Language Understanding models that require large amounts of annotated training data; or b) hand-crafted lexicons for capturing some of the linguistic variation in users' language. We propose a novel Neural Belief Tracking (NBT) framework which overcomes these problems by building on recent advances in representation learning. NBT models reason over pre-trained word vectors, learning to compose them into distributed representations of user utterances and dialogue context. Our evaluation on two datasets shows that this approach surpasses past limitations, matching the performance of state-of-the-art models which rely on hand-crafted semantic lexicons and outperforming them when such lexicons are not provided.
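
To make the idea of composing pre-trained word vectors into an utterance representation more concrete, here is a minimal sketch in Python. It is an illustration under stated assumptions, not the authors' implementation: the toy random vectors stand in for real pre-trained embeddings, the helper names (`toy_vectors`, `ngram_sum`, `candidate_vector`, `similarity`) are hypothetical, and cosine similarity is used in place of the trained layers that an NBT model would learn from data.

```python
import math
import random

DIM = 4  # toy embedding size (assumption); real pre-trained vectors are much larger


def toy_vectors(vocab, dim=DIM, seed=0):
    """Stand-in for pre-trained word vectors: fixed pseudo-random values per word."""
    rng = random.Random(seed)
    return {word: [rng.uniform(-1.0, 1.0) for _ in range(dim)] for word in vocab}


def ngram_sum(tokens, vectors, n):
    """Sum the concatenated word vectors of every n-gram in the utterance."""
    dim = len(next(iter(vectors.values())))
    total = [0.0] * (n * dim)
    for i in range(len(tokens) - n + 1):
        # Concatenate the vectors of the n consecutive words starting at i.
        concat = [x for tok in tokens[i:i + n] for x in vectors[tok]]
        total = [a + b for a, b in zip(total, concat)]
    return total


def candidate_vector(slot, value, vectors):
    """Represent a slot-value candidate as the sum of its two word vectors."""
    return [s + v for s, v in zip(vectors[slot], vectors[value])]


def similarity(a, b):
    """Cosine similarity: an untrained stand-in for a learned decision layer."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


vectors = toy_vectors(["i", "want", "cheap", "food", "price"])
utterance = ["i", "want", "cheap", "food"]

unigram_repr = ngram_sum(utterance, vectors, 1)  # length DIM
bigram_repr = ngram_sum(utterance, vectors, 2)   # length 2 * DIM
candidate = candidate_vector("price", "cheap", vectors)
score = similarity(unigram_repr, candidate)
```

With random vectors the resulting `score` carries no meaning. The sketch only shows the data flow: word vectors are composed into a fixed-size utterance representation, which is then compared against a representation of a candidate slot-value pair.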
