LSTM based Conversation Models
Yi Luan, Yangfeng Ji, Mari Ostendorf
TL;DR
This work presents an LSTM-based framework for two-party conversations that jointly models local turn context, global topic context via LDA, and speaker roles. It introduces three model variants—R-Conv, LDA-Conv, and the combined R-LDA-Conv—where role information biases the output and topic context informs predictions. Evaluations on the Ubuntu Dialogue Corpus show that incorporating role and topic context yields substantial improvements in perplexity and response ranking, with R-LDA-Conv performing best. Qualitative analyses reveal clear role-specific generation patterns, supporting the practical value of role-aware dialogue systems in goal-directed settings.
Abstract
In this paper, we present a conversational model that incorporates both context and participant role for two-party conversations. Different architectures are explored for integrating participant role and context information into a Long Short-term Memory (LSTM) language model. The conversational model can function as a language model or a language generation model. Experiments on the Ubuntu Dialog Corpus show that our model can capture multiple turn interaction between participants. The proposed method outperforms a traditional LSTM model as measured by language model perplexity and response ranking. Generated responses show characteristic differences between the two participant roles.
