Interpreting Answers to Yes-No Questions in Dialogues from Multiple Domains
Zijie Wang, Farzana Rashid, Eduardo Blanco
TL;DR
This work tackles interpreting indirect yes-no answers in dialogues across multiple domains by first identifying yes-no questions with rule-based and BERT-based methods, then interpreting the answers using domain-adaptation techniques. It introduces three new benchmarks (Tennis, Movie, Air) with 300 indirect-answer instances each and a three-label scheme (Yes, No, Middle), learned via distant supervision and blended training. The study demonstrates that distant supervision combined with blending consistently improves cross-domain performance, achieving meaningful gains where synthetic data alone falls short, and shows that large-language models currently do not outperform a well-tuned RoBERTa-based approach for this task. The findings highlight practical pathways for adapting dialogue systems to interpret indirect answers while revealing limitations and directions for future work in data quality and multilingual extension.
Abstract
People often answer yes-no questions without explicitly saying yes, no, or similar polar keywords. Figuring out the meaning of indirect answers is challenging, even for large language models. In this paper, we investigate this problem working with dialogues from multiple domains. We present new benchmarks in three diverse domains: movie scripts, tennis interviews, and airline customer service. We present an approach grounded on distant supervision and blended training to quickly adapt to a new dialogue domain. Experimental results show that our approach is never detrimental and yields F1 improvements as high as 11-34%.
