Query-Reduction Networks for Question Answering
Minjoon Seo, Sewon Min, Ali Farhadi, Hannaneh Hajishirzi
TL;DR
This work tackles multi-hop question answering by introducing Query-Reduction Network (QRN), a slimmed-down RNN unit that reduces the query as it processes a sequence of context sentences. QRN layers can be stacked and optionally run bidirectionally to capture local and global dependencies, and the model supports time-parallelization to speed up training and inference. Empirical results on the bAbI story-based QA and dialog datasets, as well as the DSTC2 dialog data, demonstrate state-of-the-art performance and robust ablations reveal the importance of multiple layers and gating mechanisms. The approach also provides interpretable intermediate queries and gate visualizations, offering insight into the reasoning flow and attention over facts while maintaining efficiency advantages over traditional RNNs.
Abstract
In this paper, we study the problem of question answering when reasoning over multiple facts is required. We propose Query-Reduction Network (QRN), a variant of Recurrent Neural Network (RNN) that effectively handles both short-term (local) and long-term (global) sequential dependencies to reason over multiple facts. QRN considers the context sentences as a sequence of state-changing triggers, and reduces the original query to a more informed query as it observes each trigger (context sentence) through time. Our experiments show that QRN produces the state-of-the-art results in bAbI QA and dialog tasks, and in a real goal-oriented dialog dataset. In addition, QRN formulation allows parallelization on RNN's time axis, saving an order of magnitude in time complexity for training and inference.
