Efficient and Robust Question Answering from Minimal Context over Documents
Sewon Min, Victor Zhong, Richard Socher, Caiming Xiong
TL;DR
The paper tackles scalable QA over large document collections and robustness to adversarial inputs by showing that most questions can be answered from a small set of sentences. It introduces a dynamic sentence selector that picks a minimal, question-specific context and feeds it to a competitive QA model (DCN+ or S-Reader), achieving substantial training ($up to ${15}\times$) and inference ($up to $13\times$) speedups with accuracy comparable to or better than full-document QA. Across SQuAD, NewsQA, TriviaQA, and SQuAD-Open (and adversarial variants), the approach often reaches or surpasses state-of-the-art performance and exhibits improved robustness to adversarial perturbations. The method combines a shared encoder, attention-based sentence scoring, and three training techniques (weight transfer, data modification, score normalization) to enable per-question dynamic sentence selection, offering a practical and scalable solution for real-world QA over large corpora.
Abstract
Neural models for question answering (QA) over documents have achieved significant performance improvements. Although effective, these models do not scale to large corpora due to their complex modeling of interactions between the document and the question. Moreover, recent work has shown that such models are sensitive to adversarial inputs. In this paper, we study the minimal context required to answer the question, and find that most questions in existing datasets can be answered with a small set of sentences. Inspired by this observation, we propose a simple sentence selector to select the minimal set of sentences to feed into the QA model. Our overall system achieves significant reductions in training (up to 15 times) and inference times (up to 13 times), with accuracy comparable to or better than the state-of-the-art on SQuAD, NewsQA, TriviaQA and SQuAD-Open. Furthermore, our experimental results and analyses show that our approach is more robust to adversarial inputs.
