Table of Contents
Fetching ...

Natural Language Comprehension with the EpiReader

Adam Trischler, Zheng Ye, Xingdi Yuan, Kaheer Suleman

TL;DR

The paper presents EpiReader, a two-stage, end-to-end neural architecture for machine reading comprehension that first extracts a small set of candidate answers and then tests hypotheses against the text using textual entailment. By explicitly separating candidate selection from hypothesis testing and training them jointly, EpiReader achieves state-of-the-art results on the CNN/Daily Mail and CBT datasets. The approach highlights the value of explicit hypothesis testing in neural QA and demonstrates that a reasoner can correct errors made by a comparatively lightweight extractor. This work suggests a fruitful direction of combining fast candidate generation with deeper entailment-based verification for robust machine comprehension.

Abstract

We present the EpiReader, a novel model for machine comprehension of text. Machine comprehension of unstructured, real-world text is a major research goal for natural language processing. Current tests of machine comprehension pose questions whose answers can be inferred from some supporting text, and evaluate a model's response to the questions. The EpiReader is an end-to-end neural model comprising two components: the first component proposes a small set of candidate answers after comparing a question to its supporting text, and the second component formulates hypotheses using the proposed candidates and the question, then reranks the hypotheses based on their estimated concordance with the supporting text. We present experiments demonstrating that the EpiReader sets a new state-of-the-art on the CNN and Children's Book Test machine comprehension benchmarks, outperforming previous neural models by a significant margin.

Natural Language Comprehension with the EpiReader

TL;DR

The paper presents EpiReader, a two-stage, end-to-end neural architecture for machine reading comprehension that first extracts a small set of candidate answers and then tests hypotheses against the text using textual entailment. By explicitly separating candidate selection from hypothesis testing and training them jointly, EpiReader achieves state-of-the-art results on the CNN/Daily Mail and CBT datasets. The approach highlights the value of explicit hypothesis testing in neural QA and demonstrates that a reasoner can correct errors made by a comparatively lightweight extractor. This work suggests a fruitful direction of combining fast candidate generation with deeper entailment-based verification for robust machine comprehension.

Abstract

We present the EpiReader, a novel model for machine comprehension of text. Machine comprehension of unstructured, real-world text is a major research goal for natural language processing. Current tests of machine comprehension pose questions whose answers can be inferred from some supporting text, and evaluate a model's response to the questions. The EpiReader is an end-to-end neural model comprising two components: the first component proposes a small set of candidate answers after comparing a question to its supporting text, and the second component formulates hypotheses using the proposed candidates and the question, then reranks the hypotheses based on their estimated concordance with the supporting text. We present experiments demonstrating that the EpiReader sets a new state-of-the-art on the CNN and Children's Book Test machine comprehension benchmarks, outperforming previous neural models by a significant margin.

Paper Structure

This paper contains 16 sections, 7 equations, 2 figures, 2 tables.

Figures (2)

  • Figure 1: The complete EpiReader framework. The Extractor is above, the Reasoner below. Propagating the Extractor's probability estimates forward and combining them with the Reasoner's entailment estimates renders the model end-to-end differentiable.
  • Figure 2: An abridged example from CBT-NE demonstrating corrective reranking by the Reasoner.