Consensus Attention-based Neural Networks for Chinese Reading Comprehension
Yiming Cui, Ting Liu, Zhipeng Chen, Shijin Wang, Guoping Hu
TL;DR
This work tackles the lack of Chinese Cloze-style reading comprehension data by introducing two large Chinese datasets (People Daily and Children’s Fairy Tale) and a human-evaluated test set. It proposes the Consensus Attention Sum Reader (CAS Reader), a neural architecture that aggregates attention across all query time steps via a merging function to produce word-level predictions directly from the document. Empirical results show CAS Reader achieves strong performance on public benchmarks and significantly improves results on the Chinese datasets, with insights about merging strategies and domain transfer. The paper also establishes a baseline for Chinese RC and highlights directions for handling real-world questions and domain shifts in future work.
Abstract
Reading comprehension has embraced a booming in recent NLP research. Several institutes have released the Cloze-style reading comprehension data, and these have greatly accelerated the research of machine comprehension. In this work, we firstly present Chinese reading comprehension datasets, which consist of People Daily news dataset and Children's Fairy Tale (CFT) dataset. Also, we propose a consensus attention-based neural network architecture to tackle the Cloze-style reading comprehension problem, which aims to induce a consensus attention over every words in the query. Experimental results show that the proposed neural network significantly outperforms the state-of-the-art baselines in several public datasets. Furthermore, we setup a baseline for Chinese reading comprehension task, and hopefully this would speed up the process for future research.
