Open-Retrieval Conversational Question Answering

Chen Qu; Liu Yang; Cen Chen; Minghui Qiu; W. Bruce Croft; Mohit Iyyer

TL;DR

The paper introduces open-retrieval conversational question answering (ORConvQA) and the OR-QuAC dataset, built by combining QuAC with CANARD question rewrites and a large Wikipedia passage collection so that evidence must be retrieved before an answer is extracted. It presents an end-to-end Transformer-based system with a learnable retriever, a reranker, and a reader, all of which incorporate history modeling; training consists of retriever pretraining and concurrent multi-task learning. Results show that a learnable retriever is crucial, that incorporating dialog history in all components yields substantial gains, and that the reranker acts as a regularizer; initial questions are found to be particularly informative. The work provides new insights into ORConvQA design and establishes a dataset and methodology to advance open-retrieval conversational search.

Abstract

Conversational search is one of the ultimate goals of information retrieval. Recent research approaches conversational search through the simplified settings of response ranking and conversational question answering, where an answer is either selected from a given candidate set or extracted from a given passage. These simplifications neglect the fundamental role of retrieval in conversational search. To address this limitation, we introduce an open-retrieval conversational question answering (ORConvQA) setting, where we learn to retrieve evidence from a large collection before extracting answers, as a further step towards building functional conversational search systems. We create a dataset, OR-QuAC, to facilitate research on ORConvQA. We build an end-to-end system for ORConvQA, featuring a retriever, a reranker, and a reader that are all based on Transformers. Our extensive experiments on OR-QuAC demonstrate that a learnable retriever is crucial for ORConvQA. We further show that our system achieves a substantial improvement when we enable history modeling in all system components. Moreover, we show that the reranker component contributes to model performance by providing a regularization effect. Finally, we perform further in-depth analyses to provide new insights into ORConvQA.
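
To make the retrieval step of the setting described above more concrete, the following is a minimal sketch of history-aware retrieval. It is not the authors' system: the paper's retriever is a learned Transformer encoder, whereas this toy uses a bag-of-words count vector as a stand-in, and the function names (build_query, embed, retrieve), the window parameter, and the example passages are all assumptions made for illustration. The reranker and reader stages are omitted.

```python
from collections import Counter


def build_query(history, question, window=2):
    """Prepend the last `window` history questions to the current question.

    Hypothetical helper: one simple way to expose dialog history to a retriever.
    """
    turns = history[-window:] if window > 0 else []
    return " ".join(turns + [question])


def embed(text):
    """Toy stand-in for a learned encoder: a bag-of-words count vector."""
    return Counter(text.lower().split())


def dot(a, b):
    """Inner product of two sparse count vectors."""
    return sum(count * b[token] for token, count in a.items() if token in b)


def retrieve(query, passages, k=2):
    """Score every passage against the query and return the top-k indices.

    Ties are broken by passage index so the result is deterministic.
    """
    q = embed(query)
    scored = [(dot(q, embed(p)), i) for i, p in enumerate(passages)]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [i for _, i in scored[:k]]


# Illustrative data only; not drawn from OR-QuAC.
passages = [
    "The Beatles released the album Revolver in 1966",
    "Frank Zappa released the album Freak Out in 1966",
    "Paris is the capital of France",
]
history = ["Who is Frank Zappa"]
question = "what album did he release in 1966"

# Without history, "he" is unresolved and the two album passages tie.
top_without_history = retrieve(build_query([], question), passages, k=1)

# With history, the earlier turn names the entity and disambiguates the query.
top_with_history = retrieve(build_query(history, question), passages, k=2)
```

In this toy example the current question alone cannot separate the two album passages, while prepending the first turn ranks the Frank Zappa passage first. This is only meant to illustrate why a conversational question may need its history before retrieval; it says nothing about how the paper's components are implemented or trained.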
