MIX : a Multi-task Learning Approach to Solve Open-Domain Question Answering
Sofian Chaybouti, Achraf Saghe, Aymen Shabou
TL;DR
Open-domain QA faces scalability and cross-document answer comparison challenges. MIX proposes a simple, efficient solution by stacking a BM25 retriever, a RoBERTa-based Scorer, and an Extractor, with a multitask variant that shares the backbone to parallelize re-ranking and reading. It achieves state-of-the-art results on squad-open with RoBERTa-base and demonstrates that shorter paragraph granularity (100 tokens) improves performance. The approach offers practical benefits for search engines and points to future improvements using larger models such as RoBERTa-large.
Abstract
This paper introduces MIX, a multi-task deep learning approach to solve open-ended question-answering. First, we design our system as a multi-stage pipeline of 3 building blocks: a BM25-based Retriever to reduce the search space, a RoBERTa-based Scorer, and an Extractor to rank retrieved paragraphs and extract relevant text spans, respectively. Eventually, we further improve the computational efficiency of our system to deal with the scalability challenge: thanks to multi-task learning, we parallelize the close tasks solved by the Scorer and the Extractor. Our system is on par with state-of-the-art performances on the squad-open benchmark while being simpler conceptually.
