Table of Contents
Fetching ...

Improving Question Answering by Commonsense-Based Pre-Training

Wanjun Zhong, Duyu Tang, Nan Duan, Ming Zhou, Jiahai Wang, Jian Yin

TL;DR

This work addresses the challenge of commonsense reasoning in QA by pre-training functions that model direct and indirect relations between concepts using ConceptNet. The pre-trained commonsense scoring is then integrated with standard document-based QA models to improve multi-choice question answering across ARC, SemEval, and OpenBook QA datasets. The method combines concept-level encodings with graph-structured neighbor information, and uses a ranking-based objective with a simple scoring fusion f(a_i)=\alpha f_{doc}(a_i)+\beta f_{cs}(a_i). Overall, the results show that external commonsense knowledge provides complementary signals that improve reasoning, though the authors acknowledge limitations in disambiguation and complex logical reasoning, proposing context-aware and semantic-parsing enhancements for future work.

Abstract

Although neural network approaches achieve remarkable success on a variety of NLP tasks, many of them struggle to answer questions that require commonsense knowledge. We believe the main reason is the lack of commonsense \mbox{connections} between concepts. To remedy this, we provide a simple and effective method that leverages external commonsense knowledge base such as ConceptNet. We pre-train direct and indirect relational functions between concepts, and show that these pre-trained functions could be easily added to existing neural network models. Results show that incorporating commonsense-based function improves the baseline on three question answering tasks that require commonsense reasoning. Further analysis shows that our system \mbox{discovers} and leverages useful evidence from an external commonsense knowledge base, which is missing in existing neural network models and help derive the correct answer.

Improving Question Answering by Commonsense-Based Pre-Training

TL;DR

This work addresses the challenge of commonsense reasoning in QA by pre-training functions that model direct and indirect relations between concepts using ConceptNet. The pre-trained commonsense scoring is then integrated with standard document-based QA models to improve multi-choice question answering across ARC, SemEval, and OpenBook QA datasets. The method combines concept-level encodings with graph-structured neighbor information, and uses a ranking-based objective with a simple scoring fusion f(a_i)=\alpha f_{doc}(a_i)+\beta f_{cs}(a_i). Overall, the results show that external commonsense knowledge provides complementary signals that improve reasoning, though the authors acknowledge limitations in disambiguation and complex logical reasoning, proposing context-aware and semantic-parsing enhancements for future work.

Abstract

Although neural network approaches achieve remarkable success on a variety of NLP tasks, many of them struggle to answer questions that require commonsense knowledge. We believe the main reason is the lack of commonsense \mbox{connections} between concepts. To remedy this, we provide a simple and effective method that leverages external commonsense knowledge base such as ConceptNet. We pre-train direct and indirect relational functions between concepts, and show that these pre-trained functions could be easily added to existing neural network models. Results show that incorporating commonsense-based function improves the baseline on three question answering tasks that require commonsense reasoning. Further analysis shows that our system \mbox{discovers} and leverages useful evidence from an external commonsense knowledge base, which is missing in existing neural network models and help derive the correct answer.

Paper Structure

This paper contains 10 sections, 4 equations, 6 figures, 4 tables.

Figures (6)

  • Figure 1: Examples from ARC that require commonsense knowledge and reasoning.
  • Figure 2: A sampled subgraph from ConceptNet with "driving" as the central word.
  • Figure 3: An overview of our system for commonsense based question answering.
  • Figure 4: An example from ARC dataset. The analysis of this example could be improved if it is given the fact {"electrons", "HasA", "negative charge"} in ConceptNet.
  • Figure 5: Examples that require commonsense-based direct relations between concepts on ARC and SemEval datasets.
  • ...and 1 more figures