Table of Contents
Fetching ...

Diversifying Question Generation over Knowledge Base via External Natural Questions

Shasha Guo, Jing Zhang, Xirui Ke, Cuiping Li, Hong Chen

TL;DR

KBQG has traditionally pursued one-shot question quality, but this work argues for expressive diversity that preserves semantics and introduces the Diverse@k metric to quantify top-k per-instance diversity under a ground-truth relevance constraint. It proposes a dual forward/backward framework that injects diverse expressions from external natural questions via two simple pseudo-pair selection strategies, enabling richer paraphrase-like alternatives without relying solely on paraphrase models. Experiments on WebQuestions and PathQuestions show clear diversity gains over PLM baselines and competitive performance with ChatGPT, while maintaining relevant semantics, demonstrating the approach's practical value for downstream QA and evaluation tasks. Overall, the paper advances KBQG by combining external linguistic diversity with reliable data augmentation to yield more human-like, diverse question generation and improved downstream usefulness.

Abstract

Previous methods on knowledge base question generation (KBQG) primarily focus on enhancing the quality of a single generated question. Recognizing the remarkable paraphrasing ability of humans, we contend that diverse texts should convey the same semantics through varied expressions. The above insights make diversifying question generation an intriguing task, where the first challenge is evaluation metrics for diversity. Current metrics inadequately assess the above diversity since they calculate the ratio of unique n-grams in the generated question itself, which leans more towards measuring duplication rather than true diversity. Accordingly, we devise a new diversity evaluation metric, which measures the diversity among top-k generated questions for each instance while ensuring their relevance to the ground truth. Clearly, the second challenge is how to enhance diversifying question generation. To address this challenge, we introduce a dual model framework interwoven by two selection strategies to generate diverse questions leveraging external natural questions. The main idea of our dual framework is to extract more diverse expressions and integrate them into the generation model to enhance diversifying question generation. Extensive experiments on widely used benchmarks for KBQG demonstrate that our proposed approach generates highly diverse questions and improves the performance of question answering tasks.

Diversifying Question Generation over Knowledge Base via External Natural Questions

TL;DR

KBQG has traditionally pursued one-shot question quality, but this work argues for expressive diversity that preserves semantics and introduces the Diverse@k metric to quantify top-k per-instance diversity under a ground-truth relevance constraint. It proposes a dual forward/backward framework that injects diverse expressions from external natural questions via two simple pseudo-pair selection strategies, enabling richer paraphrase-like alternatives without relying solely on paraphrase models. Experiments on WebQuestions and PathQuestions show clear diversity gains over PLM baselines and competitive performance with ChatGPT, while maintaining relevant semantics, demonstrating the approach's practical value for downstream QA and evaluation tasks. Overall, the paper advances KBQG by combining external linguistic diversity with reliable data augmentation to yield more human-like, diverse question generation and improved downstream usefulness.

Abstract

Previous methods on knowledge base question generation (KBQG) primarily focus on enhancing the quality of a single generated question. Recognizing the remarkable paraphrasing ability of humans, we contend that diverse texts should convey the same semantics through varied expressions. The above insights make diversifying question generation an intriguing task, where the first challenge is evaluation metrics for diversity. Current metrics inadequately assess the above diversity since they calculate the ratio of unique n-grams in the generated question itself, which leans more towards measuring duplication rather than true diversity. Accordingly, we devise a new diversity evaluation metric, which measures the diversity among top-k generated questions for each instance while ensuring their relevance to the ground truth. Clearly, the second challenge is how to enhance diversifying question generation. To address this challenge, we introduce a dual model framework interwoven by two selection strategies to generate diverse questions leveraging external natural questions. The main idea of our dual framework is to extract more diverse expressions and integrate them into the generation model to enhance diversifying question generation. Extensive experiments on widely used benchmarks for KBQG demonstrate that our proposed approach generates highly diverse questions and improves the performance of question answering tasks.
Paper Structure (26 sections, 5 equations, 3 figures, 6 tables, 1 algorithm)

This paper contains 26 sections, 5 equations, 3 figures, 6 tables, 1 algorithm.

Figures (3)

  • Figure 1: Example questions generated by BART, BART+Paraphrase, and our approach on the WebQuestions (WQ) dataset. Given a set of triples with the answer(underlined), each method returns top-3 questions, where the various surface forms are marked in different colors.
  • Figure 2: Overview of our proposed method. The forward model $f_\theta$ and the backward model $b_\varphi$ are pre-trained on training data $\mathcal{D}$ and then iteratively fine-tuned on reliable pseudo pairs $\{(b_{\varphi}(Q_j), Q_j) \}_{j=1}^R$ and $\{(f_{\theta}(G_i), G_i )\}_{i=1}^S$ respectively, which are filtered by our proposed selection strategies.
  • Figure 3: The prompt for Large Language Models.