CDGP: Automatic Cloze Distractor Generation based on Pre-trained Language Model

Shang-Hsuan Chiang; Ssu-Cheng Wang; Yao-Chung Fan

CDGP: Automatic Cloze Distractor Generation based on Pre-trained Language Model

Shang-Hsuan Chiang, Ssu-Cheng Wang, Yao-Chung Fan

TL;DR

This work targets automatic cloze distractor generation, a key component in effective language assessment. It introduces CDGP, a two-stage framework that first uses pre-trained language models to generate candidate distractors (CSG) and then ranks them with a feature-driven (DS) selector, combining PLM confidence, embedding similarities, and POS compatibility. The approach substantially outperforms state-of-the-art methods on the DGen dataset (e.g., $NDCG@10$ rising from $14.94$ to $34.17$) and yields strong results on CLOTH, with ablations showing the Candidate Set Generator as the major contributor to gains and domain-specific PLMs providing additional improvements. This highlights the practical potential of PLM-based distractor generation for scalable, high-quality cloze tests across domains, while also noting evaluation limitations and avenues for further control of distractor difficulty.

Abstract

Manually designing cloze test consumes enormous time and efforts. The major challenge lies in wrong option (distractor) selection. Having carefully-design distractors improves the effectiveness of learner ability assessment. As a result, the idea of automatically generating cloze distractor is motivated. In this paper, we investigate cloze distractor generation by exploring the employment of pre-trained language models (PLMs) as an alternative for candidate distractor generation. Experiments show that the PLM-enhanced model brings a substantial performance improvement. Our best performing model advances the state-of-the-art result from 14.94 to 34.17 (NDCG@10 score). Our code and dataset is available at https://github.com/AndyChiangSH/CDGP.

CDGP: Automatic Cloze Distractor Generation based on Pre-trained Language Model

TL;DR

rising from

) and yields strong results on CLOTH, with ablations showing the Candidate Set Generator as the major contributor to gains and domain-specific PLMs providing additional improvements. This highlights the practical potential of PLM-based distractor generation for scalable, high-quality cloze tests across domains, while also noting evaluation limitations and avenues for further control of distractor difficulty.

Abstract

Paper Structure (24 sections, 1 equation, 3 figures, 9 tables)

This paper contains 24 sections, 1 equation, 3 figures, 9 tables.

Introduction
Related Work
Methodology
CDGP Framework
Candidate Set Generator (CSG)
Distractor Selector (DS)
Performance Evaluation
Dataset
Implementation Details
Evaluation Metric
Automatic Score
Evaluation Results
Results on DGen
Results on CLOTH dataset
Comparing Fine-Tuning Strategy
...and 9 more sections

Figures (3)

Figure 1: A Cloze Test Example: the challenge to cloze test preparation lies in wrong option selection. A good wrong option selection improve the effectiveness of learner ability assessment.
Figure 2: CDGP Framework
Figure 3: The testers' feedback on the difficulty of the questions generated by CDGP (1: easiest, 5: most difficult)

CDGP: Automatic Cloze Distractor Generation based on Pre-trained Language Model

TL;DR

Abstract

CDGP: Automatic Cloze Distractor Generation based on Pre-trained Language Model

Authors

TL;DR

Abstract

Table of Contents

Figures (3)