CDGP: Automatic Cloze Distractor Generation based on Pre-trained Language Model
Shang-Hsuan Chiang, Ssu-Cheng Wang, Yao-Chung Fan
TL;DR
This work targets automatic cloze distractor generation, a key component in effective language assessment. It introduces CDGP, a two-stage framework that first uses pre-trained language models to generate candidate distractors (CSG) and then ranks them with a feature-driven (DS) selector, combining PLM confidence, embedding similarities, and POS compatibility. The approach substantially outperforms state-of-the-art methods on the DGen dataset (e.g., $NDCG@10$ rising from $14.94$ to $34.17$) and yields strong results on CLOTH, with ablations showing the Candidate Set Generator as the major contributor to gains and domain-specific PLMs providing additional improvements. This highlights the practical potential of PLM-based distractor generation for scalable, high-quality cloze tests across domains, while also noting evaluation limitations and avenues for further control of distractor difficulty.
Abstract
Manually designing cloze test consumes enormous time and efforts. The major challenge lies in wrong option (distractor) selection. Having carefully-design distractors improves the effectiveness of learner ability assessment. As a result, the idea of automatically generating cloze distractor is motivated. In this paper, we investigate cloze distractor generation by exploring the employment of pre-trained language models (PLMs) as an alternative for candidate distractor generation. Experiments show that the PLM-enhanced model brings a substantial performance improvement. Our best performing model advances the state-of-the-art result from 14.94 to 34.17 (NDCG@10 score). Our code and dataset is available at https://github.com/AndyChiangSH/CDGP.
