Grammar Control in Dialogue Response Generation for Language Learning Chatbots
Dominik Glandorf, Peng Cui, Detmar Meurers, Mrinmaya Sachan
TL;DR
This work tackles grounding dialogue generation in pedagogy by controlling grammar forms through the English Grammar Profile (EGP) within CEFR-guided categories. It evaluates prompting, fine-tuning, and decoding strategies, finding guided decoding with Llama3 provides the best trade-off between grammar constraint satisfaction and response quality, achieving ~59.3% form inclusion while maintaining grammatical correctness comparable to GPT-3.5. A learner-proficiency simulation suggests that grammar-controlled input can boost learners' production of target forms, supporting adaptive practice across CEFR levels. The study highlights practical gains for language-learning chatbots and outlines avenues for real-teacher validation and expanded grammar inventories.
Abstract
Chatbots based on large language models offer cheap conversation practice opportunities for language learners. However, they are hard to control for linguistic forms that correspond to learners' current needs, such as grammar. We control grammar in chatbot conversation practice by grounding a dialogue response generation model in a pedagogical repository of grammar skills. We also explore how this control helps learners to produce specific grammar. We comprehensively evaluate prompting, fine-tuning, and decoding strategies for grammar-controlled dialogue response generation. Strategically decoding Llama3 outperforms GPT-3.5 when tolerating minor response quality losses. Our simulation predicts grammar-controlled responses to support grammar acquisition adapted to learner proficiency. Existing language learning chatbots and research on second language acquisition benefit from these affordances. Code available on GitHub.
