Level Up Peer Review in Education: Investigating genAI-driven Gamification system and its influence on Peer Feedback Effectiveness
Rafal Wlodarski, Leonardo da Silva Sousa, Allison Connell Pensky
TL;DR
This work addresses the gap between industry expectations and classroom feedback practices by pairing gamification with a GenAI tutor in a peer-review platform for software engineering education. The authors implement Socratique, integrating a GenAI assistant named Mr. Pepper and MDA-based game mechanics to boost participation and feedback quality in a master’s functional programming course. In a randomized controlled study, the gamified, AI-assisted condition yielded significantly higher feedback quantity and quality (notably in relevance and specificity), with nuanced temporal dynamics and strong potential for scalable education practice. The findings suggest that AI-assisted gamification can scaffold professional-review competencies and enhance feedback literacy in SE curricula, though broader validation and longitudinal analysis are needed.
Abstract
In software engineering (SE), the ability to review code and critique designs is essential for professional practice. However, these skills are rarely emphasized in formal education, and peer feedback quality and engagement can vary significantly among students. This paper introduces Socratique, a gamified peer-assessment platform integrated with Generative AI (GenAI) assistance, designed to develop students' peer-review skills in a functional programming course. By incorporating game elements, Socratique aims to motivate students to provide more feedback, while the GenAI assistant offers real-time support in crafting high quality, constructive comments. To evaluate the impact of this approach, we conducted a randomized controlled experiment with master's students comparing a treatment group with a gamified, GenAI-driven setup against a control group with minimal gamification. Results show that students in the treatment group provided significantly more voluntary feedback, with higher scores on clarity, relevance, and specificity - all key aspects of effective code and design reviews. This study provides evidence for the effectiveness of combining gamification and AI to improve peer review processes, with implications for fostering review-related competencies in software engineering curricula.
