Leveraging Peer, Self, and Teacher Assessments for Generative AI-Enhanced Feedback
Alvaro Becerra, Ruth Cobos
TL;DR
The paper tackles the challenge of delivering timely, high-quality feedback in large engineering courses by evaluating a GenAI-augmented feedback system (AICoFe) that combines teacher, peer, and self assessments of oral presentations. It provides a rigorous empirical analysis of agreement, correlation, and biases among evaluators using a validated rubric, and demonstrates the limitations of equal weighting in aggregating inputs. The authors propose a weighted input integration and bias-detection framework for the GenAI component (GePeTo), enabling context-aware, pedagogy-aligned feedback with human oversight. The findings and design recommendations offer a practical pathway to scalable, trustworthy AI-assisted feedback that preserves pedagogical validity and transparency in engineering education.
Abstract
Providing timely and meaningful feedback remains a persistent challenge in higher education, especially in large courses where teachers must balance formative depth with scalability. Recent advances in Generative Artificial Intelligence (GenAI) offer new opportunities to support feedback processes while maintaining human oversight. This paper presents an study conducted within the AICoFe (AI-based Collaborative Feedback) system, which integrates teacher, peer, and self-assessments of engineering students' oral presentations. Using a validated rubric, 46 evaluation sets were analyzed to examine agreement, correlation, and bias across evaluators. The analyses revealed consistent overall alignment among sources but also systematic variations in scoring behavior, reflecting distinct evaluative perspectives. These findings informed the proposal of an enhanced GenAI model within AICoFe system, designed to integrate human assessments through weighted input aggregation, bias detection, and context-aware feedback generation. The study contributes empirical evidence and design principles for developing GenAI-based feedback systems that combine data-based efficiency with pedagogical validity and transparency.
