Representational Alignment Supports Effective Machine Teaching
Ilia Sucholutsky, Katherine M. Collins, Maya Malaviya, Nori Jacoby, Weiyang Liu, Theodore R. Sumers, Michalis Korakakis, Umang Bhatt, Mark Ho, Joshua B. Tenenbaum, Brad Love, Zachary A. Pardos, Adrian Weller, Thomas L. Griffiths
TL;DR
This work investigates representational alignment as a core determinant of teaching effectiveness, introducing GRADE to controllably manipulate teacher accuracy and teacher-student representation similarity on a grid-based task. Through machine-machine and machine-human experiments, the authors derive a dyadic utility curve linking alignment, accuracy, and learning outcomes, and show that alignment can compensate for lower accuracy in some cases. Extending to classroom-scale settings, they demonstrate that class size and representational diversity moderate alignment benefits, and propose GRADE-Match to optimally assign students to teachers. The findings highlight that future AI tutors and educational systems should prioritize alignment with human learners in addition to raw task performance, informing mechanisms for personalized teaching and teacher matching. The work also provides a theoretical formulation and rich appendix detailing simulations, human experiments, and classroom simulations, enabling future replication and extension.
Abstract
A good teacher should not only be knowledgeable, but should also be able to communicate in a way that the student understands -- to share the student's representation of the world. In this work, we introduce a new controlled experimental setting, GRADE, to study pedagogy and representational alignment. We use GRADE through a series of machine-machine and machine-human teaching experiments to characterize a utility curve defining a relationship between representational alignment, teacher expertise, and student learning outcomes. We find that improved representational alignment with a student improves student learning outcomes (i.e., task accuracy), but that this effect is moderated by the size and representational diversity of the class being taught. We use these insights to design a preliminary classroom matching procedure, GRADE-Match, that optimizes the assignment of students to teachers. When designing machine teachers, our results suggest that it is important to focus not only on accuracy, but also on representational alignment with human learners.
