Creating a customisable Socratic AI physics tutor
Eugenio Tufino, Bor Gregorcic
TL;DR
Role engineering via Gemini Gems enables tailoring LLMs into specialized physics tutors by scripting a pedagogical persona and grounding with Knowledge documents. The paper demonstrates two illustrative use cases: multimodal feedback on force diagrams using course-specific notation and electromagnetism reasoning guided by pre-trained knowledge. The results show that a role-engineered Gem can sustain a Socratic dialogue and scaffold step-by-step problem solving, in contrast to unmodified models that output direct solutions, with a key formula such as $\mathcal{E} = B \cdot L \cdot v$ featured in the EM case. While promising as an accessible path for educators to create tailored learning tools, the work remains qualitative and acknowledges potential inaccuracies, underscoring the need for systematic evaluation and expansion to other physics roles.
Abstract
This paper explores role engineering as an effective paradigm for customizing Large Language Models (LLMs) into specialized AI tutors for physics education. We demonstrate this methodology by designing a Socratic physics problem-solving tutor using Google's Gemini Gems feature, defining its pedagogical behavior through a detailed 'script' that specifies its role and persona. We present two illustrative use cases: the first demonstrates the Gem's multimodal ability to analyze a student's hand-drawn force diagram and apply notational rules from a 'Knowledge' file; the second showcases its capacity to guide conceptual reasoning in electromagnetism using its pre-trained knowledge without using specific documents provided by the instructor. Our findings show that the 'role-engineered' Gem successfully facilitates a Socratic dialogue, in stark contrast to a standard Gemini model, which tends to immediately provide direct solutions. We conclude that role engineering is a pivotal and accessible method for educators to transform a general-purpose 'solution provider' into a reliable pedagogical tutor capable of engaging students in an active reflection process. This approach offers a powerful tool for both instructors and students, while also highlighting the importance of addressing the technology's inherent limitations, such as the potential for occasional inaccuracies.
