Teach AI How to Code: Using Large Language Models as Teachable Agents for Programming Education
Hyoungwook Jin, Seonghee Lee, Hyungyu Shin, Juho Kim
TL;DR
This paper presents TeachYou, an LBT system where learners teach an LLM-based tutee (AlgoBo) to code algorithms. It introduces the Reflect-Respond prompting pipeline to confine and update AlgoBo's knowledge, Mode-shifting to induce elaboration, and Teaching Helper to provide metacognitive feedback. Technical evaluation shows that Reflect-Respond can configure, persist, and adapt AlgoBo's knowledge state, while a 40-person user study demonstrates that Mode-shifting significantly increases knowledge-building density (effect size $d=0.71$, $p=0.03$) and enhances perceived usefulness for learning. The work argues for cost-efficient, personalized, and scalable LBT with teachable agents, and discusses implications for design, personalization, and classroom deployment.
Abstract
This work investigates large language models (LLMs) as teachable agents for learning by teaching (LBT). LBT with teachable agents helps learners identify knowledge gaps and discover new knowledge. However, teachable agents require expensive programming of subject-specific knowledge. While LLMs as teachable agents can reduce the cost, LLMs' expansive knowledge as tutees discourages learners from teaching. We propose a prompting pipeline that restrains LLMs' knowledge and makes them initiate "why" and "how" questions for effective knowledge-building. We combined these techniques into TeachYou, an LBT environment for algorithm learning, and AlgoBo, an LLM-based tutee chatbot that can simulate misconceptions and unawareness prescribed in its knowledge state. Our technical evaluation confirmed that our prompting pipeline can effectively configure AlgoBo's problem-solving performance. Through a between-subject study with 40 algorithm novices, we also observed that AlgoBo's questions led to knowledge-dense conversations (effect size=0.71). Lastly, we discuss design implications, cost-efficiency, and personalization of LLM-based teachable agents.
