EduThink4AI: Bridging Educational Critical Thinking and Multi-Agent LLM Systems
Xinmeng Hou, Ziting Chang, Zhouquan Lu, Chen Wenli, Liang Wan, Wei Feng, Hai Hu, Qing Guo
TL;DR
This paper addresses the brittleness of current LLM educational tutors in promoting genuine critical thinking and resilience to adversarial prompts. It introduces EDU-Prompting, a modular, four-agent framework that bridges educational critical-thinking theories with LLM agent design through two phases: Phase I generates Raw Answers and Validity Cues via zero-shot agents, and Phase II applies Critique and Aggregation agents using zero-shot Chain-of-Thought reasoning to produce a final, bias-aware explanation. The approach is validated across standard benchmark suites for truthfulness, logical coherence, and bias, and is demonstrated in a full-stack college-writing application with three experimental conditions. Results show significant improvements in content truthfulness, logical soundness, and safety, while maintaining compatibility with existing prompting frameworks and enabling flexible deployment in educational settings.
Abstract
Large language models (LLMs) have demonstrated significant potential as educational tutoring agents, capable of tailoring hints, orchestrating lessons, and grading with near-human finesse across various academic domains. However, current LLM-based educational systems exhibit critical limitations in promoting genuine critical thinking, failing on over one-third of multi-hop questions with counterfactual premises, and remaining vulnerable to adversarial prompts that trigger biased or factually incorrect responses. To address these gaps, we propose \textbf{EDU-Prompting}, a novel multi-agent framework that bridges established educational critical thinking theories with LLM agent design to generate critical, bias-aware explanations while fostering diverse perspectives. Our systematic evaluation across theoretical benchmarks and practical college-level critical writing scenarios demonstrates that EDU-Prompting significantly enhances both content truthfulness and logical soundness in AI-generated educational responses. The framework's modular design enables seamless integration into existing prompting frameworks and educational applications, allowing practitioners to directly incorporate critical thinking catalysts that promote analytical reasoning and introduce multiple perspectives without requiring extensive system modifications.
