SlideBot: A Multi-Agent Framework for Generating Informative, Reliable, Multi-Modal Presentations

Eric Xie; Danielle Waterfield; Michael Kennedy; Aidong Zhang

SlideBot: A Multi-Agent Framework for Generating Informative, Reliable, Multi-Modal Presentations

Eric Xie, Danielle Waterfield, Michael Kennedy, Aidong Zhang

TL;DR

SlideBot presents a modular, multi-agent framework for generating informative, reliable, and practical university-level presentations by grounding outputs in external sources and applying evidence-based instructional design (CLT and CTML). The pipeline decouples content retrieval, structured planning, and LaTeX Beamer code generation, coordinated by a central Moderator, and augments slides with instructor-facing comments and figure macros. Empirical evaluations in AI/biomedical education show SlideBot outperforms Microsoft Copilot and direct prompting across informativeness, reliability, and practicality, driven more by architectural decomposition than by base model size. The work demonstrates a scalable, flexible approach to AI-assisted slides that mitigates hallucinations and supports instructor customization, with clear directions for future enhancements and broader domain deployment.

Abstract

Large Language Models (LLMs) have shown immense potential in education, automating tasks like quiz generation and content summarization. However, generating effective presentation slides introduces unique challenges due to the complexity of multimodal content creation and the need for precise, domain-specific information. Existing LLM-based solutions often fail to produce reliable and informative outputs, limiting their educational value. To address these limitations, we introduce SlideBot - a modular, multi-agent slide generation framework that integrates LLMs with retrieval, structured planning, and code generation. SlideBot is organized around three pillars: informativeness, ensuring deep and contextually grounded content; reliability, achieved by incorporating external sources through retrieval; and practicality, which enables customization and iterative feedback through instructor collaboration. It incorporates evidence-based instructional design principles from Cognitive Load Theory (CLT) and the Cognitive Theory of Multimedia Learning (CTML), using structured planning to manage intrinsic load and consistent visual macros to reduce extraneous load and enhance dual-channel learning. Within the system, specialized agents collaboratively retrieve information, summarize content, generate figures, and format slides using LaTeX, aligning outputs with instructor preferences through interactive refinement. Evaluations from domain experts and students in AI and biomedical education show that SlideBot consistently enhances conceptual accuracy, clarity, and instructional value. These findings demonstrate SlideBot's potential to streamline slide preparation while ensuring accuracy, relevance, and adaptability in higher education.

SlideBot: A Multi-Agent Framework for Generating Informative, Reliable, Multi-Modal Presentations

TL;DR

Abstract

SlideBot: A Multi-Agent Framework for Generating Informative, Reliable, Multi-Modal Presentations

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (20)