PresentCoach: Dual-Agent Presentation Coaching through Exemplars and Interactive Feedback
Sirui Chen, Jinsong Zhou, Xinli Xu, Xiaoyu Yang, Litao Guo, Ying-Cong Chen
TL;DR
PresentCoach addresses the gap in presentation training by integrating a slide-aware exemplar generator with a multimodal, audience-aware Coach Agent in a closed feedback loop. The Ideal Presentation Agent creates a personalized model presentation in the user’s own voice, while the Coach Agent provides Observation-Impact-Suggestion feedback and simulates audience responses to guide practice. A controlled study (N=24) shows that rehearsal with PresentCoach reduces public-speaking anxiety, maintains a moderate cognitive workload, and yields high usability, with participants perceiving clear role differentiation and synergistic benefits from the dual-agent interaction. The work demonstrates a scalable, human-centered approach to deliberate practice in educational and professional contexts, offering concrete targets and actionable guidance for improvement.
Abstract
Effective presentation skills are essential in education, professional communication, and public speaking, yet learners often lack access to high-quality exemplars or personalized coaching. Existing AI tools typically provide isolated functionalities such as speech scoring or script generation without integrating reference modeling and interactive feedback into a cohesive learning experience. We introduce a dual-agent system that supports presentation practice through two complementary roles: the Ideal Presentation Agent and the Coach Agent. The Ideal Presentation Agent converts user-provided slides into model presentation videos by combining slide processing, visual-language analysis, narration script generation, personalized voice synthesis, and synchronized video assembly. The Coach Agent then evaluates user-recorded presentations against these exemplars, conducting multimodal speech analysis and delivering structured feedback in an Observation-Impact-Suggestion (OIS) format. To enhance the authenticity of the learning experience, the Coach Agent incorporates an Audience Agent, which simulates the perspective of a human listener and provides humanized feedback reflecting audience reactions and engagement. Together, these agents form a closed loop of observation, practice, and feedback. Implemented on a robust backend with multi-model integration, voice cloning, and error handling mechanisms, the system demonstrates how AI-driven agents can provide engaging, human-centered, and scalable support for presentation skill development in both educational and professional contexts.
