Coaching Copilot: Blended Form of an LLM-Powered Chatbot and a Human Coach to Effectively Support Self-Reflection for Leadership Growth
Riku Arakawa, Hiromu Yakura
TL;DR
This work addresses the challenge of fostering deep self-reflection for leadership growth, proposing a blended approach that combines an LLM-powered chatbot with human coaches. Through a design workshop and a two-week field study with ten coach–client pairs, it provides empirical evidence on where chatbots can help, what roles humans should play, and how clients perceive and engage with the technology. The authors offer an actionable guideline for deploying chatbot-assisted reflection in executive coaching and delineate the current limits of chatbot capability, emphasizing the value of keeping a human in the loop for deeper introspection. The findings have practical implications for scalable leadership development and suggest broader applicability of blended AI–human coaching in other reflection-intensive settings.
Abstract
Chatbots' role in fostering self-reflection is now widely recognized, especially in inducing users' behavior change. While the benefits of 24/7 availability, scalability, and consistent responses have been demonstrated in contexts such as healthcare and tutoring, where chatbots help users form new habits, their use in coaching, which requires deeper introspective dialogue to induce leadership growth, remains unexplored. This paper explores the potential of such a chatbot, powered by recent Large Language Models (LLMs), working in collaboration with professional coaches in the field of executive coaching. Through a design workshop with these coaches and a two-week user study involving ten coach-client pairs, we explored the feasibility and nuances of integrating chatbots to complement human coaches. Our findings highlight the benefits of chatbots' ubiquity and the reasoning capabilities enabled by LLMs, while identifying their limitations and the design requirements for effective collaboration between human coaches and chatbots. In doing so, this work contributes to the foundation for augmenting one's self-reflective process with prevalent conversational agents through a human-in-the-loop approach.
