Demonstration Based Explainable AI for Learning from Demonstration Methods
Morris Gu, Elizabeth Croft, Dana Kulic
TL;DR
This work addresses the interpretability gap in Learning from Demonstration by introducing an adaptive explanatory feedback system that generates and selectively presents trajectory demonstrations from the learned policy via MaxEntIRL. The method categorizes explanatory trajectories as successful or unsuccessful based on termination states in $T_u$ and maintains a balanced sampling ratio to visualize both outcomes, enabling users to form a more accurate mental model. A two-condition user study (EF vs NF) with 26 participants on a grid-world navigation task shows that explanatory feedback improves robot performance and teaching efficiency, and enhances user understanding as reflected in prediction accuracy, though perception-related metrics show limited change in a between-subjects design. The findings suggest that XAI-driven explanatory feedback can meaningfully augment human-to-robot teaching, with potential for generalization to broader LfD systems and tasks, and point to future work implementing more complex tasks and disambiguating local versus global policy understanding.
Abstract
Learning from Demonstration (LfD) is a powerful type of machine learning that can allow novices to teach and program robots to complete various tasks. However, the learning process for these systems may still be difficult for novices to interpret and understand, making effective teaching challenging. Explainable artificial intelligence (XAI) aims to address this challenge by explaining a system to the user. In this work, we investigate XAI within LfD by implementing an adaptive explanatory feedback system on an inverse reinforcement learning (IRL) algorithm. The feedback is implemented by demonstrating selected learnt trajectories to users. The system adapts to user teaching by categorizing and then selectively sampling trajectories shown to a user, to show a representative sample of both successful and unsuccessful trajectories. The system was evaluated through a user study with 26 participants teaching a robot a navigation task. The results of the user study demonstrated that the proposed explanatory feedback system can improve robot performance, teaching efficiency and user understanding of the robot.
