Learning from Ambiguous Demonstrations with Self-Explanation Guided Reinforcement Learning
Yantian Zha, Lin Guan, Subbarao Kambhampati
TL;DR
Ambiguity in human demonstrations can destabilize reinforcement learning from demonstrations (RLfD). The authors introduce SERLfD, a framework that learns self-explanations of predicate utilities via a Self-Explanation Network (SE-Net) and uses them to augment states or reward signals within a GAN-IRL–inspired training loop. The approach jointly trains a generator agent and the SE-Net, grounding predicates in a predefined vocabulary and leveraging success/failure buffers to identify task-relevant relations. Empirical results across continuous robotic domains and a discrete Pacman task show improved training stability and performance over strong RLfD baselines and GAN-IRL variants, demonstrating SERLfD’s effectiveness in mitigating ambiguity in demonstrations.
Abstract
Our work aims at efficiently leveraging ambiguous demonstrations for the training of a reinforcement learning (RL) agent. An ambiguous demonstration can usually be interpreted in multiple ways, which severely hinders the RL-Agent from learning stably and efficiently. Since an optimal demonstration may also suffer from being ambiguous, previous works that combine RL and learning from demonstration (RLfD works) may not work well. Inspired by how humans handle such situations, we propose to use self-explanation (an agent generates explanations for itself) to recognize valuable high-level relational features as an interpretation of why a successful trajectory is successful. This way, the agent can provide some guidance for its RL learning. Our main contribution is to propose the Self-Explanation for RL from Demonstrations (SERLfD) framework, which can overcome the limitations of traditional RLfD works. Our experimental results show that an RLfD model can be improved by using our SERLfD framework in terms of training stability and performance.
