End-to-End Neuro-Symbolic Reinforcement Learning with Textual Explanations
Lirui Luo, Guoxi Zhang, Hongming Xu, Yaodong Yang, Cong Fang, Qing Li
TL;DR
This work tackles explainability and sample efficiency in neuro-symbolic reinforcement learning by proposing INSIGHT, which jointly learns manageable structured states and coordinate-based symbolic policies from visual inputs. It achieves this by distilling vision foundation models into a scalable perception module, coupling it with an Extrapolation-based Logic (EQL) policy learner guided by a neural actor, and augmenting it with a GPT-4 powered textual explanations pipeline grounded through concept grounding. The approach yields state refinement with reward signals, competitive Atari results, and meaningful natural-language explanations for policies and decisions. The findings suggest that end-to-end refinement of structured states, combined with neural guidance and accessible explanations, enhances both performance and transparency in NS-RL, with practical implications for trustworthy AI.
Abstract
Neuro-symbolic reinforcement learning (NS-RL) has emerged as a promising paradigm for explainable decision-making, characterized by the interpretability of symbolic policies. NS-RL entails structured state representations for tasks with visual observations, but previous methods cannot refine the structured states with rewards due to a lack of efficiency. Accessibility also remains an issue, as extensive domain knowledge is required to interpret symbolic policies. In this paper, we present a neuro-symbolic framework for jointly learning structured states and symbolic policies, whose key idea is to distill the vision foundation model into an efficient perception module and refine it during policy learning. Moreover, we design a pipeline to prompt GPT-4 to generate textual explanations for the learned policies and decisions, significantly reducing users' cognitive load to understand the symbolic policies. We verify the efficacy of our approach on nine Atari tasks and present GPT-generated explanations for policies and decisions.
