A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential Decision Making
Carlos Núñez-Molina, Pablo Mesejo, Juan Fernández-Olivares
TL;DR
SDM spans AP and RL, each with strengths in planning and learning but limitations in data efficiency, generalization, and interpretability. The paper surveys symbolic, subsymbolic, and hybrid SDM methods, and introduces Learn-to-Plan as a bridge between AP and RL, while also detailing approaches to learn SDP structure (action models and domain knowledge). It offers a two-dimensional taxonomy—solution method (AP, RL, learn-to-plan) and knowledge representation (symbolic, subsymbolic, hybrid)—and argues that neurosymbolic AI, which combines planning with learning and symbolic-subsymbolic representations, is the most promising route toward an ideal SDM. The analysis highlights five desirable properties for SDM methods (applicability, ease of use, efficiency, interpretability, generalizability) and advocates integrating planning and learning to achieve these goals, especially in discrete $MDP$/$POMDP$ settings. Overall, the work provides a comprehensive framework and roadmap for developing unified, interpretable, and data-efficient SDM systems that leverage the strengths of both symbolic planning and deep learning.
Abstract
In the field of Sequential Decision Making (SDM), two paradigms have historically vied for supremacy: Automated Planning (AP) and Reinforcement Learning (RL). In the spirit of reconciliation, this article reviews AP, RL and hybrid methods (e.g., novel learn to plan techniques) for solving Sequential Decision Processes (SDPs), focusing on their knowledge representation: symbolic, subsymbolic, or a combination. Additionally, it also covers methods for learning the SDP structure. Finally, we compare the advantages and drawbacks of the existing methods and conclude that neurosymbolic AI poses a promising approach for SDM, since it combines AP and RL with a hybrid knowledge representation.
