Hierarchical Neuro-Symbolic Decision Transformer
Ali Baheri, Cecilia O. Alm
TL;DR
The paper addresses the challenge of long-horizon decision-making under uncertainty by uniting high-level symbolic planning with a transformer-based low-level policy. It introduces a hierarchical neuro-symbolic decision transformer that uses a bidirectional interface to translate symbolic operators into sub-goals for a decision transformer and to map raw states back into symbolic predicates, preserving interpretability while leveraging deep sequence modeling. Theoretical results provide hierarchical regret and PAC bounds that quantify how planning and execution errors propagate through the system. Empirical evaluation on stochastic grid-worlds demonstrates improved success rates and sample efficiency over purely symbolic, purely neural, and existing hierarchical baselines, with notable robustness to action-noise and task complexity. The approach offers a principled framework for scalable, interpretable, and resilient sequential decision-making with potential extensions to continuous domains and robotics.
Abstract
We present a hierarchical neuro-symbolic control framework that tightly couples a classical symbolic planner with a transformer-based policy to address long-horizon decision-making under uncertainty. At the high level, the planner assembles an interpretable sequence of operators that guarantees logical coherence with task constraints, while at the low level each operator is rendered as a sub-goal token that conditions a decision transformer to generate fine-grained actions directly from raw observations. This bidirectional interface preserves the combinatorial efficiency and explainability of symbolic reasoning without sacrificing the adaptability of deep sequence models, and it permits a principled analysis that tracks how approximation errors from both planning and execution accumulate across the hierarchy. Empirical studies in stochastic grid-world domains demonstrate that the proposed method consistently surpasses purely symbolic, purely neural and existing hierarchical baselines in both success and efficiency, highlighting its robustness for sequential tasks.
