Symbolic Working Memory Enhances Language Models for Complex Rule Application

Siyuan Wang; Zhongyu Wei; Yejin Choi; Xiang Ren

Symbolic Working Memory Enhances Language Models for Complex Rule Application

Siyuan Wang, Zhongyu Wei, Yejin Choi, Xiang Ren

TL;DR

This work proposes augmenting LLMs with external working memory and introducing a neurosymbolic framework for rule application that matches predicates and variables of symbolic rules and facts to ground applicable rules at each step.

Abstract

Large Language Models (LLMs) have shown remarkable reasoning performance but struggle with multi-step deductive reasoning involving a series of rule application steps, especially when rules are presented non-sequentially. Our preliminary analysis shows that while LLMs excel in single-step rule application, their performance drops significantly in multi-step scenarios due to the challenge in rule grounding. It requires anchoring the applicable rule and supporting facts at each step, amidst multiple input rules, facts, and inferred facts. To address this, we propose augmenting LLMs with external working memory and introduce a neurosymbolic framework for rule application. The memory stores facts and rules in both natural language and symbolic forms, enabling precise tracking. Utilizing this memory, our framework iteratively performs symbolic rule grounding and LLM-based rule implementation. The former matches predicates and variables of symbolic rules and facts to ground applicable rules at each step. Experiments indicate our framework's effectiveness in rule application and its robustness across various steps and settings~\footnote{Code and data are available at \url{https://github.com/SiyuanWangw/RuleApplication}.}.

Symbolic Working Memory Enhances Language Models for Complex Rule Application

TL;DR

Abstract

Paper Structure (37 sections, 6 figures, 19 tables)

This paper contains 37 sections, 6 figures, 19 tables.

Introduction
Preliminary
Problem Definition
External Working Memory
Working Memory Composition
Symbolic Formulation
Memory Schema
Framework
Working Memory Initialization
Symbolic Rule Grounding
LLM-based Rule Implementation
Final Answer Prediction
Experiments
Setup
Datasets
...and 22 more sections

Figures (6)

Figure 1: Performance of GPT-4 using scratchpad Chain-of-Thought (CoT) reasoning across various rule application steps on CLUTRR sinha2019clutrr, with an example of two-step rule application shown above.
Figure 2: An illustration of the working memory.
Figure 3: The workflow of our neurosymbolic rule application framework based on working memory. Details of the memory schema and natural language expressions of facts and rules are omitted in the memory for simplicity.
Figure 4: Examples of predicate and variable matching.
Figure 5: Performance across varying steps of rule application.
...and 1 more figures

Symbolic Working Memory Enhances Language Models for Complex Rule Application

TL;DR

Abstract

Symbolic Working Memory Enhances Language Models for Complex Rule Application

Authors

TL;DR

Abstract

Table of Contents

Figures (6)