A Neural Rewriting System to Solve Algorithmic Problems

Flavio Petruzzellis; Alberto Testolin; Alessandro Sperduti

TL;DR

The paper tackles systematic, compositional generalization in neural models by targeting formula simplification with a modular Neural Rewriting System. It introduces three neural components (Selector $sel$, Solver $sol$, and Combiner $com$) that mirror a classic rewriting algorithm and iteratively reduce nested formulas one leaf at a time. Empirical results on ListOps, Arithmetic, and Algebra show that the Neural Rewriting System achieves stronger out-of-distribution generalization than both the Neural Data Router baseline and GPT-4 prompting, while revealing that leaf selection on long inputs is the principal bottleneck. The work provides a principled, interpretable architecture for symbolic reasoning with neural networks and identifies concrete directions (e.g., improving length generalization and Selector reliability) for future progress in algorithmic generalization.

Abstract

Modern neural network architectures still struggle to learn algorithmic procedures that require systematically applying compositional rules to solve out-of-distribution problem instances. In this work, we focus on formula simplification problems, a class of synthetic benchmarks used to study the systematic generalization capabilities of neural architectures. We propose a modular architecture designed to learn a general procedure for solving nested mathematical formulas while relying only on a minimal set of training examples. Inspired by rewriting systems, a classic framework in symbolic artificial intelligence, we include in the architecture three specialized and interacting modules: the Selector, trained to identify solvable sub-expressions; the Solver, which maps sub-expressions to their values; and the Combiner, which replaces sub-expressions in the original formula with the solution provided by the Solver. We benchmark our system against the Neural Data Router, a recent model specialized for systematic generalization, and a state-of-the-art large language model (GPT-4) probed with advanced prompting strategies. We demonstrate that our approach achieves a higher degree of out-of-distribution generalization than these alternative approaches on three different types of formula simplification problems, and we discuss its limitations by analyzing its failures.
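The select, solve, combine loop described in the abstract can be illustrated with a purely symbolic sketch. This is not the paper's implementation: the three functions below are hand-written stand-ins for the learned Selector, Solver, and Combiner modules, and the bracketed formula syntax, the restriction to the `MIN`, `MAX`, and `SM` (sum modulo 10) operators with single-digit arguments, and all function names are assumptions made for illustration.

```python
import re

# Assumed ListOps-style syntax: "[OP d d ...]" with single-digit arguments.
# A leaf formula is a bracketed expression that contains no nested brackets.
LEAF = re.compile(r"\[(MIN|MAX|SM)((?: \d)+)\]")


def select(formula):
    """Stand-in for the Selector: return the first leaf formula, or None."""
    match = LEAF.search(formula)
    return match.group(0) if match else None


def solve(leaf):
    """Stand-in for the Solver: map a leaf formula to its value."""
    op, args = LEAF.fullmatch(leaf).groups()
    numbers = [int(a) for a in args.split()]
    if op == "MIN":
        return str(min(numbers))
    if op == "MAX":
        return str(max(numbers))
    return str(sum(numbers) % 10)  # SM: sum modulo 10


def combine(formula, leaf, value):
    """Stand-in for the Combiner: replace one occurrence of the leaf with its value."""
    return formula.replace(leaf, value, 1)


def simplify(formula, max_steps=100):
    """Rewrite the formula until no leaf formula remains."""
    for _ in range(max_steps):
        leaf = select(formula)
        if leaf is None:
            return formula  # nothing left to rewrite
        formula = combine(formula, leaf, solve(leaf))
    raise RuntimeError("formula did not reduce within max_steps")


print(simplify("[MAX 1 [MIN 3 4] 2]"))
```

Each pass rewrites exactly one leaf, so `[MAX 1 [MIN 3 4] 2]` first becomes `[MAX 1 3 2]` and then `3`. In the paper's system each of the three steps is performed by a trained module rather than by a hand-written rule.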

Paper Structure (22 sections, 7 figures)

This paper contains 22 sections, 7 figures.

Related works
Formula simplification problems
Neural Rewriting System
Selector Module
Solver Module
Combiner Module
Experiments and results
Datasets
Experiments
Neural Rewriting System
Neural Data Router
GPT-4
Results
Discussion
Acknowledgements
...and 7 more sections

Figures (7)

Figure 1: Examples of solution of ListOps, arithmetic and algebraic formulas. Formulas $f$ are reduced to atomic values $e$ by iteratively solving leaf formulas $f^L$, highlighted in yellow.
Figure 2: Schematic representation of the Neural Rewriting System.
Figure 3: Performance of the Neural Data Router, GPT-4 and Neural Rewriting System on ListOps, arithmetic and algebraic formulas.
Figure 4: Input length against the average confidence score of 1,000 outputs of the Selector. The vertical lines represent the maximum input length in the training set.
Figure 5: Errors committed by the Neural Rewriting System classified by cause of error. Error bars corresponding to a complexity split refer to models where the Selector generates 10, 100 or 1,000 outputs per input and (optionally) employs the Dynamic Windowing mechanism. Data splits are defined by their nesting level N and number of arguments A.
...and 2 more figures
