NSA: Neuro-symbolic ARC Challenge
Paweł Batorski, Jannik Brinkmann, Paul Swoboda
TL;DR
This work tackles the Abstraction and Reasoning Corpus (ARC) by proposing NSA, a neuro-symbolic pipeline that combines a transformer-based proposal generator with a symbolic, graph-based DSL search (ARGA/ARGAe). The DSL encodes transformations as graph abstractions with filters and primitives, while the transformer suggests the right primitives and their order, with test-time adaptation via synthetic task generation to tailor the proposals to each task. Pre-training on a large corpus of synthetic ARC-like tasks and subsequent fine-tuning during inference enable efficient search within ARC's 30-minute per-task limit, leading to state-of-the-art performance on the ARC evaluation set (27% improvement over baselines). The results demonstrate the value of integrating learned proposal generation with structured symbolic search for abstract visual reasoning, and point to scalable directions in extending the DSL and tuning adaptation strategies under compute constraints.
Abstract
The Abstraction and Reasoning Corpus (ARC) evaluates general reasoning capabilities that are difficult for both machine learning models and combinatorial search methods. We propose a neuro-symbolic approach that combines a transformer for proposal generation with combinatorial search using a domain-specific language. The transformer narrows the search space by proposing promising search directions, which allows the combinatorial search to find the actual solution in short time. We pre-train the trainsformer with synthetically generated data. During test-time we generate additional task-specific training tasks and fine-tune our model. Our results surpass comparable state of the art on the ARC evaluation set by 27% and compare favourably on the ARC train set. We make our code and dataset publicly available at https://github.com/Batorskq/NSA.
