LARS-VSA: A Vector Symbolic Architecture For Learning with Abstract Rules
Mohamed Mejri, Chandramouli Amarnath, Abhijit Chatterjee
TL;DR
LARS-VSA introduces a vector symbolic architecture leveraging hyperdimensional computing to address the relational bottleneck in abstract reasoning. It develops an HD attention mechanism (HDSymbolicAttention) with explicit vector binding between object representations and symbolic rules, including a binarized variant for efficiency, and a architecture that aggregates multi-head HD relations with minimal information loss. Across discriminative relational tasks, purely relational sequence-to-sequence tasks, and math problem-solving, LARS-VSA achieves higher accuracy and substantially lower memory and latency than baselines like Abstractor and Transformer, especially in data-scarce regimes. The approach demonstrates robust relational reasoning with low overhead, offering a scalable neuro-symbolic alternative for learning abstract rules, though it notes temporal interference and future work to push toward fully HD-only decoding.
Abstract
Human cognition excels at symbolic reasoning, deducing abstract rules from limited samples. This has been explained using symbolic and connectionist approaches, inspiring the development of a neuro-symbolic architecture that combines both paradigms. In parallel, recent studies have proposed the use of a "relational bottleneck" that separates object-level features from abstract rules, allowing learning from limited amounts of data . While powerful, it is vulnerable to the curse of compositionality meaning that object representations with similar features tend to interfere with each other. In this paper, we leverage hyperdimensional computing, which is inherently robust to such interference to build a compositional architecture. We adapt the "relational bottleneck" strategy to a high-dimensional space, incorporating explicit vector binding operations between symbols and relational representations. Additionally, we design a novel high-dimensional attention mechanism that leverages this relational representation. Our system benefits from the low overhead of operations in hyperdimensional space, making it significantly more efficient than the state of the art when evaluated on a variety of test datasets, while maintaining higher or equal accuracy.
