Spatial-Aware Decision-Making with Ring Attractors in Reinforcement Learning Systems
Marcos Negre Saura, Richard Allmendinger, Wei Pan, Theodore Papamarkou
TL;DR
The paper addresses efficient, uncertainty-aware action selection in reinforcement learning for spatially structured tasks. It introduces ring attractors to encode the action space on a circular topology, implemented via a CTRNN-based exogenous model and a DL-based RA module integrated into DRL, with uncertainty carried through a Bayesian linear layer. Key contributions include a novel spatial-action policy mechanism, intrinsic uncertainty handling, and a reusable DL component; the method achieves a state-of-the-art $53\%$ improvement on the Atari 100k benchmark over baselines, particularly in spatial games. The results suggest that explicit spatial action encoding and attractor-based temporal filtering improve learning speed and robustness, with potential extensions to multi-agent and safe RL domains.
Abstract
Ring attractors, mathematical models inspired by neural circuit dynamics, provide a biologically plausible mechanism to improve learning speed and accuracy in Reinforcement Learning (RL). Serving as specialized brain-inspired structures that encode spatial information and uncertainty, ring attractors explicitly encode the action space, facilitate the organization of neural activity, and enable the distribution of spatial representations across the neural network in the context of Deep Reinforcement Learning (DRL). These structures also provide temporal filtering that stabilizes action selection during exploration, for example, by preserving the continuity between rotation angles in robotic control or adjacency between tactical moves in game-like environments. The application of ring attractors in the action selection process involves mapping actions to specific locations on the ring and decoding the selected action based on neural activity. We investigate the application of ring attractors by both building an exogenous model and integrating them as part of DRL agents. Our approach significantly improves state-of-the-art performance on the Atari 100k benchmark, achieving a 53% increase in performance over selected baselines.
