A recurrent vision transformer shows signatures of primate visual attention

Jonathan Morgan; Badr Albanna; James P. Herman

TL;DR

The paper presents a Recurrent Vision Transformer that injects spatial working memory into self-attention to emulate primate visual attention. Trained with sparse reinforcement learning on a cued orientation-change task, the model demonstrates classic attentional benefits, anticipatory memory-guided allocation, and causal-like perturbation effects, closely mirroring primate data. A key finding is that multiplicative memory feedback within a memory-attention loop is essential to reproduce the full spectrum of primate-like attention signatures. This work advances biologically plausible AI by coupling memory, attention, and reward-driven learning, offering a framework to study how perception, memory, and decision-making co-evolve in dynamic environments.

Abstract

Attention is fundamental to both biological and artificial intelligence, yet research on animal attention and AI self-attention remains largely disconnected. We propose a Recurrent Vision Transformer (Recurrent ViT) that integrates self-attention with recurrent memory, allowing both current inputs and stored information to guide attention allocation. Trained solely via sparse reward feedback on a spatially cued orientation-change detection task, a paradigm used in primate studies, our model exhibits primate-like signatures of attention, including improved accuracy and faster responses for cued stimuli that scale with cue validity. Analysis of self-attention maps reveals dynamic spatial prioritization with reactivation prior to expected changes, and targeted perturbations produce performance shifts similar to those observed in primate frontal eye fields and superior colliculus. These findings demonstrate that incorporating recurrent feedback into self-attention can capture key aspects of primate visual attention.
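To make the idea of recurrent memory feeding back into self-attention concrete, here is a minimal NumPy sketch of one possible reading. It is not the authors' implementation. The function name `recurrent_attention_step`, the `1 + tanh` gate, the leaky memory update, and all weight shapes are assumptions chosen for illustration. The stored memory multiplicatively scales the current tokens before queries, keys, and values are computed, and the attention output is then written back into memory for the next time step.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def recurrent_attention_step(tokens, memory, params):
    """One time step of a hypothetical memory-attention loop.

    tokens: (n_tokens, d) current visual input (e.g. patch embeddings)
    memory: (n_tokens, d) state carried over from the previous step
    params: tuple (Wq, Wk, Wv, Wm) of (d, d) weight matrices
    """
    Wq, Wk, Wv, Wm = params
    # Assumed multiplicative feedback: the gate is 1 when memory is zero,
    # so the first step reduces to ordinary self-attention.
    gate = 1.0 + np.tanh(memory @ Wm)
    gated = tokens * gate
    q, k, v = gated @ Wq, gated @ Wk, gated @ Wv
    attn = softmax(q @ k.T / np.sqrt(q.shape[-1]))  # (n_tokens, n_tokens)
    out = attn @ v
    # Assumed leaky recurrent update; keeps memory bounded in [-1, 1].
    new_memory = 0.5 * memory + 0.5 * np.tanh(out)
    return out, new_memory, attn

rng = np.random.default_rng(0)
n_tokens, d = 4, 8
params = tuple(rng.normal(scale=0.3, size=(d, d)) for _ in range(4))
memory = np.zeros((n_tokens, d))
for t in range(3):  # three frames of a trial
    tokens = rng.normal(size=(n_tokens, d))
    out, memory, attn = recurrent_attention_step(tokens, memory, params)
```

In this sketch each row of `attn` is a distribution over token positions, loosely analogous to the self-attention maps the abstract describes. Because the gate depends on memory, a cue seen on an earlier frame can bias where attention lands on later frames, even when the current input carries no cue.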
