Egocentric Visual Navigation through Hippocampal Sequences
Xiao-Xiong Lin, Yuk Hoi Yiu, Christian Leibold
TL;DR
The paper investigates how hippocampal theta sequences could emerge from intrinsic CA3 circuitry when driven by highly sparse dentate gyrus inputs. It introduces a minimal, biologically inspired model: a sparsified DG projection feeding a fixed CA3 sequence generator that acts as a shift-register reservoir, coupled to an actor–critic RL learner for egocentric navigation in a vision-based maze. Under sparse input, this CA3-based reservoir outperforms comparable LSTM cores, producing place-field–like tuning, orthogonalized DG representations, and distance-dependent population kernels, while performance degrades for dense inputs where LSTMs excel. The work provides a mechanistic account of hippocampal sequences with tangible RL implications, predicting how sequence length, sparsity, and remapping shape navigation and neural representations in the brain and offering a scalable inductive bias for reinforcement learning in sparse sensory regimes.
Abstract
Sequential activation of place-tuned neurons in an animal during navigation is typically interpreted as reflecting the sequence of input from adjacent positions along the trajectory. More recent theories about such place cells suggest sequences arise from abstract cognitive objectives like planning. Here, we propose a mechanistic and parsimonious interpretation to complement these ideas: hippocampal sequences arise from intrinsic recurrent circuitry that propagates activity without readily available input, acting as a temporal memory buffer for extremely sparse inputs. We implement a minimal sequence generator inspired by neurobiology and pair it with an actor-critic learner for egocentric visual navigation. Our agent reliably solves a continuous maze without explicit geometric cues, with performance depending on the length of the recurrent sequence. Crucially, the model outperforms LSTM cores under sparse input conditions (16 channels, ~2.5% activity), but not under dense input, revealing a strong interaction between representational sparsity and memory architecture. In contrast to LSTM agents, hidden sequence units develop localized place fields, distance-dependent spatial kernels, and task-dependent remapping, while inputs orthogonalize and spatial information increases across layers. These phenomena align with neurobiological data and are causal to performance. Together, our results show that sparse input synergizes with sequence-generating dynamics, providing both a mechanistic account of place cell sequences in the mammalian hippocampus and a simple inductive bias for reinforcement learning based on sparse egocentric inputs in navigation tasks.
