Table of Contents
Fetching ...

A Biologically Interpretable Cognitive Architecture for Online Structuring of Episodic Memories into Cognitive Maps

E. A. Dzhivelikian, A. I. Panov

TL;DR

This work tackles how cognitive maps and hippocampal-like structures can emerge from online, biologically plausible learning rules. It proposes a two-level memory architecture where the first level perfectly stores episodic sequences via a deterministic HMM, and a second level merges first-level states into clusters using Successor Features (SF) similarity, enabling generalization while preserving information with a discount factor $\gamma \in (0,1)$. In a partially observable grid-world, SF-guided merges improve next-observation prediction and better reflect the environment's transition structure, while reducing the second-level state space. Overall, the approach advances biologically grounded cognitive mapping in AI by integrating local Hebbian-like learning with SF-based memory structuring and a two-tier memory hierarchy.

Abstract

Cognitive maps provide a powerful framework for understanding spatial and abstract reasoning in biological and artificial agents. While recent computational models link cognitive maps to hippocampal-entorhinal mechanisms, they often rely on global optimization rules (e.g., backpropagation) that lack biological plausibility. In this work, we propose a novel cognitive architecture for structuring episodic memories into cognitive maps using local, Hebbian-like learning rules, compatible with neural substrate constraints. Our model integrates the Successor Features framework with episodic memories, enabling incremental, online learning through agent-environment interaction. We demonstrate its efficacy in a partially observable grid-world, where the architecture autonomously organizes memories into structured representations without centralized optimization. This work bridges computational neuroscience and AI, offering a biologically grounded approach to cognitive map formation in artificial adaptive agents.

A Biologically Interpretable Cognitive Architecture for Online Structuring of Episodic Memories into Cognitive Maps

TL;DR

This work tackles how cognitive maps and hippocampal-like structures can emerge from online, biologically plausible learning rules. It proposes a two-level memory architecture where the first level perfectly stores episodic sequences via a deterministic HMM, and a second level merges first-level states into clusters using Successor Features (SF) similarity, enabling generalization while preserving information with a discount factor . In a partially observable grid-world, SF-guided merges improve next-observation prediction and better reflect the environment's transition structure, while reducing the second-level state space. Overall, the approach advances biologically grounded cognitive mapping in AI by integrating local Hebbian-like learning with SF-based memory structuring and a two-tier memory hierarchy.

Abstract

Cognitive maps provide a powerful framework for understanding spatial and abstract reasoning in biological and artificial agents. While recent computational models link cognitive maps to hippocampal-entorhinal mechanisms, they often rely on global optimization rules (e.g., backpropagation) that lack biological plausibility. In this work, we propose a novel cognitive architecture for structuring episodic memories into cognitive maps using local, Hebbian-like learning rules, compatible with neural substrate constraints. Our model integrates the Successor Features framework with episodic memories, enabling incremental, online learning through agent-environment interaction. We demonstrate its efficacy in a partially observable grid-world, where the architecture autonomously organizes memories into structured representations without centralized optimization. This work bridges computational neuroscience and AI, offering a biologically grounded approach to cognitive map formation in artificial adaptive agents.

Paper Structure

This paper contains 7 sections, 7 equations, 9 figures, 3 algorithms.

Figures (9)

  • Figure 1: Dependence of the similarity between episodic memory SFs and the true SFs on the size and purity of the first-level state cluster. Results are averaged over five state partitions and three 10x10 grid-world environments with 10 colors and random coloring. The colored shading corresponds to the 95% confidence interval.
  • Figure 2: Dependence of the accuracy of cluster merging based on SFs on their size and purity. Results are averaged over five state partitions and three 10x10 grid-world environments with 10 colors and random coloring. The colored shading corresponds to the 95% confidence interval.
  • Figure 3: Mean merging accuracy (denoted by colour) based on SF representations as a function of the true position in the 10x10 grid-world environment for three different maps with ten observation states, indicated by numbers in each position. For each map and cluster size, results are averaged over five different cluster partitions.
  • Figure 4: Distribution of the purity of randomly formed clusters (random partitions) of first-level states, depending on cluster size and the number of clones. Results are shown for 1000.0 random partitions.
  • Figure 5: Factor graph of the memory model with mergers and a possible neural implementation of the merging process.
  • ...and 4 more figures

Theorems & Definitions (7)

  • Definition 2.1: Partially Observable Environment
  • Definition 2.2: World Model
  • Definition 2.3: Policy
  • Definition 2.4: Discrete Partially Observable Environment
  • Definition 2.5: Compact Subgraph
  • Definition 2.6: Markov Subspace
  • Definition 2.7: Markov Radius