A Machine with Short-Term, Episodic, and Semantic Memory Systems

Taewoon Kim; Michael Cochez; Vincent François-Lavet; Mark Neerincx; Piek Vossen

A Machine with Short-Term, Episodic, and Semantic Memory Systems

Taewoon Kim, Michael Cochez, Vincent François-Lavet, Mark Neerincx, Piek Vossen

TL;DR

This work modeled an agent with short-term, episodic, and semantic memory systems, each of which is modeled with a knowledge graph, and shows that this deep Q-learning based agent successfully learns whether a long-term memory should be forgotten, or rather be stored in the episodic or semantic memory system.

Abstract

Inspired by the cognitive science theory of the explicit human memory systems, we have modeled an agent with short-term, episodic, and semantic memory systems, each of which is modeled with a knowledge graph. To evaluate this system and analyze the behavior of this agent, we designed and released our own reinforcement learning agent environment, "the Room", where an agent has to learn how to encode, store, and retrieve memories to maximize its return by answering questions. We show that our deep Q-learning based agent successfully learns whether a short-term memory should be forgotten, or rather be stored in the episodic or semantic memory systems. Our experiments indicate that an agent with human-like memory systems can outperform an agent without this memory structure in the environment.

A Machine with Short-Term, Episodic, and Semantic Memory Systems

TL;DR

Abstract

Paper Structure (17 sections, 2 equations, 7 figures, 2 tables, 2 algorithms)

This paper contains 17 sections, 2 equations, 7 figures, 2 tables, 2 algorithms.

Introduction
Methodology
The Room Environment
Human-Like Memory Systems
Memory Storage
Memory Retrieval
The Deep Q-learning Reinforcement Learning Agent
Experimental Setup
The Environment and Agent Parameters
The Deep Q-learning and Other Training Parameters
Results
Deep Q-learning Performance by Agent
Understanding What the Agent Learns
Related Work
Conclusions
...and 2 more sections

Figures (7)

Figure 1: The memory systems of the agent. The long-term (explicit) memory systems consist of episodic and semantic memory systems.
Figure 2: An episodic memory and a semantic memory represented as a knowledge graph.
Figure 3: The Q-network diagram, where the short-term $\bm{M}_{o}$, episodic $\bm{M}_{e}$, and semantic $\bm{M}_{s}$ memory systems are given as the initial input. The LSTM blocks output the last hidden states of the LSTMs. The final output is the state-action values of the $Q$ function.
Figure 4: An example of converting an episodic memory system into a KGE representation.
Figure 5: Training, validation, and test results of the agents with the memory capacity of 32. All of them were run five times with different random seeds. The standard deviation is shown together with the average. As for training, instead of total rewards per episode, we report loss over time since the varying epsilon value affects the rewards. The maximum total rewards per episode are 128, since in total of 128 questions are asked per episode.
...and 2 more figures

A Machine with Short-Term, Episodic, and Semantic Memory Systems

TL;DR

Abstract

A Machine with Short-Term, Episodic, and Semantic Memory Systems

Authors

TL;DR

Abstract

Table of Contents

Figures (7)