Deep reinforcement learning with time-scale invariant memory
Md Rysul Kabir, James Mochizuki-Freeman, Zoran Tiganj
TL;DR
This work addresses how artificial agents can learn temporal relationships across diverse time scales by introducing a scale-invariant memory mechanism. It builds a CogRNN that uses a real-domain Laplace transform and a learned inverse transform to produce a log-compressed, time-cell–like memory, enabling robust, scale-free learning in deep RL. Across interval timing, discrimination, DMS, and reproduction tasks, CogRNN matches or exceeds LSTM/RNN performance and generalizes across temporal scales, with neural activity mirroring biological time cells. The approach offers a principled bridge between neuroscience and AI, suggesting scale-invariant representations as a path to rapid adaptation in changing temporal environments.
Abstract
The ability to estimate temporal relationships is critical for both animals and artificial agents. Cognitive science and neuroscience provide remarkable insights into behavioral and neural aspects of temporal credit assignment. In particular, scale invariance of learning dynamics, observed in behavior and supported by neural data, is one of the key principles that governs animal perception: proportional rescaling of temporal relationships does not alter the overall learning efficiency. Here we integrate a computational neuroscience model of scale invariant memory into deep reinforcement learning (RL) agents. We first provide a theoretical analysis and then demonstrate through experiments that such agents can learn robustly across a wide range of temporal scales, unlike agents built with commonly used recurrent memory architectures such as LSTM. This result illustrates that incorporating computational principles from neuroscience and cognitive science into deep neural networks can enhance adaptability to complex temporal dynamics, mirroring some of the core properties of human learning.
