A Taxonomy of Recurrent Learning Rules
Guillermo Martín-Sánchez, Sander Bohté, Sebastian Otte
TL;DR
This work analyzes the gradient computation for recurrent networks by unifying Backpropagation Through Time (BPTT), Real-Time Recurrent Learning (RTRL), and the online, local learning rule e-prop. It formalizes a common computational-graph framework, derives RTRL from BPTT through re-expression of implicit and explicit recurrences, and then casts e-prop as a causal, local approximation that discards non-causal explicit interactions. A key contribution is the introduction of a family of m-order e-prop, which progressively reintroduces higher-order (more distant) temporal and cross-neuron dependencies to improve gradient accuracy while preserving causality and locality to varying degrees. The framework clarifies the trade-offs between computational cost, memory, and gradient fidelity, and highlights practical pathways for online learning in recurrent architectures including RSNNs and LSTMs.
Abstract
Backpropagation through time (BPTT) is the de facto standard for training recurrent neural networks (RNNs), but it is non-causal and non-local. Real-time recurrent learning is a causal alternative, but it is highly inefficient. Recently, e-prop was proposed as a causal, local, and efficient practical alternative to these algorithms, providing an approximation of the exact gradient by radically pruning the recurrent dependencies carried over time. Here, we derive RTRL from BPTT using a detailed notation bringing intuition and clarification to how they are connected. Furthermore, we frame e-prop within in the picture, formalising what it approximates. Finally, we derive a family of algorithms of which e-prop is a special case.
