EchoLSTM: A Self-Reflective Recurrent Network for Stabilizing Long-Range Memory
Prasanth K K, Shubham Sharma
TL;DR
The paper tackles the difficulty of preserving long-range memory in recurrent models under noisy inputs by introducing Output-Conditioned Gating, a self-reflective mechanism that modulates memory gates with the model's own past inferences. The EchoLSTM combines this gating with a lightweight attention layer to retain critical information over long sequences, achieving strong results on synthetic distractor tasks and the ListOps benchmark while remaining parameter-efficient. Theoretical analysis and ablations reveal that OCG stabilizes gate dynamics and enhances gradient flow, and empirical results show robust memory retention and attention-driven denoising. The approach offers a practical, energy-efficient alternative to Transformers for long-sequence modeling with broad potential applications and impact.
Abstract
Standard Recurrent Neural Networks, including LSTMs, struggle to model long-range dependencies, particularly in sequences containing noisy or misleading information. We propose a new architectural principle, Output-Conditioned Gating, which enables a model to perform self-reflection by modulating its internal memory gates based on its own past inferences. This creates a stabilizing feedback loop that enhances memory retention. Our final model, the EchoLSTM, integrates this principle with an attention mechanism. We evaluate the EchoLSTM on a series of challenging benchmarks. On a custom-designed Distractor Signal Task, the EchoLSTM achieves 69.0% accuracy, decisively outperforming a standard LSTM baseline by 33 percentage points. Furthermore, on the standard ListOps benchmark, the EchoLSTM achieves performance competitive with a modern Transformer model, 69.8% vs. 71.8%, while being over 5 times more parameter-efficient. A final Trigger Sensitivity Test provides qualitative evidence that our model's self-reflective mechanism leads to a fundamentally more robust memory system.
