A Framework for Inference Inspired by Human Memory Mechanisms
Xiangyu Zeng, Jie Lin, Piao Hu, Ruizheng Huang, Zhicheng Zhang
TL;DR
The paper advances PMI, a cognitive-inspired framework that unifies perception, dual-layer memory (working memory $M_w$ and long-term memory $M_l$), and inference to improve relational reasoning. It introduces external (competitive, top-$k$) and internal (outer-product) memory write channels, and an inference mechanism that retrieves from both memories via content-based addressing and fuses results to form the current understanding $U^t$ with $U^t = \alpha U_w^t + (1-\alpha) U_{wl}^t$. Across tasks—Sort-of-CLEVR, bAbI, equilateral triangle detection, language modeling, and CIFAR-10—the PMI-TR variant consistently improves convergence speed and accuracy, with visualizations indicating memory consolidation as a core contributor to performance. The results suggest memory-augmented architectures anchored in MMS and GWT can enhance both perception and reasoning, with applicability to diverse modalities and backbone models like Transformers and CNNs. Key contributions include a global, dual-level memory framework, differentiable memory precipitation, and empirical evidence of improved relational inference and long-range dependency handling, supported by attention-pattern analyses. $M_w$ and $M_l$ are central to the mechanism, with $M_w \in \mathbb{R}^{N\times D_m}$ and $M_l \in \mathbb{R}^{C\times N\times D_m}$, and the final cognition blends memories via $U^t = \alpha U_w^t + (1-\alpha) U_{wl}^t$.
Abstract
How humans and machines make sense of current inputs for relation reasoning and question-answering while putting the perceived information into context of our past memories, has been a challenging conundrum in cognitive science and artificial intelligence. Inspired by human brain's memory system and cognitive architectures, we propose a PMI framework that consists of perception, memory and inference components. Notably, the memory module comprises working and long-term memory, with the latter endowed with a higher-order structure to retain extensive and complex relational knowledge and experience. Through a differentiable competitive write access, current perceptions update working memory, which is later merged with long-term memory via outer product associations, reducing information conflicts and averting memory overflow. In the inference module, relevant information is retrieved from two separate memory origins and associatively integrated to attain a more comprehensive and precise interpretation of current perceptions. We exploratively apply our PMI to improve prevailing Transformers and CNN models on question-answering tasks like bAbI-20k and Sort-of-CLEVR datasets, as well as detecting equilateral triangles, language modeling and image classification tasks, and in each case, our PMI enhancements consistently outshine their original counterparts significantly. Visualization analyses reveal that relational memory consolidation, along with the interaction and integration of information from diverse memory sources, substantially contributes to the model effectiveness on inference tasks.
