Time Cell Inspired Temporal Codebook in Spiking Neural Networks for Enhanced Image Generation
Linghao Feng, Dongcheng Zhao, Sicheng Shen, Yiting Dong, Guobin Shen, Yi Zeng
TL;DR
The paper tackles the challenge of temporally coherent generative modeling with spiking neural networks by introducing a hippocampal time cell–inspired temporal codebook within a Spiking VQ-VAE. It combines spike-based encoding, a discrete temporal latent space, and autoregressive generation via transformers (with both spiking and non-spiking variants) to produce high-quality, temporally consistent images across diverse datasets, including high-resolution neuromorphic data. Empirical results show state-of-the-art performance among SNN-based generative models and clear benefits from incorporating temporal information, supported by ablation and destruction studies that highlight the codebook’s critical role. The work underscores the importance of temporal dynamics in neuromorphic generation and points to scalable, temporally aware SNN architectures as a promising direction for future research.
Abstract
This paper presents a novel approach that uses Spiking Neural Networks (SNNs) to construct a Vector Quantized Variational Autoencoder (VQ-VAE) with a temporal codebook inspired by hippocampal time cells. This design captures and exploits temporal dependencies, significantly enhancing the generative capabilities of SNNs. Neuroscientific research has identified hippocampal "time cells" that fire sequentially during temporally structured experiences. Our temporal codebook emulates this behavior: as input stimuli pass through it, similarity measures determine which time cell populations are activated. We conducted extensive experiments on standard benchmark datasets, including MNIST, FashionMNIST, CIFAR10, CelebA, and downsampled LSUN Bedroom, to validate our model's performance. Furthermore, we evaluated the effectiveness of the temporal codebook on the neuromorphic datasets NMNIST and DVS-CIFAR10, and demonstrated the model's capability on high-resolution datasets such as CelebA-HQ, LSUN Bedroom, and LSUN Church. The experimental results indicate that our method consistently outperforms existing SNN-based generative models across multiple datasets, achieving state-of-the-art performance. Notably, our approach excels at generating high-resolution and temporally consistent data, underscoring the crucial role of temporal information in SNN-based generative modeling.
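The similarity-based lookup described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes each codebook entry is a short spike sequence over T time steps, and that similarity is the summed squared difference across all time steps and channels. The names `temporal_distance` and `quantize`, and the toy codebook values, are hypothetical.

```python
def temporal_distance(seq_a, seq_b):
    """Sum of squared differences over all time steps and channels."""
    return sum(
        (a - b) ** 2
        for step_a, step_b in zip(seq_a, seq_b)
        for a, b in zip(step_a, step_b)
    )


def quantize(feature_seq, codebook):
    """Return (index, entry) of the codebook sequence closest to feature_seq."""
    index = min(
        range(len(codebook)),
        key=lambda k: temporal_distance(feature_seq, codebook[k]),
    )
    return index, codebook[index]


# Hypothetical toy codebook: K=3 entries, T=4 time steps, D=2 channels.
# Each entry mimics a population that fires at a different point in time.
codebook = [
    [[1, 0], [0, 0], [0, 0], [0, 0]],  # fires early
    [[0, 0], [1, 0], [0, 1], [0, 0]],  # fires mid-sequence
    [[0, 0], [0, 0], [0, 0], [1, 1]],  # fires late
]

# An encoder output (assumed binary spikes) with activity at the second step.
feature = [[0, 0], [1, 0], [0, 0], [0, 0]]
index, code = quantize(feature, codebook)
```

Because the distance is accumulated over the whole sequence, the selected entry depends on when spikes occur, not only on which channels are active. That timing sensitivity is the property the temporal codebook is meant to capture.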
