HuiduRep: A Robust Self-Supervised Framework for Learning Neural Representations from Extracellular Recordings
Feng Cao, Zishuo Feng, Jicong Zhang, Wei Shi
TL;DR
HuiduRep addresses spike-sorting challenges under low SNR and electrode drift by marrying contrastive self-supervision with a denoising autoencoder in a MoCo-v3–style framework. A transformer-based encoder processes physiologically augmented views, and a Gaussian Mixture Model clusters learned spike representations to yield a label-free sorting pipeline. Across diverse datasets, including synthetic drift and real ground-truth recordings, HuiduRep demonstrates robust performance and often outperforms state-of-the-art tools like KiloSort4 and MountainSort5, with ablations underscoring the DAE’s role in cross-domain robustness. The work suggests a promising direction for generalizable extracellular signal processing and could extend to other bioelectrical signals such as EMG, ECoG, and EEG.
Abstract
Extracellular recordings are transient voltage fluctuations in the vicinity of neurons, serving as a fundamental modality in neuroscience for decoding brain activity at single-neuron resolution. Spike sorting, the process of attributing each detected spike to its corresponding neuron, is a pivotal step in brain sensing pipelines. However, it remains challenging under low signal-to-noise ratio (SNR), electrode drift and cross-session variability. In this paper, we propose HuiduRep, a robust self-supervised representation learning framework that extracts discriminative and generalizable features from extracellular recordings. By integrating contrastive learning with a denoising autoencoder, HuiduRep learns latent representations that are robust to noise and drift. With HuiduRep, we develop a spike sorting pipeline that clusters spike representations without ground truth labels. Experiments on hybrid and real-world datasets demonstrate that HuiduRep achieves strong robustness. Furthermore, the pipeline outperforms state-of-the-art tools such as KiloSort4 and MountainSort5. These findings demonstrate the potential of self-supervised spike representation learning as a foundational tool for robust and generalizable processing of extracellular recordings. Code is available at: https://github.com/IgarashiAkatuki/HuiduRep
