Attention for Causal Relationship Discovery from Biological Neural Dynamics
Ziyu Lu, Anika Tabassum, Shruti Kulkarni, Lu Mi, J. Nathan Kutz, Eric Shea-Brown, Seung-Hwan Lim
TL;DR
The paper addresses learning Granger causality in nonlinear, dynamic neural networks by introducing Causalformer, a transformer-based model whose decoder cross attention encodes directed interactions among neurons during multivariate time-series forecasting. It demonstrates, on synthetic data generated with the Izhikevich model, that the cross-attention-derived connectivity can match or surpass traditional Multivariate Granger Causality (MVGC) methods, particularly as network size grows. The approach provides a scalable, nonlinear alternative for causal representation learning in neuroscience and offers a framework for interpreting inter-neuronal influences from large-scale neural population recordings. While promising, the work notes limitations related to real-world data non-stationarity, partial observability, and identifiability, outlining avenues for future enhancement such as handling spike-train data and broader methodological benchmarks.
Abstract
This paper explores the potential of the transformer models for learning Granger causality in networks with complex nonlinear dynamics at every node, as in neurobiological and biophysical networks. Our study primarily focuses on a proof-of-concept investigation based on simulated neural dynamics, for which the ground-truth causality is known through the underlying connectivity matrix. For transformer models trained to forecast neuronal population dynamics, we show that the cross attention module effectively captures the causal relationship among neurons, with an accuracy equal or superior to that for the most popular Granger causality analysis method. While we acknowledge that real-world neurobiology data will bring further challenges, including dynamic connectivity and unobserved variability, this research offers an encouraging preliminary glimpse into the utility of the transformer model for causal representation learning in neuroscience.
