Exploring Neural Granger Causality with xLSTMs: Unveiling Temporal Dependencies in Complex Data
Harsh Poonia, Felix Divo, Kristian Kersting, Devendra Singh Dhami
TL;DR
This work tackles identifying non-linear, long-range Granger causal relations in multivariate time series by introducing GC-xLSTM, which combines a sparsity-enforcing initial projection with multiple xLSTM forecasters to recover the GC graph $(\mathcal{V},\mathcal{E})$. It contributes a novel reduction loss and proximal optimization that enforce strict input sparsity, enabling robust edge discovery while preserving forecasting performance. The authors provide theoretical discussion on the model’s approximation capabilities and demonstrate superior performance across six diverse datasets (e.g., Lorenz-96, fMRI, Molène, MoCap, and Company Fundamentals) compared with strong baselines, yielding higher accuracy, balanced accuracy, and AUROC. This approach offers scalable, interpretable GC detection in complex time series and has potential for broad applicability in science and industry where understanding temporal interdependencies is critical.
Abstract
Causality in time series can be challenging to determine, especially in the presence of non-linear dependencies. Granger causality helps analyze potential relationships between variables, thereby offering a method to determine whether one time series can predict-Granger cause-future values of another. Although successful, Granger causal methods still struggle with capturing long-range relations between variables. To this end, we leverage the recently successful Extended Long Short-Term Memory (xLSTM) architecture and propose Granger causal xLSTMs (GC-xLSTM). It first enforces sparsity between the time series components by using a novel dynamic loss penalty on the initial projection. Specifically, we adaptively improve the model and identify sparsity candidates. Our joint optimization procedure then ensures that the Granger causal relations are recovered robustly. Our experimental evaluation on six diverse datasets demonstrates the overall efficacy of GC-xLSTM.
