CAIFormer: A Causal Informed Transformer for Multivariate Time Series Forecasting

Xingyu Zhang; Wenwen Qiang; Siyu Zhao; Huijie Guo; Jiangmeng Li; Chuxiong Sun; Changwen Zheng

TL;DR

This work addresses multivariate time series forecasting by distinguishing the causal roles of historical variables instead of treating all histories equally. It introduces an all-to-one paradigm and CAIFormer, a Causal Informed Transformer that partitions each target's history into an Endogenous Sub-segment, a Direct Causal Sub-segment, and a Collider Causal Sub-segment, and discards the Spurious Correlation Sub-segment. CAIFormer uses three blocks, the Endogenous Sub-segment Prediction Block (ESPB), the Direct Causal Sub-segment Prediction Block (DCSPB), and the Collider Causal Sub-segment Prediction Block (CCSPB), together with DAG-guided masks derived from the PC algorithm, to separate intrinsic dynamics, direct causal influences, and collider-driven dependencies. A collider constraint, implemented via a projection, is intended to reduce generalization gaps. Empirical results on six real-world datasets with horizons up to $S=720$ show improved accuracy and robustness over strong baselines, and ablations confirm the contribution of each component and the benefit of using the learned causal structure to guide attention.

Abstract

Most existing multivariate time series forecasting methods adopt an all-to-all paradigm that feeds all variable histories into a unified model to predict their future values without distinguishing their individual roles. However, this undifferentiated paradigm makes it difficult to identify variable-specific causal influences and often entangles causally relevant information with spurious correlations. To address this limitation, we propose an all-to-one forecasting paradigm that predicts each target variable separately. Specifically, we first construct a Structural Causal Model from observational data. Then, for each target variable, we partition the historical sequence into four sub-segments according to the inferred causal structure: endogenous, direct causal, collider causal, and spurious correlation. The prediction relies solely on the first three, causally relevant sub-segments, and the spurious correlation sub-segment is excluded. Furthermore, we propose the Causal Informed Transformer (CAIFormer), a novel forecasting model comprising three components: the Endogenous Sub-segment Prediction Block, the Direct Causal Sub-segment Prediction Block, and the Collider Causal Sub-segment Prediction Block, which process the endogenous, direct causal, and collider causal sub-segments, respectively. Their outputs are then combined to produce the final prediction. Extensive experiments on multiple benchmark datasets demonstrate the effectiveness of CAIFormer.
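The per-target partition described above can be illustrated with a minimal sketch. The abstract does not state the exact assignment rules, so the rules below are assumptions made only for illustration: the endogenous group is the target itself, the direct causal group is the target's parents in the DAG, the collider causal group is the other parents of the target's children (the child acting as the collider), and everything else is treated as spurious. The function names `partition_variables` and `keep_mask`, the adjacency-matrix convention, and the toy graph are all hypothetical and are not taken from the paper.

```python
import numpy as np


def partition_variables(adj: np.ndarray, target: int) -> dict:
    """Split variable indices into four groups for one target.

    Assumed convention: adj[i, j] == 1 means a directed edge i -> j.
    The grouping rules are illustrative assumptions, not the paper's definitions.
    """
    n = adj.shape[0]
    # Assumed rule: direct causal variables are the target's parents.
    parents = set(np.flatnonzero(adj[:, target]).tolist())
    # Assumed rule: collider causal variables are other parents of the
    # target's children, where the shared child is the collider.
    children = np.flatnonzero(adj[target, :])
    co_parents = set()
    for child in children:
        co_parents.update(np.flatnonzero(adj[:, child]).tolist())
    co_parents -= parents | {target}
    # Everything left over is treated as spurious and discarded.
    spurious = set(range(n)) - parents - co_parents - {target}
    return {
        "endogenous": [target],
        "direct": sorted(parents),
        "collider": sorted(co_parents),
        "spurious": sorted(spurious),
    }


def keep_mask(parts: dict, n: int) -> np.ndarray:
    """Boolean mask over variables: True for the three retained groups."""
    mask = np.zeros(n, dtype=bool)
    for key in ("endogenous", "direct", "collider"):
        mask[np.array(parts[key], dtype=int)] = True
    return mask


# Toy DAG over 5 variables: 0 -> 2, 2 -> 3, 1 -> 3; variable 4 is isolated.
adj = np.zeros((5, 5), dtype=int)
adj[0, 2] = 1
adj[2, 3] = 1
adj[1, 3] = 1

parts = partition_variables(adj, target=2)
mask = keep_mask(parts, n=5)
print(parts)
print(mask)
```

In this toy graph, variable 0 is a parent of the target (variable 2), and variable 1 shares the child 3 with the target, so both are kept alongside the target itself, while variables 3 and 4 are masked out. A mask of this kind could be used to restrict which variable histories a forecasting block sees. How CAIFormer actually builds and applies its DAG-guided masks is not specified in the text above.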
