Semantic-Enhanced Time-Series Forecasting via Large Language Models
Hao Liu, Chun Yang, Zhang xiaoxing, Xiaobin Zhu
TL;DR
SE-LLM addresses the modality gap between time-series data and linguistic representations by introducing Temporal-Semantic Cross-Correlation (TSCC) and Time-Adapter to fuse temporal structure into LLM-based forecasting while keeping the backbone LLM frozen for efficiency. The TSCC component uses cross-attention and AM-VAE-based anomaly modeling to enrich the semantic space with temporal patterns, and the Time-Adapter injects long- and short-term temporal dependencies into attention via parallel LSTMs. A sliding-window preprocessing further reduces computational costs, enabling scalable forecasting across diverse datasets and horizons. Empirical results across ETTh1, Weather, Traffic, ECL, Solar, and M4 demonstrate state-of-the-art accuracy, robust zero-shot generalization, and notable efficiency gains over prior methods.
Abstract
Time series forecasting plays a significant role in finance, energy, meteorology, and IoT applications. Recent studies have leveraged the generalization capabilities of large language models (LLMs) to adapt to time series forecasting, achieving promising performance. However, existing studies focus on token-level modal alignment, instead of bridging the intrinsic modality gap between linguistic knowledge structures and time series data patterns, greatly limiting the semantic representation. To address this issue, we propose a novel Semantic-Enhanced LLM (SE-LLM) that explores the inherent periodicity and anomalous characteristics of time series to embed into the semantic space to enhance the token embedding. This process enhances the interpretability of tokens for LLMs, thereby activating the potential of LLMs for temporal sequence analysis. Moreover, existing Transformer-based LLMs excel at capturing long-range dependencies but are weak at modeling short-term anomalies in time-series data. Hence, we propose a plugin module embedded within self-attention that models long-term and short-term dependencies to effectively adapt LLMs to time-series analysis. Our approach freezes the LLM and reduces the sequence dimensionality of tokens, greatly reducing computational consumption. Experiments demonstrate the superiority performance of our SE-LLM against the state-of-the-art (SOTA) methods.
