LSTPrompt: Large Language Models as Zero-Shot Time Series Forecasters by Long-Short-Term Prompting
Haoxin Liu, Zhiyuan Zhao, Jindong Wang, Harshavardhan Kamarthi, B. Aditya Prakash
TL;DR
This work addresses zero-shot time-series forecasting by reframing LLM prompting through two novel components: TimeDecomp, which splits forecasting into short-term and long-term subtasks with distinct reasoning strategies, and TimeBreath, which inserts periodic breaks to reassess forecasting mechanisms. Together, LSTPrompt provides a structured, CoT-based prompt that guides LLMs to leverage multiple forecasting strategies within a horizon $H$ while learning from past data over a lookback window $L$. Empirical results across benchmark and concurrent datasets show that LSTPrompt achieves frequent top-zero-shot performance, often surpassing supervised TSF models in drift-heavy settings and competing with TSF-specific foundation models. The findings highlight the potential of tailored prompting strategies to unlock robust zero-shot TSF with general-purpose LLMs, while also acknowledging limitations in interpretability and potential information leakage in dataset prompts. Overall, LSTPrompt advances zero-shot TSF by embedding TSF-aware reasoning and adaptive mechanism reassessment into prompt design, with practical implications for efficient, scalable forecasting.
Abstract
Time-series forecasting (TSF) finds broad applications in real-world scenarios. Prompting off-the-shelf Large Language Models (LLMs) demonstrates strong zero-shot TSF capabilities while preserving computational efficiency. However, existing prompting methods oversimplify TSF as language next-token predictions, overlooking its dynamic nature and lack of integration with state-of-the-art prompt strategies such as Chain-of-Thought. Thus, we propose LSTPrompt, a novel approach for prompting LLMs in zero-shot TSF tasks. LSTPrompt decomposes TSF into short-term and long-term forecasting sub-tasks, tailoring prompts to each. LSTPrompt guides LLMs to regularly reassess forecasting mechanisms to enhance adaptability. Extensive evaluations demonstrate consistently better performance of LSTPrompt than existing prompting methods, and competitive results compared to foundation TSF models.
