LLMForecaster: Improving Seasonal Event Forecasts with Unstructured Textual Data
Hanyu Zhang, Chuck Arvin, Dmitry Efimov, Michael W. Mahoney, Dominique Perrault-Joncas, Shankar Ramasubramanian, Andrew Gordon Wilson, Malcolm Wolff
TL;DR
Problem: Traditional demand forecasting often underutilizes unstructured textual information about products, limiting anticipation of holiday-driven surges. Approach: Introduces LLMForecaster as a forecast post-processor that, given base forecast $f_{i,t}$, text features and numeric features, outputs $f^*_{i,t} = e^{\hat{\lambda}_{i,t}} f_{i,t}$ by learning $\hat{\lambda}_{i,t}$ via a fine-tuned LLM with LoRA. Contributions/findings: In an industry-scale retail setting, the method yields statistically significant improvements in forecast accuracy across multiple holidays, particularly when using the Holiday-Encoding Prompt. Impact: Enables better inventory planning and reduces stockouts during seasonal events by augmenting existing forecasting pipelines with unstructured data.
Abstract
Modern time-series forecasting models often fail to make full use of rich unstructured information about the time series themselves. This lack of proper conditioning can lead to obvious model failures; for example, models may be unaware of the details of a particular product, and hence fail to anticipate seasonal surges in customer demand in the lead up to major exogenous events like holidays for clearly relevant products. To address this shortcoming, this paper introduces a novel forecast post-processor -- which we call LLMForecaster -- that fine-tunes large language models (LLMs) to incorporate unstructured semantic and contextual information and historical data to improve the forecasts from an existing demand forecasting pipeline. In an industry-scale retail application, we demonstrate that our technique yields statistically significantly forecast improvements across several sets of products subject to holiday-driven demand surges.
