Forecasting GDP in Europe with Textual Data
Luca Barbaglia, Sergio Consoli, Sebastiano Manzan
TL;DR
The paper develops FiGAS-based, aspect-specific sentiment indicators from a large multilingual news corpus to forecast GDP and other macro variables for five European economies. By integrating these indicators through a mixed-frequency framework (U-MIDAS) and robust inference (double-lasso with multiple testing adjustments), the authors demonstrate incremental predictive power over standard macro and survey signals, with effects varying by country and horizon. Out-of-sample tests reveal substantial reductions in forecast errors, particularly at longer horizons and during recessions, while robustness checks extend findings to unemployment, IPI, and CPI. The work highlights the value of high-frequency, text-derived sentiment for real-time economic monitoring and suggests avenues for further refinement, including expanded vocabularies and nonlinear modeling. The practical impact lies in providing forecasters with timely, country-specific signals that complement traditional indicators, improving nowcasting and forecasting under real-time data constraints.
Abstract
We evaluate the informational content of news-based sentiment indicators for forecasting Gross Domestic Product (GDP) and other macroeconomic variables of the five major European economies. Our data set includes over 27 million articles for 26 major newspapers in 5 different languages. The evidence indicates that these sentiment indicators are significant predictors to forecast macroeconomic variables and their predictive content is robust to controlling for other indicators available to forecasters in real-time.
