ARMAr-LASSO: Mitigating the Impact of Predictor Serial Correlation on the LASSO
Simone Tonini, Francesca Chiaromonte, Alessandro Giovannelli
TL;DR
The paper identifies a fundamental challenge for LASSO in time-series regressions: serial dependence in predictors and errors creates spurious correlations that impair estimation and forecasting. It introduces ARMAr-LASSO, which pre-whitens predictors via ARMA filtering and then applies LASSO to the ARMA residuals plus a small number of lagged responses, providing both finite-sample and asymptotic guarantees. The authors derive a density approximation for sample correlations under AR(1) structure, establish LASSO oracle inequalities and high-dimensional results under near-epoch dependence, and illustrate substantial gains through simulations and a Euro-area macroeconomic forecasting application. The method yields more parsimonious, accurate models and forecasts than several LASSO-based benchmarks, demonstrating robustness to factor structure and various ARMA specifications. These results offer a practical, theoretically grounded tool for high-dimensional time-series modeling where serial dependence threatens standard penalized regression approaches.
Abstract
We explore estimation and forecast accuracy for sparse linear models, focusing on scenarios where both predictors and errors carry serial correlations. We establish a clear link between predictor serial correlation and the performance of the LASSO, showing that even orthogonal or weakly correlated stationary AR processes can lead to significant spurious correlations due to their serial correlations. To address this challenge, we propose a novel approach named ARMAr-LASSO ({\em ARMA residuals LASSO}), which applies the LASSO to predictors that have been pre-whitened with ARMA filters and lags of dependent variable. We derive both asymptotic results and oracle inequalities for the ARMAr-LASSO, demonstrating that it effectively reduces estimation errors while also providing an effective forecasting and feature selection strategy. Our findings are supported by extensive simulations and an application to real-world macroeconomic data, which highlight the superior performance of the ARMAr-LASSO for handling sparse linear models in the context of time series.
