A general framework for multi-step ahead adaptive conformal heteroscedastic time series forecasting
Martim Sousa, Ana Maria Tomé, José Moreira
TL;DR
AEnbMIMOCQR presents a model-agnostic framework that delivers distribution-free, multi-step ahead prediction intervals for volatile time series by integrating adaptive conformal inference (ACI), ensemble batch conformalized quantile regression (EnbCQR), and a multi-input multi-output (MIMO) forecasting strategy. The method uses a sliding window of non-conformity scores and horizon-wise CP corrections to achieve near-nominal coverage across $H$ steps while adapting quickly to distribution shifts and heteroscedasticity, without data splitting. Empirical results on real (NN5) and synthetic datasets show improved coverage accuracy and shorter, input-dependent interval widths relative to EnbPI, EnbCQR, and ARIMA, with MIOU indicating better conditional nominal coverage in the synthetic setting. The work extends CP to multivariate, multi-step forecasting settings and provides open-source code, highlighting practical impact in domains with evolving data distributions and uncertain futures, such as finance, energy, and urban planning.
Abstract
This paper introduces a novel model-agnostic algorithm called adaptive ensemble batch multi-input multi-output conformalized quantile regression (AEnbMIMOCQR} that enables forecasters to generate multi-step ahead prediction intervals for a fixed pre-specified miscoverage rate in a distribution-free manner. Our method is grounded on conformal prediction principles, however, it does not require data splitting and provides close to exact coverage even when the data is not exchangeable. Moreover, the resulting prediction intervals, besides being empirically valid along the forecast horizon, do not neglect heteroscedasticity. AEnbMIMOCQR is designed to be robust to distribution shifts, which means that its prediction intervals remain reliable over an unlimited period of time, without entailing retraining or imposing unrealistic strict assumptions on the data-generating process. Through methodically experimentation, we demonstrate that our approach outperforms other competitive methods on both real-world and synthetic datasets. The code used in the experimental part and a tutorial on how to use AEnbMIMOCQR can be found at the following GitHub repository: https://github.com/Quilograma/AEnbMIMOCQR.
