Some variation of COBRA in sequential learning setup
Aryan Bhambu, Arabin Kumar Dey
TL;DR
This work extends the COBRA (Combined Regression Strategy) ensemble to a sequential, multivariate time-series setting by introducing Dynamic Proximity Ensemble (DPE) and Partition-Dynamic Proximity Ensemble (PaDPE). Both approaches build a frame-based representation of time-series data, train multiple machines, and combine predictions using consensus-based weights governed by a proximity threshold $\epsilon$ and a consensus parameter $\alpha$, with PaDPE partitioning the training data to enhance robustness. Bayesian optimisation (BOA) via Tree-based Parzen Estimators is employed to automatically tune hyperparameters, outperforming grid search across eight datasets that span cryptocurrency, stock indices, and short-term load forecasting. Empirical results show that DPE often achieves the best RMSE/MAPE and that BOA-driven configurations (especially in PaDPE) yield superior performance; Wilcoxon's tests confirm statistically significant differences among models. The study demonstrates strong potential for COBRA-based ensembles in dynamic, high-dimensional forecasting tasks and outlines directions for improved dynamic prediction and interval estimation.
Abstract
This research paper introduces innovative approaches for multivariate time series forecasting based on different variations of the combined regression strategy. We use specific data preprocessing techniques which makes a radical change in the behaviour of prediction. We compare the performance of the model based on two types of hyper-parameter tuning Bayesian optimisation (BO) and Usual Grid search. Our proposed methodologies outperform all state-of-the-art comparative models. We illustrate the methodologies through eight time series datasets from three categories: cryptocurrency, stock index, and short-term load forecasting.
