Comprehensive Modeling Approaches for Forecasting Bitcoin Transaction Fees: A Comparative Study
Jiangqin Ma, Erfan Mahmoudinia
TL;DR
This study tackles Bitcoin transaction fee forecasting over a 24-hour horizon using six approaches, including SARIMAX, Prophet, Time2Vec (with and without attention), TFT, and a SARIMAX-Gradient Boosting hybrid, on a 91-day, 11,809-record dataset with 23 features. Through expanding-window 5-fold cross-validation and a fixed 144-block test set, traditional statistical models (notably SARIMAX) outperform deep learning architectures, with SARIMAX achieving the best cross-validation and test metrics ($MAE$ and $RMSE$) and deep models suffering from data limitations. The hybrid model offers robustness by combining linear trend capture with non-linear adjustments, while smoothing effects in long-horizon forecasts provide practical guidance for fee-sensitive planning. The results highlight the current practical advantage of classical methods in this domain and establish a baseline for future enhancements involving hybridization, transfer learning, and interpretable deep architectures for cryptocurrency analytics.
Abstract
Transaction fee prediction in Bitcoin's ecosystem represents a crucial challenge affecting both user costs and miner revenue optimization. This study presents a systematic evaluation of six predictive models for forecasting Bitcoin transaction fees across a 24-hour horizon (144 blocks): SARIMAX, Prophet, Time2Vec, Time2Vec with Attention, a Hybrid model combining SARIMAX with Gradient Boosting, and the Temporal Fusion Transformer (TFT). Our approach integrates comprehensive feature engineering spanning mempool metrics, network parameters, and historical fee patterns to capture the multifaceted dynamics of fee behavior. Through rigorous 5-fold cross-validation and independent testing, our analysis reveals that traditional statistical approaches outperform more complex deep learning architectures. The SARIMAX model achieves superior accuracy on the independent test set, while Prophet demonstrates strong performance during cross-validation. Notably, sophisticated deep learning models like Time2Vec and TFT show comparatively lower predictive power despite their architectural complexity. This performance disparity likely stems from the relatively constrained training dataset of 91 days, suggesting that deep learning models may achieve enhanced results with extended historical data. These findings offer significant practical implications for cryptocurrency stakeholders, providing empirically-validated guidance for fee-sensitive decision making while illuminating critical considerations in model selection based on data constraints. The study establishes a foundation for advanced fee prediction while highlighting the current advantages of traditional statistical methods in this domain.
